To get edit data flowing into the Data Lake, we decided in T202348 to create a new schema, Schema:EditAttemptStep, which is an evolution of the Edit schema. The property names have been updated to make them Data Lake-compatible, and two properties were added to facilitate joining with Schema:ReadingDepth.
The following properties were renamed between the old and the new schema:
Old name | New name |
---|---|
isOversample | is_oversample |
action.init.type | init_type |
action.init.mechanism | init_mechanism |
action.init.timing | init_timing |
action.ready.timing | ready_timing |
action.loaded.timing | loaded_timing |
action.saveIntent.timing | save_intent_timing |
action.saveAttempt.timing | save_attempt_timing |
action.saveSuccess.timing | save_success_timing |
action.saveFailure.type | save_failure_type |
action.saveFailure.message | save_failure_message |
action.saveFailure.timing | save_failure_timing |
action.abort.type | abort_type |
action.abort.mechanism | abort_mechanism |
action.abort.timing | abort_timing |
editor | editor_interface |
mediawiki.version | mw_version |
page.id | page_id |
page.title | page_title |
page.ns | page_ns |
page.revid | revision_id |
editingSessionId | editing_session_id |
user.id | user_id |
user.editCount | user_editcount |
user.class | user_class |
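
As a rough illustration of the renames in the table above, the mapping can be written as a lookup applied to an event's keys. This is only a sketch: it assumes an old event is available as a flat dictionary keyed by the old property names exactly as listed, and that any property not in the table keeps its name. `RENAMES` and `to_edit_attempt_step` are hypothetical names, not part of any existing extension.

```python
# Old Edit property name -> new EditAttemptStep property name,
# copied from the rename table in this task.
RENAMES = {
    "isOversample": "is_oversample",
    "action.init.type": "init_type",
    "action.init.mechanism": "init_mechanism",
    "action.init.timing": "init_timing",
    "action.ready.timing": "ready_timing",
    "action.loaded.timing": "loaded_timing",
    "action.saveIntent.timing": "save_intent_timing",
    "action.saveAttempt.timing": "save_attempt_timing",
    "action.saveSuccess.timing": "save_success_timing",
    "action.saveFailure.type": "save_failure_type",
    "action.saveFailure.message": "save_failure_message",
    "action.saveFailure.timing": "save_failure_timing",
    "action.abort.type": "abort_type",
    "action.abort.mechanism": "abort_mechanism",
    "action.abort.timing": "abort_timing",
    "editor": "editor_interface",
    "mediawiki.version": "mw_version",
    "page.id": "page_id",
    "page.title": "page_title",
    "page.ns": "page_ns",
    "page.revid": "revision_id",
    "editingSessionId": "editing_session_id",
    "user.id": "user_id",
    "user.editCount": "user_editcount",
    "user.class": "user_class",
}


def to_edit_attempt_step(old_event: dict) -> dict:
    """Return a copy of old_event with renamed keys.

    Assumption: keys that do not appear in RENAMES are passed through unchanged.
    """
    return {RENAMES.get(key, key): value for key, value in old_event.items()}
```

Note that `page.revid` and `editor` do not follow the mechanical dots-to-underscores pattern (they become `revision_id` and `editor_interface`), which is why an explicit table is safer than a string transformation.
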
The following two properties have been added in the new schema:
- page_token
- session_token
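
A minimal sketch of attaching the two added properties to an event follows. This task does not say how the token values are obtained, so they are plain parameters here, and `add_join_tokens` is a hypothetical helper name.

```python
def add_join_tokens(event: dict, page_token: str, session_token: str) -> dict:
    """Return a copy of event carrying the two properties added in EditAttemptStep.

    Assumption: the caller supplies both token values; their source is not
    specified in this task.
    """
    return {**event, "page_token": page_token, "session_token": session_token}
```

Returning a copy rather than mutating the input keeps the original event usable if the same data still has to be logged elsewhere during the migration.
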
The call sites that previously logged using the Edit schema will need to be updated to log using the new schema, hence this task.