Product team want to add an editors monthly table in Druid to use in Superset/Turnilo with the following dimensions. The monthly aggregations are compiled from editors_daily dataset.
column | data type | description | notes |
wiki_db | string | The wiki database the editors worked in | |
country_code | string | The 2-letter ISO country code this group of editors geolocated to, including Unknown (--) | |
users_are_anonymous | boolean | Whether or not this group of editors edited anonymously | |
platform(s) | string | Access method (iOS, Android, Mobile web, Desktop) | |
user_tenure_bucket | string | Bucketed time between user creation and the first edits for the given month | possible values: (Under 1 day, 1 to 7 days, 7 to 30 days, ..., Over 10 years, Undefined). |
user_tenure_type | string | editor registed in current month then "new" else "returning", NULL when users_are_anonymous | possible values: new, returning, NULL |
activity_level | string | How many edits this group of editors performed | possible values: (1-4, 5-99, 100-999, 1000-9999, 10000+) |
distinct_editors | bigint | Number of editors meeting this activity level | |
namespace_zero_distinct_editors | bigint | Number of editors meeting this activity level, with only namespace zero edits | |
talkpage_distinct_editors | string | Number of editors meeting this activity level, with only talk page edits | |
userpage_distinct_editors | string | Number of editors meeting this activity level, with only user page edits | |
month | string | [partition] The month in YYYY-MM format | |