Background/Goal
Current Situation:
Currently, we do not have a way to associate new editors with geo-location. The reason being is that fingerprint-based queries are time-limited because of our privacy policy. However, we’ve built aggregate tables that do not link usernames with geolocation such as:
wmf.geoeditors_monthly
geoeditors_edits_monthly
unique_editors_by_country_monthly
Purpose:
It would be useful to create a similarly aggregated tables that displays a count of new editors and returning editors (editors who have done more than 4 edits in the month they registered in) by geolocation and wiki. The ideal would be two separate tables which contain aggregated counts of new active editors and returning active editors The benefit would be to granulate our data and improve analysis by being able to link new editors with geo-location to identify any regional trends that could explain changes in our editor counts (ex. T351759). There have been substantial YoY drops in new active editors since October 2023 and it has become difficult to perform geo analysis on possible causes for these drops without last year’s data. Previous work has been done here by data engineering and GDI: https://codeshare.io/1YDQzj that was not productionized. Updated sample code is contained in the design specs google sheet document linked below.
Ideal Delivery Date: June 2024
Stakeholders: Foundation-level Metrics committee, Regional Learning Sessions, Foundation and Movement leadership
KR/Hypothesis(Initiative)
Success metrics
- How we will measure success
Example areas:
- Deadlines
- User satisfaction
- Performance
- Accessibility
- Maintenance
- Movement impact
- Scalability
- Data Quality
- Integration
- Compliance
In scope
- known scope
Out of Scope
- known boundaries
Artifacts & Resources
Link to diagrams
Link to specifications, architecture and design docs
https://docs.google.com/document/d/1d_EXAiuSjviidNVru0XNhlg6BhEBWA9Wg79l1BTTlNo/edit
Link to product one pagers