Data Platform Engineering Bug Report or Data Problem Form.
Please fill out the following
Please ensure you set priority
What kind of problem are you reporting?
- Access related problem
- Service related problem
- Data related problem
For a data related problem:
- Is this a data quality issue?
- What datasets and/or dashboards are affected?
- Wikistats Contributing Metrics, "Active editors by country"
- What are the observed vs expected results? Please include information such as the location of the data, any initial assessments, SQL statements, and screenshots.
- Expected results: clicking on "Active editors by country" will return counts of registered editors with 5 or more edits in a month, per existing definitions (https://meta.wikimedia.org/wiki/Research:Wikistats_metrics/Active_editors, https://meta.wikimedia.org/wiki/Research:Active_editor)
- Observed results: clicking on "Active editors by country" returns counts of registered AND IP editors with 5 or more edits in a month. This recently resulted in confusion, with a staff member asking in Slack:
"Looking at Jan 2024, why is the total number of active editors for enwiki 41k here, but when looking at country-level, summed, it's ~62k, per adding the total number for editors with 5-99 edits (56k) with the total number for editors with 100+ edits (6k)?"
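The size of the discrepancy in the question above can be made explicit with a quick arithmetic check. This is a minimal sketch that uses only the rounded figures quoted in the Slack message; the variable and function names are illustrative, and reading the gap as the IP-editor contribution is the hypothesis of this report, not a verified result.

```python
# Rounded figures quoted in the Slack message (enwiki, Jan 2024).
total_active_editors = 41_000      # top-level "Active editors" total
country_5_to_99_edits = 56_000     # country-level sum, 5-99 edits bucket
country_100_plus_edits = 6_000     # country-level sum, 100+ edits bucket


def country_level_gap(total, *buckets):
    """Return (sum of the buckets, excess of that sum over the total)."""
    summed = sum(buckets)
    return summed, summed - total


# Hypothetical helper call: compare the summed country-level buckets
# against the top-level total.
summed, gap = country_level_gap(
    total_active_editors, country_5_to_99_edits, country_100_plus_edits
)
print(f"country-level sum: {summed}, excess over total: {gap}")
```

With these rounded inputs the country-level sum exceeds the top-level total by roughly 21k, which is the amount the report suspects comes from counting IP editors.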
For the DE Team to fill out
Which systems does this affect?
- Hive
- Druid
- Superset
- Turnilo
- WikiDumps
- Wikistats
- Airflow
- HDFS
- Gobblin
- Sqoop
- Dashiki
- DataHub
- Spark
- Jupyter
- Modern Event Platform
- Event Logging
- Other
Impact Assessment:
Does this problem qualify as an incident?
- Yes
- No
Does this violate an SLO?
- Yes
- No
| Value Calculator | Rank |
|---|---|
| Will this improve the efficiency of a team's workflow? | 1-3 |
| Does this have an effect on our Core Metrics? | 1-3 |
| Does this align with our strategic goals? | 1-3 |
| Is this a blocker for another team? | 1-3 |