Data Platform Engineering Bug Report or Data Problem Form.
Please fill out the following
Please ensure you set priority
What kind of problem are you reporting?
- Access related problem
- Service related problem
- Data related problem
For a data related problem:
- Is this a data quality issue? Yes
- What datasets and/or dashboards are affected? wmf.unique_devices_per_domain_monthly
- What are the observed vs expected results?
Observed results: I ran the following query:
SELECT SUM(uniques_estimate) AS monthly_uniques, year, month, domain
FROM wmf.unique_devices_per_domain_monthly
WHERE year = 2025
GROUP BY year, month, domain
In the results, I found that some values in the domain field have a trailing dot (".org."). This produces duplicate rows for the same domain, but with different values. See screenshots below. Some of the trailing-dot domains have no associated data (or rather, they have unique devices = 0), but others do have non-zero values (see, e.g., the en.m.wikipedia screenshot below).
Expected results: We would expect a single row of data per domain (per month & year).
It would be great if (1) the pipeline were fixed to normalize the trailing dot, and (2) the existing data were corrected to merge the duplicate entries.
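To illustrate the two requests above, here is a minimal sketch of what I have in mind. It is not the actual pipeline code: the helper names (normalize_domain, merge_rows) and the sample numbers are hypothetical, and it assumes that merging means summing uniques_estimate across the dotted and undotted variants. That assumption may overcount, since the same device could have been counted under both hostnames, so the team may prefer to recompute the affected months instead.

```python
from collections import defaultdict


def normalize_domain(domain: str) -> str:
    """Strip trailing dots so 'en.m.wikipedia.org.' matches 'en.m.wikipedia.org'."""
    return domain.rstrip(".")


def merge_rows(rows):
    """Merge (year, month, domain, uniques_estimate) rows on the normalized domain.

    Assumption: estimates for the dotted and undotted variants are summed.
    """
    merged = defaultdict(int)
    for year, month, domain, uniques in rows:
        merged[(year, month, normalize_domain(domain))] += uniques
    return dict(merged)


# Made-up sample rows, for illustration only (not real values from the table).
sample_rows = [
    (2025, 1, "en.m.wikipedia.org", 100),
    (2025, 1, "en.m.wikipedia.org.", 5),
    (2025, 1, "de.wikipedia.org.", 0),
]

for key, total in sorted(merge_rows(sample_rows).items()):
    print(key, total)
```

With this approach each (year, month, domain) combination ends up as a single row, which is the expected result described above.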


