
Incorrect number of content pages on stats.wikimedia.org
Closed, Resolved · Public · BUG REPORT

Description

Steps to replicate the issue (include links if applicable):

What happens?:
Wikimedia Statistics (2nd step) shows far more content pages, by hundreds of thousands or even millions, than the number of articles/content pages on Meta-Wiki/Special:Statistics (1st step).

What should have happened instead?:
Numbers from both sources should nearly match (small differences are expected because Wikistats is updated monthly): 6.76 million articles on enwiki, 1.9 million on eswiki, 741k on cawiki, 564k on fiwiki, etc.
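The expected behaviour can be sketched as a small check. This is only an illustration, not part of Wikistats: the helper names and the 2% tolerance are assumptions, and the Wikistats figures in the usage lines are hypothetical placeholders. The article count is read from the standard MediaWiki `siteinfo` statistics response, which is the same figure shown on Special:Statistics.

```python
from urllib.parse import urlencode


def siteinfo_statistics_url(domain: str) -> str:
    """Build the MediaWiki API URL that returns a wiki's site statistics."""
    params = {
        "action": "query",
        "meta": "siteinfo",
        "siprop": "statistics",
        "format": "json",
    }
    return f"https://{domain}/w/api.php?{urlencode(params)}"


def article_count(siteinfo_response: dict) -> int:
    """Extract the content page (article) count from a parsed siteinfo response."""
    return int(siteinfo_response["query"]["statistics"]["articles"])


def counts_roughly_match(special_statistics: int, wikistats: int,
                         tolerance: float = 0.02) -> bool:
    """Return True if the two counts differ by at most `tolerance` (relative).

    The 2% default is an assumed allowance for the monthly Wikistats update lag.
    """
    if special_statistics <= 0:
        raise ValueError("special_statistics must be positive")
    return abs(wikistats - special_statistics) / special_statistics <= tolerance


# Hypothetical Wikistats values, for illustration only.
print(counts_roughly_match(6_760_000, 6_800_000))  # small lag: counts agree
print(counts_roughly_match(741_000, 1_500_000))    # far too many pages: mismatch
```

A wiki for which `counts_roughly_match` returns `False` would show the symptom described in this report.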

Software version (skip for WMF-hosted wikis like Wikipedia):

Other information (browser name/version, screenshots, etc.):
Probably the same bug as T354074, reported for the Latvian Wikipedia, but it affects multiple Wikipedias: at least the first 50, except perhaps the Egyptian Arabic and Chechen Wikipedias.

Event Timeline

Restricted Application added a subscriber: Aklapper.

Hi,

You are right. The bug we fixed for the Latvian wiki is not specific to that wiki. It is a bug in how we were filtering the results of every API request, regardless of the requested wiki project; it just happened to be reported for that specific case. Your bug should be fixed when we deploy the current fix to production.

By the way, the fix for the Latvian wiki bug has just been deployed to the production environment. From now on you should see correct results for any wiki. If that's not the case, please ping us.

Change 989094 had a related patch set uploaded (by Santiago Faci; author: Santiago Faci):

[generated-data-platform/aqs/edit-analytics@main] Incorrect number of content pages on stats.wikimedia.org

https://gerrit.wikimedia.org/r/989094

Yes, it appears to be fixed now. Thanks.

We are still working on a secondary bug related to the first one. You can follow it in the related ticket T354074: Wikistats - incorrect number of content articles for Latvian Wikipedia.
The new patch is done and running in the staging environment, and we are reviewing the new results there to confirm that everything is fine before deploying to production.
I'll also keep you posted here as we move forward.

In the end, the fix for the second bug is not needed because the data shown in Wikistats was already correct. Wikistats shouldn't include redirects in any case when showing results for this count.
We can consider this ticket as resolved.
Thank you!

Sfaci reopened this task as Open.
Sfaci triaged this task as High priority.