Page MenuHomePhabricator

Categories returned don't match
Closed, DuplicatePublic

Description

The pywikibot-core tests testing the categories of Albert Einstein on the English Wikipedia are failing recently:

Now when looking at the call [[https://en.wikipedia.org/w/api.php?action=query&prop=categories&titles=Albert%20Einstein&clprop=hidden|action=query&prop=categories&titles=Albert Einstein&clprop=hidden]] it returns just 6 hidden categories. Just for clarification it's clprop not clshow and a similar call to [[https://en.wikipedia.org/w/api.php?action=query&prop=categories&titles=The%20Big%20Bang%20Theory&clprop=hidden|action=query&prop=categories&titles=The Big Bang Theory&clprop=hidden]] returns also normal (not hidden) categories.

{
    "warnings": {
        "query": {
            "*": "Formatting of continuation data will be changing soon. To continue using the current formatting, use the 'rawcontinue' parameter. To begin using the new format, pass an empty string for 'continue' in the initial query."
        }
    },
    "query": {
        "pages": {
            "736": {
                "pageid": 736,
                "ns": 0,
                "title": "Albert Einstein",
                "categories": [
                    {
                        "ns": 14,
                        "title": "Category:All articles lacking reliable references",
                        "hidden": ""
                    },
                    {
                        "ns": 14,
                        "title": "Category:Articles lacking reliable references from March 2015",
                        "hidden": ""
                    },
                    {
                        "ns": 14,
                        "title": "Category:CS1 maint: Unrecognized language",
                        "hidden": ""
                    },
                    {
                        "ns": 14,
                        "title": "Category:Good articles",
                        "hidden": ""
                    },
                    {
                        "ns": 14,
                        "title": "Category:Wikipedia indefinitely move-protected pages",
                        "hidden": ""
                    },
                    {
                        "ns": 14,
                        "title": "Category:Wikipedia indefinitely semi-protected pages",
                        "hidden": ""
                    }
                ]
            }
        }
    }
}

I tried purging it and only a few hours ago ClueBot NG reverted a vandalism edit. That vandalism also removed all normal categories so it might be connected.

Event Timeline

XZise raised the priority of this task from to Needs Triage.
XZise updated the task description. (Show Details)
XZise subscribed.
Restricted Application added subscribers: Aklapper, Unknown Object (MLST). · View Herald TranscriptMar 28 2015, 2:29 PM

Hmm seems to be just a caching issue. I now get all categories, but shouldn't have action=purge fixed that?

This reminds me of a long-standing bug (which I can't find at the moment to link to) where the links tables either somehow don't get updated after a revert or somehow the updated entries get overwritten by those from an older revision. I've tried to reproduce it at various times to try to track it down, but I've never been able to.

A purge won't fix it since that doesn't refresh links tables, but a null edit or an API action=purge&forcelinkupdate=1 will.

This reminds me of a long-standing bug (which I can't find at the moment to link to) where the links tables either somehow don't get updated after a revert or somehow the updated entries get overwritten by those from an older revision. I've tried to reproduce it at various times to try to track it down, but I've never been able to.

T19154