Page MenuHomePhabricator

categorylinks not updating on en.wikipedia.org?
Closed, ResolvedPublic

Description

https://en.wikipedia.org/w/api.php?action=query&prop=categories&titles=Category:Duchy+of+Parma says:

{
    "batchcomplete": "",
    "query": {
        "pages": {
            "49605012": {
                "pageid": 49605012,
                "ns": 14,
                "title": "Category:Duchy of Parma"
            }
        }
    }
}

But according to https://en.wikipedia.org/wiki/Category:Duchy_of_Parma, this category is a member of many categories (e.g., "Former countries on the Italian Peninsula").

Compare with a working example: https://en.wikipedia.org/w/api.php?action=query&prop=categories&titles=Category:Tourism+in+Illinois and https://en.wikipedia.org/wiki/Category:Tourism_in_Illinois. Both of these pages show three categories that "Category:Tourism in Illinois" is a member of.

It feels like categorylinks may not be getting updated? Job queue issue, maybe?

Event Timeline

MariaDB [enwiki_p]> select cl_to from page join categorylinks on cl_from = page_id where page_title = 'Tourism_in_Illinois' and page_namespace = 14;
+------------------------------------------------------+
| cl_to                                                |
+------------------------------------------------------+
| Commons_category_with_local_link_same_as_on_Wikidata |
| Economy_of_Illinois                                  |
| Tourism_in_the_United_States_by_state                |
+------------------------------------------------------+
3 rows in set (0.00 sec)
MariaDB [enwiki_p]> select cl_to from page join categorylinks on cl_from = page_id where page_title = 'Duchy_of_Parma' and page_namespace = 14;
Empty set (0.02 sec)

Confirmed not a labsdb issue:

mysql:wikiadmin@db1052 [enwiki]> select cl_to from page join categorylinks on cl_from = page_id where page_title = 'Duchy_of_Parma' and page_namespace = 14;
Empty set (0.00 sec)

@elukey: I'm told you may have been doing Redis maintenance recently?

Anomie subscribed.

This has nothing to do with the API, the API is just reporting what's in the database.

P2708 is a database query and its results showing uncategorized categories, according to Tool Labs' enwiki_p categorylinks around March 2, 2016. Some of these categories are truly uncategorized, but many are false positives due to missing rows in the categorylinks table.