Page MenuHomePhabricator

SQL metadata files problem
Closed, InvalidPublicBUG REPORT

Description

Posted on behalf of The Anome:

I see that the Wikipedia dumps are now being generated using new infrastructure, and the old dumps have fallen into disrepair. Unfortunately, this includes the production of the SQL metadata files that contain the page-to-category mappings needed to drive my bot, so The Anomebot2 is now on indefinite hiatus again until this gets fixed. There is simply no way I can handle the entire XML dumps, with their terabytes of text, and I shouldn't need to anyway if the developers get their act together. I imagine I'm not the only bot operator in this situation.

Can someone on the development team fix this?

The specific SQL files I use are enwiki-[date]-page.sql.gz and enwiki-[date]-categorylinks.sql.gz, as can be found for example here: https://dumps.wikimedia.org/enwiki/20260101/ "

More discussion at https://en.wikipedia.org/wiki/Wikipedia:Bots/Noticeboard#SQL_files_no_longer_being_generated

Event Timeline

The specific SQL files I use are enwiki-[date]-page.sql.gz and enwiki-[date]-categorylinks.sql.gz, as can be found for example here: https://dumps.wikimedia.org/enwiki/20260101/

Please note that both of these dumps continue to be generated and are not deprecated. The schedule did change: we used to generate them twice per month (on the 1st and the 20th) and now we generate them only once (on the 1st).