Page MenuHomePhabricator

Tool should save revision ID instead of recentchanges ID
Closed, ResolvedPublic

Description

Due to an oversight on my part the tool is currently saving recentchanges IDs to the database instead of revision IDs. These are much less useful because the recentchanges table only stores ~30 days of data.

We can re-import revision IDs pre-August 2018 from the old database, but we'll have a gap between then and 30 days from when this task is completed unless we go back to do a one-off task to fill in the gaps. This might be worthwhile anyway to retrieve missing data between August and September 2018 when the tool was down.

Event Timeline

Samwalton9-WMF created this task.

Making good progress on this at https://github.com/Samwalton9/hashtags/tree/db-rev-ids. I think the code part is done, and now I just need to import/merge the old data.

Code changes are now in the repo (https://github.com/Samwalton9/hashtags/pull/8/). Still working on database updates.

All rev_ids from the last database backup from the old tool are now imported. Now working on filling in the gaps from the revision table.

This is now the tool behaviour. Filling in gaps is a separate task.