The Hashtags tool has a database of edit data for edits which use a #hashtag in the edit summary. When building the tool I didn't properly consider the difference between a recentchanges ID and a revision ID, and recorded the recentchanges ID.
I now have a database of 700,000 edits, for which I've stored username, page title, edit summary, timestamp, and recentchanges ID. I'd like to update this so that each edit has the infinitely more useful revision ID stored too. Work on this has been completed for edits going forward, but I still have all this data that needs to have revision IDs attached retroactively.
It would be relatively straightforward to do 700,000 individual database queries on the Wiki replicas to match edit metadata and retrieve a revision ID, but I'm not sure if this is excessive usage of the databases (by my estimates this would take approximately 20 hours to run). An additional complication is that for some % of edits in my testing, the timestamp in the revision table appeared to be 1 second off what it was in the recentchanges table, though I haven't explored that fully. If individual queries per-edit isn't feasible, I'm not sure what the best way to go about doing this matching is. Advice appreciated.