Page MenuHomePhabricator

Need a mechanism / process to purge invalid / stale entries from the Linter database
Open, MediumPublic


T165070, T162920, and T163549 are all examples where the underlying linter data are invalid and should be removed from the database. Separately, T148609 saw a revert of Linter on large wikis. Any existing linter data for those wikis will likely be stale / invalid when the linter is re-enabled on those wikis.

So, we need some mechanism of isolating those entries and clearing them from the db. Maybe this is simply someone running a database query to delete relevant entries. The filter is different in each case. It is not clear what a good solution is. But, filing this task to brainstorm some solutions / strategies.

One obvious harmless query would be to purge all entries for titles where the revision id for the linter data is no longer the most recent id for the title.

Event Timeline

ssastry triaged this task as High priority.Aug 6 2017, 3:09 PM

This came up again in and @Legoktm said we can write a maintenance script for this.

ssastry lowered the priority of this task from High to Medium.

Not currently working on this.

Two tasks address some of these stale linter data issues:
Delete lint error records when content model changes from wikitext
Delete lint errors when blank page saved... (the title of this ticket implies content model change also needed but only requirement is saving a blank page regardless of content model.

Part of this should be resolved by T165538 - gerrit 761070 when content model changes.

Clearing the linter table of orphaned "stale" records needs a maintenance script to be implemented.