Following the bugfix for T314120: The image suggestion data pipeline generates too many weighted_tags, we re-generated the search indices diffs (AKA deltas). Note that the bug seems to have affected Commons.
Tasks
- clean up the Commons index from tags produced by T314120
Given the analytics_platform_eng.image_suggestions_search_index_delta Hive table:
- ingest the 2022-07-11 snapshot
- ingest the 2022-07-18 snapshot
Summary
- bad Commons IDs as per P32103 look fixed
- image suggestions notifications can be sent out
- see reports in T314473#8133329 and T314473#8133342