As a developer working on autocomplete i would like MachineVision predictions for historical search results so i can use them in T250436
Search Platform would like to use MachineVision predictions as part of a process to work with user submitted queries and determine which are useful to propose to other users as query completionypes. In initial explorations we found it viable to bring this data together, but the data is not complete enough for our use case. While the MachineVision database for commonswiki contains results for 5.7M pages, the overlap between that set and the ~16M titles returned by search between jan 1 and feb 8th is only around 70k images.
6.5M pages to import by page_id, unsorted: https://analytics.wikimedia.org/published/datasets/one-off/ebernhardson/common_fulltext_search_page_ids.commonswiki.20210101-20210210.csv.gz
AC: MachineVision databases for commonswiki contain predictions for common search results