Page MenuHomePhabricator
Feed Search

Aug 15 2022

Darcyisverycute claimed T283466: topic overlap between Wikipedia language versions.

So to fill out the rest of the matrix I just need to work out a way to programmatically combine the queries into a table and run on a database dump, or run queries of the form in my presentation sequentially (possibly also with a database dump). The full matrix is ~170 language wikis across 250+ languages, so about 28900 queries to run in total if we wanted the full table. @Lydia_Pintscher do you have any advice on scaling up this approach?

Aug 15 2022, 10:55 AM · WMA-Hackathon-2025, Wikidata Integration in Wikimedia projects, Wikimania-Hackathon-2022, patch-welcome, Wikidata