
Remove Wikidata sitelinks to invalid sites
Closed, Resolved (Public)

Description

Due to T301247, it was possible to create sitelinks on Wikidata to any existing site, even one that isn't supported for sitelinks (for example incubatorwiki).

We should look for all such sitelinks and remove them, as they can cause problems for data re-users and editors (these sitelinks are not visible in the UI and can't easily be removed via the API). They can also cause further problems, like T284211, both in our software and for third parties using Wikidata's data.

Event Timeline

I wrote a one-off script for this and ran it on Toolforge:

hoo@tools-sgebastion-07:~$ php T301904.php 
item_id,ips_site_id
Q4847311,incubatorwiki
Q4654925,incubatorwiki
Q2018526,incubatorwiki
Q6173452,ruwikimedia
Q29579,incubatorwiki
Q47647413,ruwikimedia
Q61993734,ruwikimedia
Q87450764,ruwikimedia
Q5683132,incubatorwiki
Q2201314,incubatorwiki
Q148,incubatorwiki
Q11725,incubatorwiki
Q11736,incubatorwiki
Q52,incubatorwiki
Q5462387,incubatorwiki
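
For anyone working through the removals, the output above can be grouped by site to see how many items each invalid site affects. This is only a rough sketch in Python, not the one-off script itself (which is not shown in this task); the function name `group_items_by_site` and the constant `SCRIPT_OUTPUT` are made up for illustration, and the data is copied verbatim from the output above.

```python
import csv
import io
from collections import defaultdict

# Output of the one-off script, copied from the comment above.
SCRIPT_OUTPUT = """\
item_id,ips_site_id
Q4847311,incubatorwiki
Q4654925,incubatorwiki
Q2018526,incubatorwiki
Q6173452,ruwikimedia
Q29579,incubatorwiki
Q47647413,ruwikimedia
Q61993734,ruwikimedia
Q87450764,ruwikimedia
Q5683132,incubatorwiki
Q2201314,incubatorwiki
Q148,incubatorwiki
Q11725,incubatorwiki
Q11736,incubatorwiki
Q52,incubatorwiki
Q5462387,incubatorwiki
"""


def group_items_by_site(csv_text):
    """Map each site ID to the list of item IDs carrying a sitelink to it.

    Hypothetical helper: expects the two-column CSV printed above,
    with the header row item_id,ips_site_id.
    """
    items_by_site = defaultdict(list)
    for row in csv.DictReader(io.StringIO(csv_text)):
        items_by_site[row["ips_site_id"]].append(row["item_id"])
    return dict(items_by_site)


if __name__ == "__main__":
    grouped = group_items_by_site(SCRIPT_OUTPUT)
    for site_id, item_ids in sorted(grouped.items()):
        print(f"{site_id}: {len(item_ids)} items")
```

Grouping by site rather than by item keeps the cleanup list short: only two site IDs show up in the output, so each can be handled as one batch.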

Reopened, so we can track the removal of invalid sitelinks.

Manuel renamed this task from "Look for Wikidata sitelinks to invalid sites" to "Remove Wikidata sitelinks to invalid sites". Apr 19 2022, 2:28 PM

I believe I've removed them all now - https://w.wiki/58RE is no longer returning any results, at least.