Add "haswbsitelink" to find items missing in a certain wiki
Open, Lowest, Public

Description

A sitelink is basically a specific kind of identifier (though not stored in statements), so we can imagine searching for them with something like haswbstatement:P31=Q484170 -haswbsitelink:jawiki in order to find communes of France that are missing from jawiki.

(probably haswbsitelink:enwiki=Test should also work, but this can already be achieved via Special:ItemByTitle.)
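
As a rough illustration of the request above, the following Python sketch composes the proposed search string and a matching MediaWiki search API URL. Note that `haswbsitelink` is only the keyword proposed in this task and is not known to exist; the helper names `build_search_query` and `build_search_url` are hypothetical, and nothing is actually sent over the network.

```python
from urllib.parse import urlencode

# Wikidata's MediaWiki API endpoint.
API_URL = "https://www.wikidata.org/w/api.php"


def build_search_query(statement, missing_site=None):
    """Compose a search string such as the one in the task description.

    `haswbstatement` is an existing search keyword; `haswbsitelink` is the
    keyword PROPOSED in this task and is assumed here for illustration only.
    """
    parts = [f"haswbstatement:{statement}"]
    if missing_site is not None:
        # A leading "-" negates the keyword, as in the task's own example.
        parts.append(f"-haswbsitelink:{missing_site}")
    return " ".join(parts)


def build_search_url(statement, missing_site=None, limit=10):
    """Build (but do not send) a list=search API request URL."""
    params = {
        "action": "query",
        "list": "search",
        "srsearch": build_search_query(statement, missing_site),
        "srlimit": limit,
        "format": "json",
    }
    return f"{API_URL}?{urlencode(params)}"


if __name__ == "__main__":
    # Communes of France (P31=Q484170) without a jawiki sitelink.
    print(build_search_query("P31=Q484170", "jawiki"))
    print(build_search_url("P31=Q484170", "jawiki"))
```

Until such a keyword exists, a request built this way would not filter by sitelink; only the `haswbstatement` part reflects current search syntax.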

Event Timeline

Restricted Application added a subscriber: Aklapper. Jul 31 2018, 10:39 PM
Aklapper renamed this task from New feature haswbsitelink to Add "haswbsitelink" to find items missing in a certain wiki. Aug 1 2018, 6:21 AM
debt triaged this task as Lowest priority. Aug 2 2018, 5:22 PM
debt moved this task from needs triage to later on... on the Discovery-Search board.
debt added a subscriber: debt.

Sitelinks are a Wikidata property, and SPARQL queries might be able to do this.

> Sitelinks are a Wikidata property, and SPARQL queries might be able to do this.

This is not efficient at all. For example:

SELECT ?item {
  ?article schema:about ?item.
  ?article schema:inLanguage "en" .
  MINUS {?item rdfs:label ?label FILTER(lang(?label)="en")}
} LIMIT 1000

This query cannot run.

Another query that cannot run:

SELECT (COUNT(DISTINCT ?item) AS ?c) {
  ?item wdt:P1566 [] .
  ?page schema:about ?item .
  ?page schema:isPartOf <https://ceb.wikipedia.org/> .
}
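
For anyone who wants to reproduce this, here is a minimal Python sketch that rebuilds the count query above for an arbitrary property and wiki and wraps it in a Wikidata Query Service request URL. The helper names `build_count_query` and `build_request_url` are hypothetical, and the sketch only builds the request; it does nothing to make the query cheaper to run.

```python
from urllib.parse import urlencode

# Public SPARQL endpoint of the Wikidata Query Service.
WDQS_ENDPOINT = "https://query.wikidata.org/sparql"


def build_count_query(property_id, site_url):
    """Count items that have `property_id` and a sitelink on `site_url`."""
    return (
        "SELECT (COUNT(DISTINCT ?item) AS ?c) {\n"
        f"  ?item wdt:{property_id} [] .\n"
        "  ?page schema:about ?item .\n"
        f"  ?page schema:isPartOf <{site_url}> .\n"
        "}"
    )


def build_request_url(query):
    """Build (but do not send) a GET request URL asking for JSON results."""
    return f"{WDQS_ENDPOINT}?{urlencode({'query': query, 'format': 'json'})}"


if __name__ == "__main__":
    # Same parameters as the query above: P1566 and ceb.wikipedia.org.
    query = build_count_query("P1566", "https://ceb.wikipedia.org/")
    print(query)
    print(build_request_url(query))
```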