Page MenuHomePhabricator

Extract a list of the 200 most viewed black historical figures from WDQS
Closed, ResolvedPublic

Description

As a memeber of the Brand Team, I would like to raise awareness around Black historical figures by adding portraits to pages that don't have them yet. I need a list of 200 most viewed pages about black historical figures that don't have an image depicting them. This data can probably be extracted from Wikidata Query Service.

  • English Wikipedia only for this request
  • "historical" == deceased

Details

Due Date
Feb 5 2021, 12:00 AM

Event Timeline

Restricted Application added a subscriber: Aklapper. · View Herald Transcript

cc @Miriam who might have some advice based on her work on the algorithm to find unillustrated articles.

HI @Gehel ! Are you looking at English Wikipedia only? Or all Wikis?
I can send you a list of QIDs for unillustrated enwiki pages which match the QIDs returned by a Wikidata query similar to this: https://w.wiki/v3P?

@Miriam, I'll check with Tas and post the response here

@Miriam this would be great indeed!
I think we can precise the query with (generalize the ethnicity and filter on Q5):

SELECT ?item WHERE {
  ?item wdt:P31 wd:Q5 ;
        wdt:P172/wdt:P279* wd:Q817393 .
}

from Tas:

  • deliver list by Fri, 5 Feb
  • only English Wikipedia for this task
MPhamWMF set Due Date to Feb 5 2021, 12:00 AM.

also @Miriam both queries still return wikipedia pages that do have images

@Mstyles at what granularity do you need pageview counts? We can use the webrequest or pageview_hourly tables from Hive if we want to access pageviews in the last 3 motnhs, or the pageview API for data aggregated monthly, e.g.: https://wikimedia.org/api/rest_v1/metrics/pageviews/per-article/en.wikipedia/all-access/all-agents/Serena_Williams/monthly/2015100100/2015103100

I joined the data from @dcausse 's query with our unillustrated article list for English Wikipedia, then queried the pageview API to get pageviews for December 2020, here you have the result: https://docs.google.com/spreadsheets/d/1FFCqwo0XsC6jJG3t7CGMhhOuzY9IQpNQTVyW8IbCNRM/edit?usp=sharing

Is this something similar to what you were looking for, @Mstyles, @Gehel @MPhamWMF ?

@Miriam. This ticket forgot to specify that "historical" here refers to individuals who are currently deceased. I will update it. Is it possible to also include the enwiki link in a column as well? Those two things aside, I think this looks pretty good to me. @CBogen ?

Hi @MPhamWMF, I modified the Wikidata query to include historical people only, and added the Wiki url - here is the result, let me know if this works! https://docs.google.com/spreadsheets/d/1FFCqwo0XsC6jJG3t7CGMhhOuzY9IQpNQTVyW8IbCNRM/edit?usp=sharing

Thanks, Miriam! This looks great! I'll send it on to Tas for final approval (can assume everything is done on your end unless I post otherwise here).

Gehel claimed this task.