Page MenuHomePhabricator

Update 2020 data about top articles for Wikipedia's 20th birthday
Closed, ResolvedPublic

Description

Request from Communications department via @hdothiduc.

  • Update full year data for 2020 about top articles.
  • Remove known false positives.

Event Timeline

cchen triaged this task as Medium priority.Jan 25 2021, 8:34 AM
cchen moved this task from Triage to Current Quarter on the Product-Analytics board.

Great, thank you very much Connie!
@EdErhart-WMF, from the data you see on the data page right now, is only 2017's most-viewed English Wikipedia article problematic in your view?

I really don't have any expertise regarding removing known false positives, I trust you both fully on this!

I believe so for the English-language articles!

Thank you Ed.
So then I would use the article with the second highest pageviews (incl redirect) according to your data, Connie. That would be "Donald Trump" and we probably would need to amend the text explaining the caveats at the bottom of the data page - what do you think @cchen @EdErhart-WMF

@cchen @EdErhart-WMF

You can find the new data for the top viewed and top edited articles here.
As far I can see, it's only the top edited article in AR Wikipedia that is different.
I will check this ranking with the translators - see if they have any problems with the articles or know of any false positives.

@cchen, can you check the numbers? Looking at the total views for the Chinese article seems kind of weird - way to high?

@EdErhart-WMF I want to ask you, can you help me with the best phrasing to put at the end of the page stating why top viewed article for 2017 is Donald Trump and not Darth Vader. I don't want to say anything wrong or that can be interpreted in a wrong way.

@hdothiduc I would say something like "After this data was published, we changed the English Wikipedia's most-viewed article of 2017 from Darth Vader to Donald Trump after the former was identified as a false positive," and link false positive to this: https://pageviews.toolforge.org/topviews/faq/#false_positive

@hdothiduc thanks for pointing this out. the pageviews for #1 article in Chinese Wikipedia has one more digit by mistake. I've updated the number in the file.