Add more popular articles per country data to AQS
Open, High, Public

Description

Develop API endpoint with most popular articles per country.

  • Let's write a small design document for the API to make sure we have a sensible endpoint.
  • A top 50 per wiki is probably sufficient.
  • Should only include agent_type="user" (both "spider" and "bot" should be excluded)
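
As a rough illustration of the semantics described above, here is a minimal sketch and not the actual AQS implementation. It assumes pageview records arrive as dictionaries with hypothetical keys `country`, `article`, `agent_type`, and `views`. The function name and the grouping by country (taken from the task title) are also assumptions.

```python
from collections import Counter, defaultdict


def top_articles_per_country(records, limit=50):
    """Return the most-viewed articles per country, counting user traffic only.

    `records` is an iterable of dicts with the hypothetical keys
    "country", "article", "agent_type", and "views".
    """
    counts = defaultdict(Counter)
    for record in records:
        # Keep only agent_type == "user"; this drops "spider" and "bot" traffic.
        if record["agent_type"] != "user":
            continue
        counts[record["country"]][record["article"]] += record["views"]
    # most_common() sorts by view count, highest first, and truncates to `limit`.
    return {country: c.most_common(limit) for country, c in counts.items()}
```

A country whose traffic is entirely non-user never appears in the result, which may or may not be the desired behavior for the endpoint; that is the kind of question the design document could settle.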

Event Timeline

Nuria created this task. Sep 23 2020, 9:59 PM
Nuria updated the task description.

Hi @Nuria,
This is Megha Jain. I am new to Wikimedia, but I would love to contribute to the open source community.
I am proficient in SQL, Python, Perl, and data analytics, and have developed basic web apps in JS and PHP. I am currently a Data Science student exploring new things.
If this seems like a good first issue for a newcomer to understand the wiki datasets, could I look into it? I would be grateful for any guidance or pointers :)

Nuria added a comment. Sep 28 2020, 5:37 PM

@Meghajain171192 thanks for your interest. This ticket requires access to private data and our computation environment, which we cannot give to volunteers. However, we have an all-JS project that you might be interested in contributing to; see for example https://phabricator.wikimedia.org/T263973 and https://wikitech.wikimedia.org/wiki/Analytics/Systems/Wikistats_2

Thanks a ton for your reply, @Nuria :) I will look into "Wikistats Bug - easy to understand language for pageviews" and will also work on getting access to Gerrit!

AMuigai moved this task from Backlog to Watching on the Inuka-Team board. Oct 1 2020, 1:44 PM
Nuria added a comment. Oct 7 2020, 5:45 PM

@lexnasser As a reminder, here is the design document for the prior AQS endpoint: https://drive.google.com/drive/u/0/folders/1bcy6Iyb_bLwD1jcfjhL4vtKZvD-CN22L

Let's start on this task in the same way, by building a design document that specifies the query semantics for the API. The parent task can be used to get input from stakeholders if needed.

fdans triaged this task as High priority. Oct 8 2020, 5:24 PM
fdans moved this task from Incoming to Analytics Query Service on the Analytics board.