Page MenuHomePhabricator

Metrics: Get the quality score changes per month for at least 2019 and 2020 in a sankey diagram based on the new model
Closed, ResolvedPublic

Description

We want to get the quality score changes per month for at least 2019 and 2020 in a sankey diagram based on the new model’s quality judgements so we can get a better understanding of how data quality evolved over the last year and a half. Specifically we want to see in this diagram how many articles moved from one quality class to another.

Acceptance criteria:

  • we have a sankey diagram for at least 2019 and 2020

Event Timeline

Restricted Application added a subscriber: Aklapper. · View Herald Transcript

Current status: blocked on script run finishing to get the individual scores

So the script is finished and I could build these:

The file names have the the time span of when to when. It's start of Q to Q and in one case, it's Jan 2019 to Jan 2020.

jan2019 to jan 2020.png (520×848 px, 83 KB)

jan2020 to apr 2020.png (520×848 px, 63 KB)

apr2020 to aug 2020.png (520×848 px, 64 KB)

apr2019 to july 2019.png (520×848 px, 57 KB)

oct2019 to jan 2020.png (520×848 px, 62 KB)