Page MenuHomePhabricator

Andre to provide "New Authors" 2017 data from Sept, Oct and Nov to Srishti soon
Closed, ResolvedPublic

Description

Data needed for conducting the survey which is part of the New Developers Quarterly Report > https://www.mediawiki.org/wiki/User:SSethi_(WMF)/Quarterly/2018-01

Event Timeline

Data on https://wikimedia.biterg.io/app/kibana#/dashboard/C_Gerrit_Demo is updated now.
See https://wikimedia.biterg.io/goto/5beebba001197edd0f95224b1d96afde
Currently importing the latest DB dump into a local DB.
SQL query to run afterwards: SELECT email FROM identities WHERE source = "gerrit" AND (name = "..." OR name = "...")
(Felt slightly easier/faster than manually searching for all those email addresses in the JSON file itself.)

Sent data via email.

Question:

  • Why in the link above, we see 51 independent contributors, but you provided me emails of total 42 contributors? Also, in those 42 emails, there were a few duplicates, so eliminating that, I got 36 in return. Not sure why.

Question:

  • Why in the link above, we see 51 independent contributors, but you provided me emails of total 42 contributors?

Because T157898 is not fixed, so affilations are not up to date, so I manually removed non-volunteers.

Also, in those 42 emails, there were a few duplicates, so eliminating that, I got 36 in return. Not sure why.

Because mistakes happen. :)
Edit: Well, I should take this as helpful feedback to avoid that problem next time, instead of replying by such a short unhelpful line... so, thanks for bringing it up! It's because I only ran SELECT email FROM identities WHERE ... and did not use DISTINCT. Thanks.