
Data for audit report
Closed, Resolved (Public)

Description

Per Tony

  • For the year ended June 30, 2015, the educational content of the Foundation’s largest project, Wikipedia, grew by approximately ___ million articles to more than ___ million articles total.
  • For the year ended June 30, 2015, volunteers added approximately ___ million images, movies, and sound files to the Foundation’s multimedia repository, making the total ____ million files.
  • Volunteers also contribute in several ways to the Foundation’s wiki software: volunteer software developers add new functionality to the code base, and volunteer language specialists add to the code base by translating the wiki interface into different languages. By the year ended June 30, 2015, the source code hosted in the Foundation’s version control repository contained approximately ____ million lines of code through the effort of approximately 2,278 contributors, in which approximately 862 are active contributors.

Event Timeline

DarTar assigned this task to ezachte.
DarTar raised the priority of this task to High.
DarTar updated the task description.
DarTar moved this task to Time Sensitive on the Research board.
DarTar added a subscriber: Qgil.

About software developers:

https://www.openhub.net/orgs/wikimedia is not showing lines of code anymore, and I don't know how to get this information. I'm not sure whether it is relevant either.

I don't have data about June 30, but as of today, we count

3,807 code contributors in total
781 active in the past 365 days

In order to compensate for the lack of metrics about LOC, we know that as of today we have 470,790 commits merged, from which 123,592 were accepted in the past 365 days.

All this according to http://korma.wmflabs.org/browser/
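As a quick cross-check of the figures above, the share of contributors and commits that fall within the past 365 days can be worked out directly. This is only a sketch: the variable names and the `share` helper are made up for illustration, and the inputs are the numbers quoted in this comment (as of the day of writing, not June 30).

```python
# Figures quoted in the comment above (as of the day of writing, not June 30).
total_contributors = 3807
active_contributors = 781   # active in the past 365 days
total_commits = 470790
recent_commits = 123592     # accepted in the past 365 days


def share(part: int, whole: int) -> float:
    """Return part/whole as a percentage, rounded to one decimal place."""
    return round(100 * part / whole, 1)


# 'share' is a hypothetical helper for this illustration, not part of any tool.
active_share = share(active_contributors, total_contributors)
recent_commit_share = share(recent_commits, total_commits)

print(f"Active contributors: {active_share}% of all contributors")
print(f"Commits in the past 365 days: {recent_commit_share}% of all commits")
```

Roughly one in five contributors was active in the past year, and roughly a quarter of all merged commits date from that period.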

@ezachte asks:

Question to Quim:
In 2013 I made a quick script to mine data for all 81 mediawiki projects on ohloh/openhub via screenscraping (see attachment for project names).
For each I can request https://www.openhub.net/p/mediawiki/analyses/latest/code_history and find data for June 2014/2015
The file format has changed since 2013. I can update the script, but first I want to check whether aggregating across all projects makes sense to you. (There may be noise and overlap, and of course we can't add up each project's contributors.)

OpenHub's list of projects under the Wikimedia organization has two important problems: it is out of date (big problem) and it includes projects not hosted on Wikimedia servers (minor problem). I wonder whether http://korma.wmflabs.org/browser/data_sources.html would serve you better.

For the first question, feel free to reuse the method and numbers for the WMF quarterly report from T106502#1523010 :

(As remarked earlier at T97344#1289201 , the Big Article Recount of spring 2015 should not have affected these global numbers too much.)

Thanks @Qgil

I wonder whether http://korma.wmflabs.org/browser/data_sources.html would serve you better.

I took a quick glance, and can see similarities, but the overall setup is different enough to make a quick patch to the existing script not viable. Making a new script is probably overkill for just this demand anyway (and too short notice). So I'd rather stick with your proposal to reword the statement "today we have 470,790 commits merged, from which 123,592 were accepted in the past 365 days."

Does that make sense to others?

@Tbayer thanks, last week we came to similar numbers by mail for 1) via the report card (not identical, as is often the case, due to different measurement points). I think the difference is minor in this context.

So to sum it up, I propose these numbers:

  1. For the year ended June 30, 2015, the educational content of the Foundation’s largest project, Wikipedia, grew by approximately 3.3 million articles to more than 35.5 million articles total.
  2. For the year ended June 30, 2015, volunteers added approximately 5.2 million images, movies, and sound files to the Foundation’s multimedia repository, making the total 27.1 million files.
  3. Volunteers also contribute in several ways to the Foundation’s wiki software: volunteer software developers add new functionality to the code base, and volunteer language specialists add to the code base by translating the wiki interface into different languages. As of last week we had 470,790 commits merged, from which 123,592 were accepted in the past 365 days, through the effort of approximately 3,807 contributors, in which approximately 781 are active contributors.
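For anyone who wants to sanity-check the first two statements, the implied totals at the start of the year and the relative growth follow from simple subtraction. This is a rough sketch only: the function names are made up for illustration, and it assumes the "added" figures are net additions, which the thread does not confirm for the multimedia files.

```python
def prior_total(total_millions: float, added_millions: float) -> float:
    """Implied total (in millions) at the start of the year."""
    return round(total_millions - added_millions, 1)


def growth_pct(total_millions: float, added_millions: float) -> float:
    """Growth over the year, as a percentage of the implied starting total."""
    return round(100 * added_millions / (total_millions - added_millions), 1)


# Proposed numbers from the list above, in millions.
articles_total, articles_added = 35.5, 3.3
files_total, files_added = 27.1, 5.2

# Assumption: additions are treated as net growth (deletions are ignored).
articles_prior = prior_total(articles_total, articles_added)
files_prior = prior_total(files_total, files_added)
articles_growth = growth_pct(articles_total, articles_added)
files_growth = growth_pct(files_total, files_added)

print(f"Articles: {articles_prior}M at start of year, +{articles_growth}%")
print(f"Files: {files_prior}M at start of year, +{files_growth}%")
```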
DarTar subscribed.

@ezachte, I understand this is completed on our end, moving it to Done.

Is there any action pending here?

No, can be closed I think. Doing it.

ezachte set Security to None.