Page MenuHomePhabricator

An expert panel to produce recommendations on open data sharing for public good
Open, MediumPublic

Description

Over the past years, we've engaged in multiple conversations with researchers and chapters interested in the use of Wikimedia traffic data to conduct epidemiological/surveillance research for public good. While we recognize the importance of this body of research, and the role data WMF collects could play in advancing it (compared to other platforms), we haven't been able to identify a sustainable model tor sharing this data. The current approach (granting server access under NDAs to a small group of researchers, under our formal collaboration policy) doesn't scale technically and organizationally.

We decided we'll convene a panel of experts asking them to produce recommendations to the organization and the movement on possible models (including costs and risks) to enable the public sharing of data Wikimedia collects (in an aggregate form) that may help make significant progress on global public health issues, while protecting the privacy of our editors and readers.

Invitees

Confirmed invitees who expressed an interest in participating:

  • Daniela Paolotti and Ciro Cattuto (ISI Foundation)
  • Thomas Mollet (ECDC)

Other potential invitees

Other collaborators who worked on previous proposals include:

  • Shilad Sen
  • Reid Priedhorsky and Geoffrey Fairchild

Tentative timeline

Q1-FY19 (Jul - Sep 2018)

References

Related projects:

cc'ing @Nuria @VColeman @debt @JBennett for visibility

Event Timeline

DarTar triaged this task as Medium priority.Mar 9 2018, 10:58 PM
DarTar created this task.
DarTar updated the task description. (Show Details)

I think @Ijon is real interested on this too.

leila removed subscribers: Tbayer, DarTar, VColeman.

@Nuria I've removed the Research tag but myself and others from our team are subscribed to this task. If you pick this up again and need help, let us know.

@Nuria can you provide an update about the status of this task? (I'm getting more questions about it in Wikidata Con 2019;).

@leila sorry, but we reprioritized this task to be able to work in the three upcoming public datasets.

  1. geoeditors, editors stats per country: T131280: Make aggregate data on editors per country per wiki publicly available Probably out this coming week
  1. mediarequests, stats of file requests per project T210313: Statistics for views of individual Wikimedia images probably releasing this coming week
  1. mediawiki reconstruction history , releasing publicy this quarter

When 1/2/3 are finished we can further take a look at this work but given resource limitations we really cannot provide a timeline.

@leila sorry, but we reprioritized this task to be able to work in the three upcoming public datasets.

  1. geoeditors, editors stats per country: T131280: Make aggregate data on editors per country per wiki publicly available Probably out this coming week
  1. mediarequests, stats of file requests per project T210313: Statistics for views of individual Wikimedia images probably releasing this coming week
  1. mediawiki reconstruction history , releasing publicy this quarter

When 1/2/3 are finished we can further take a look at this work but given resource limitations we really cannot provide a timeline.

Has #1 and #2 been released?

Aklapper added a subscriber: Aklapper.

Resetting assignee (inactive account)