[Session] Presenting mwsql: A new Python library for working with Wikimedia SQL dumps
Closed, Resolved (Public)

Description

Username or display name: Slavina S (Slst2020)

Session type (select one):

  • Presentation (including Q/A) - 25 mins
  • Discussion (including Q/A) - 55 mins
  • Workshop (including Q/A) - 55 mins

Date and time:

August 13th, 9:00 UTC

Session Details

Short description of the session (~150 words):

mwsql is a new Python library with utilities for loading Wikimedia SQL dump files and converting them into more user-friendly formats such as CSV, Pandas dataframes, or native Python objects.
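To make the idea of "converting a SQL dump into a friendlier format" concrete, here is a minimal sketch that uses only the Python standard library. It is not mwsql's API or implementation; the sample data, the table name `tag_def`, and the helper names `iter_rows` and `to_csv` are all hypothetical, and the parsing is deliberately naive compared with what a real dump requires.

```python
import csv
import io

# Hypothetical sample that mimics the general shape of a SQL dump file.
SAMPLE_DUMP = """\
CREATE TABLE `tag_def` (
  `id` int NOT NULL,
  `name` varchar(255) NOT NULL
);
INSERT INTO `tag_def` VALUES (1,'mobile edit'),(2,'visual editor');
INSERT INTO `tag_def` VALUES (3,'it\\'s a tag');
"""


def iter_rows(sql_text):
    """Yield each row of every INSERT statement as a list of strings."""
    marker = "VALUES ("
    for line in sql_text.splitlines():
        if not line.startswith("INSERT INTO"):
            continue
        # Keep only the text between the first "(" after VALUES and the last ")".
        values = line[line.index(marker) + len(marker):line.rindex(")")]
        # Naive assumption: "),(" never occurs inside a quoted string value.
        for raw_row in values.split("),("):
            # Reuse the csv module to handle single-quoted, backslash-escaped fields.
            reader = csv.reader(
                [raw_row], quotechar="'", escapechar="\\", doublequote=False
            )
            yield next(reader)


def to_csv(rows, header):
    """Serialize the header and rows into a CSV string."""
    buffer = io.StringIO()
    writer = csv.writer(buffer, lineterminator="\n")
    writer.writerow(header)
    writer.writerows(rows)
    return buffer.getvalue()


rows = list(iter_rows(SAMPLE_DUMP))
csv_text = to_csv(rows, ["id", "name"])
print(csv_text)
```

All values come back as strings here; type conversion, NULL handling, and compressed input are left out of this sketch, and they are the kind of detail a dedicated library takes care of.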

The main goal of this session is to demo the basic usage of this library in PAWS, and chat about how it could be further improved to better serve the community.

Target audience:

Users of Wikimedia Dumps; developers and technical writers, new or experienced, who are interested in contributing to the Wikimedia data ecosystem; anyone looking to get involved in a beginner-friendly open source data project.

(Optional) Additional resources:

Wikimedia dumps
mwsql source
mwsql on PyPI
mwsql docs

Event Timeline

@Slst2020: Thanks for participating in the Hackathon! We hope you had a great time.

  • If this session / event took place: Please change the task status to resolved via the Add Action...Change Status dropdown.
    • If there are specific follow-up tasks from this session / event: Please create dedicated tasks and add another active project tag to those tasks, so others can find those tasks (as likely nobody in the future will look at the Hackathon workboard when trying to find something they are interested in).
  • If this session / event did not take place: Please set the task status to declined.

Thank you,
your Hackathon venue housekeeping service

Slst2020 claimed this task.