
[Epic] XML MediaWiki data dumps for right to fork
Open, Medium, Public

Description

The technical initiative to make MediaWiki XML content dumps available for external consumption will continue to support the primary goal of https://dumps.wikimedia.org/: the right to fork. This work is essential to maintain and modernize the dump process and to decrease its maintenance burden.

KR/Hypothesis(Initiative)

Additional use cases for future enhancement of data dumps will come from the findings of T337887: Consult Dumps users and other community members about the future of Dumps. They will likely roll up into SDS 3.4.1: If our trusted datasets are all in the same place and follow the same conventions for dimension semantics, naming, and granularity, it will be easier to combine and extract the data and to serve data that can be easily evaluated in terms of privacy.

Use Case

Data dumps for right to fork.

User Story/ies

As a right to forker,
I need to access dumps the same way I always have
So my workflow is not interrupted.
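
As an illustration of the kind of workflow this user story protects, here is a minimal sketch of streaming pages out of a bz2-compressed MediaWiki XML export. The inline sample document, its export namespace version, and the helper names `local_name` and `iter_pages` are assumptions made for the example; they are not taken from this task or from any specific consumer's tooling.

```python
import bz2
import io
import xml.etree.ElementTree as ET

# Tiny stand-in for a real dump file. The namespace version is an assumption;
# the parser below ignores the namespace so other versions also work.
SAMPLE = b"""<mediawiki xmlns="http://www.mediawiki.org/xml/export-0.11/">
  <page>
    <title>Example</title>
    <id>1</id>
    <revision>
      <id>10</id>
      <text>Hello, fork.</text>
    </revision>
  </page>
  <page>
    <title>Second</title>
    <id>2</id>
    <revision>
      <id>20</id>
      <text>More text.</text>
    </revision>
  </page>
</mediawiki>"""


def local_name(tag):
    """Strip the '{namespace}' prefix that ElementTree adds to tag names."""
    return tag.rsplit("}", 1)[-1]


def iter_pages(stream):
    """Yield (title, text) per page without loading the whole file."""
    title, text = None, None
    for _event, elem in ET.iterparse(stream, events=("end",)):
        name = local_name(elem.tag)
        if name == "title":
            title = elem.text
        elif name == "text":
            text = elem.text or ""
        elif name == "page":
            yield title, text
            title, text = None, None
            elem.clear()  # release the finished page's subtree


# Simulate a compressed dump in memory, then read it back as a stream.
compressed = bz2.compress(SAMPLE)
with bz2.open(io.BytesIO(compressed)) as stream:
    pages = list(iter_pages(stream))

for page_title, page_text in pages:
    print(page_title, len(page_text))
```

A consumer built along these lines depends on the file location, the compression, and the XML schema staying stable, which is what "no breaking changes" would need to cover.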

Outcome

  • 0% regression in current level of availability
  • No breaking changes for right to forkers
  • Decrease time to monthly data dump by 50%

Success criteria

  • Right to forkers (and other dumps users) will not notice any regressions in service level and may benefit from an increase in availability.
  • Amount of time on maintenance tasks is decreased.

Acceptance Criteria

  • Meets the current level of access in terms of frequency
  • Is maintainable by a lean team focused on other priorities
  • Uses existing infrastructure capabilities or aligns with infrastructure enhancements within the timeframe
  • Is completed by the end of Q2 23/24
  • Data SRE are able to monitor success as part of their other monitoring workflows {telemetry task to be added}
  • Usage is instrumented so we can understand how often the dumps are being used {instrumentation task to be added}
  • Nice to have: required or optional contact info, opt-in user surveys, improved mechanisms to learn about and communicate with users

Dependencies

Artifacts & Resources

Dumps: Users and Usages

Related Objects

Status     Subtype  Assigned            Task
Open                VirginiaPoundstone
Resolved            Milimetric
Resolved            Milimetric
Resolved   Spike    Milimetric
Resolved            Milimetric
Resolved            VirginiaPoundstone
Resolved            Milimetric
Resolved            Milimetric
Resolved            Milimetric
Resolved            xcollazo
Resolved            xcollazo
Duplicate           None
Resolved            xcollazo
Duplicate           None
Resolved            xcollazo
Resolved            xcollazo
Resolved            xcollazo
Resolved            xcollazo
Resolved            JEbe-WMF
Resolved            xcollazo
Resolved            xcollazo
Duplicate           None
Resolved            xcollazo
Resolved            xcollazo
Resolved            JEbe-WMF
Resolved            xcollazo
Resolved            xcollazo
Resolved            JEbe-WMF
Resolved            xcollazo
Resolved            xcollazo
Resolved            xcollazo
Resolved            xcollazo
Open                Quiddity
Resolved            VirginiaPoundstone
Open                xcollazo
Open                None
Open                None
Resolved            xcollazo
Open                gmodena
Open                gmodena
Open                gmodena
Open                None
Open                None
Open                None
Open                None
Open                xcollazo
Open                None
Open                None
Open                None
Open                None
Open                VirginiaPoundstone
Open                None

Event Timeline