Description
The technical initiative "Make MediaWiki XML content dump available for external consumption" will continue to support the primary goal of https://dumps.wikimedia.org/: the right to fork. This is essential work to maintain and modernize the dumps process and to decrease its maintenance burden.
KR/Hypothesis (Initiative)
Additional use cases for future enhancement of data dumps will come from the findings of T337887: Consult Dumps users and other community members about the future of Dumps. They will likely roll up into SDS 3.4.1: If our trusted datasets are all in the same place and follow the same conventions for dimension semantics, naming, and granularity, it will be easier to combine and extract the data and to serve data that can be easily evaluated in terms of privacy.
Use Case
Data dumps for right to fork.
User Story/ies
As a right to forker,
I need to access dumps the same way I always have
So my workflow is not interrupted.
Outcome
0% regression in current level of availability
No breaking changes for right to forkers
Decrease the time needed to produce the monthly data dump by 50%
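The outcome targets above can be checked with simple arithmetic once a baseline is documented. The following is a minimal sketch only: the function name, its parameters, and the example numbers are hypothetical and do not come from any existing Dumps tooling or measurement.

```python
def check_outcome_targets(baseline_hours, new_hours,
                          baseline_availability, new_availability):
    """Compare new measurements against a documented baseline.

    All inputs are hypothetical measurements supplied by the caller:
    - *_hours: wall-clock time to produce the monthly dump
    - *_availability: fraction of time the dumps were available (0.0 to 1.0)
    """
    # Fractional reduction in dump time relative to the baseline.
    time_reduction = (baseline_hours - new_hours) / baseline_hours
    return {
        "time_reduction": time_reduction,
        # Target: decrease the time to the monthly dump by 50%.
        "time_target_met": time_reduction >= 0.5,
        # Target: 0% regression in the current level of availability.
        "availability_target_met": new_availability >= baseline_availability,
    }


# Illustrative values only; the real baseline still has to be established.
result = check_outcome_targets(240, 120, 0.99, 0.99)
print(result)
```

A check like this depends on the "Establish and document current baseline on wiki" task, since both targets are defined relative to that baseline.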
Success criteria
- Right to forkers (and other dumps users) will not notice any regressions in service level and may benefit from an increase in availability.
- The amount of time spent on maintenance tasks is decreased.
Acceptance Criteria
Meets the current level of access in terms of frequency
Is maintainable by a lean team focused on other priorities
Uses existing infrastructure capabilities or aligns with infrastructure enhancements within the timeframe
Is completed by the end of Q2 23/24
Data SRE are able to monitor success as part of their other monitoring workflows {telemetry task to be added}
Usage is instrumented so we can understand how often the dumps are being used {instrumentation task to be added}
Nice to have: required or optional contact info, opt-in user surveys, and improved mechanisms to learn about and communicate with users
Dependencies
- {...list of technical tasks required to achieve this}
- Example task that supports this user story: T340861: Implement a backfill job for the dumps hourly table
- Establish and document current baseline on wiki
- T337887: Consult Dumps users and other community members about the future of Dumps
Artifacts & Resources
Dumps: Users and Usages