After T411410: Decommission WDQS full graph endpoint (wdqs2009) we will publish a copy of the journal file to clouddumps hosts.
See this thread (internal slack) for details.
The journal meets criteria for "low risk" dataset. As due diligence, back in december, I filled in the WMF Record of Data Publication according to the Data Publication Guidelines.
We will need to split the file into 130 x 10GBs chunks. The journal can be used to build self-hosted instances of Wikidata Query Service,
and will be available until March 2026.
The files will be uploaded after WDQS is turned off and no database writes are in flight. We will not need to take a snaphot of the host lvm volume.
AC:
- Blazegraph journal file is available (chunked) at https://dumps.wikimedia.org.