Currently there are a lot of issues for evaluation and analysis of Blazegraph replacements, such as:
T206560 - Evaluate alternatives to BG (including lots of subtasks around testing and evaluating alternatives)
T306725 - Decide which BG services to migrate (assuming a migration is bound to happen)
... but no issue for the migration itself. It seems unavoidable and urgent, hence this task.
We should migrate before reloads fail: Blazegraph instability has been slowing down data reloads on WDQS, and may prevent them altogether next time. As the Query Service is the public-facing part of Wikidata in many contexts, this feels like preventing WD itself from being updated.
@Gehel [[ https://lists.wikimedia.org/hyperkitty/list/wikidata@lists.wikimedia.org/message/7QTJBRU2T3J22SNV4TGBRML4QNBGCEOU/ | wrote in Feb 2023 ]]:
> TL;DR: We expect to successfully complete the recent data reload on Wikidata Query Service soon, but we've encountered multiple failures related to the size of the graph, and anticipate that this issue may worsen in the future. Although we succeeded this time, we cannot guarantee that future reload attempts will be successful given the current trend of the data reload process. Thank you for your understanding
**Proposal**:
* Migrate WD to a different db backend before we next need to reload the query service. (Even if there is a double-backend solution for a time: T290839)
* Document the migration process for ourselves and for other wikibase users.
**Motivation to do this now**:
# We need a new production-quality backend. Practicing + testing a migration helps practice future recovery workflows.
# Working through the migration process will bring needed attention to this critical step in WD growth
# Whatever the challenges, waiting until a backend failure happens will be worse.
# There is an ongoing tax for delaying migration: more issues opening every season for fixing slowness, failures, or other inconsistencies with BG.