The work from T389026: Rethink rev_sha1 field will remove revision.rev_sha1 from MariaDB.
We depend on it:
In this task we need to adapt our code.
Some questions:
- Looks like DumpsV1 code will need to compute this on the fly. Should we also compute it on the fly on File Export, and just remove the revision_sha1 field from `wmf_content.mediawiki_content_history_v1?
- Alternatively we can compute it on ingestiong to wmf_content.mediawiki_content_history_v1 to honor its schema?
Decision: Given upstream MW is dropping the revision.rev_sha1 column, and that the corresponding column on our side is not used by any production workloads (code search, slack thread), we agreed to drop revision_sha1 from wmf_content.mediawiki_content_history_v1 and wmf_content.mediawiki_content_current_v1, and to compute it on the fly for File Export to honor the XSD Schema.
Some resources:
Def of RevisionSlots::computeSha1()
Def of base_convert: https://github.com/wikimedia/base-convert/blob/master/src/Functions.php#L40