Gerrit data can be useful to write a whole new class of tools :)
Version: unspecified
Severity: enhancement
Status | Subtype | Assigned | Task
---|---|---|---
Resolved | | jcrespo | T50930 Database replication problems - production and labs (tracking)
Resolved | | chasemp | T52422 Replicate the Phabricator database to labsdb
This may not be all that relevant given the likelihood of switching to Phabricator in the medium term.
In that case we can re-purpose the bug to make Phabricator's database available on labs after the switch happens.
No idea what "Phabricator" database this is about. Everything?
Plus no use case described here at all.
Plus I don't see how to sanitize that data (legal).
I think the original use case (if I recall correctly from IRC) was to get the gerrit /and bugzilla/ databases available from labs.
I don't think there's a problem in principle, though it will require a great deal of consideration. (Sanitization in particular: it's not an insurmountable problem, since we do exactly that for the project databases.)
I mostly intended to keep this bug open as a task tracker for the longer term.
(In reply to Andre Klapper from comment #4)
> No idea what "Phabricator" database this is about. Everything?
> Plus no use case described here at all.
> Plus I don't see how to sanitize that data (legal).
Yes, everything that is not private.
The use case is to create tools like [[mw:Gerrit/Reports]] that don't need to rely on hacks like bug 52329.
Feel free to provide links to how it's done for MW DBs if it can be done "the same way".
A couple of code links:
I don't believe there's an abundance of documentation about this, but @Springle and @yuvipanda can probably provide more background.
Things you would need:
Then we'll have to:
Lots of manual work + lots of DBA work :)
We have visited this in a few other tickets, and the TL;DR is that Phab holds a huge amount of sensitive data for fundraising, procurement, security, etc. It's not viable to blacklist this data. The best we can do is whitelist public tasks and their associated objects and publish a structured data dump.
That is here:
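For anyone trying to picture the whitelist approach, here is a rough sketch in Python. It is not the actual dump code and does not use Phabricator's real schema: the `view_policy` key, the `"public"` value, and the `PUBLIC_FIELDS` list are all assumptions made up for illustration. The point is only that a task is published when it is explicitly marked public, and that only explicitly listed fields leave the system.

```python
import json

# Hypothetical policy value marking a task as publicly viewable.
PUBLIC_POLICY = "public"

# Hypothetical whitelist of fields considered safe to publish.
PUBLIC_FIELDS = ("id", "title", "status")


def public_dump(tasks):
    """Return a JSON dump of the public tasks, restricted to whitelisted fields."""
    published = []
    for task in tasks:
        # Whitelist check: anything not explicitly public is dropped,
        # including tasks with a missing or unknown policy.
        if task.get("view_policy") != PUBLIC_POLICY:
            continue
        # Copy only the whitelisted fields; everything else stays behind.
        published.append({field: task.get(field) for field in PUBLIC_FIELDS})
    return json.dumps(published, indent=2, sort_keys=True)


# Made-up sample data, not real tasks.
sample = [
    {"id": 1, "title": "Public bug", "status": "open",
     "view_policy": "public", "author_email": "a@example.org"},
    {"id": 2, "title": "Security issue", "status": "open",
     "view_policy": "security"},
    {"id": 3, "title": "No policy recorded", "status": "resolved"},
]

print(public_dump(sample))
```

With a blacklist, a new restricted space or a new sensitive column would leak by default until someone noticed; with the whitelist above, it is simply absent from the dump until someone adds it on purpose.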