
Upgrade DataHub V0.8.38
Closed, ResolvedPublic

Assigned To
Authored By
odimitrijevic
Jun 7 2022, 2:48 PM

Description

Version 0.8.36 of DataHub provides a UI for editing the Data Glossary, which is a feature we would really like to try.

Upstream skipped version 0.8.37 due to an error on their part, then quickly followed it up with 0.8.38.

Version 0.8.38 includes more Glossary related features:

  • Link Glossary Terms to one another via the UI - You can now link Glossary Terms from the UI with the following relationships:
    • Contains - relate Terms when one is a superset of another, e.g. Address contains Zip Code
    • Inherits - relate Terms when one is a subtype of another, e.g. Email inherits PII

Event Timeline

EChetty triaged this task as High priority.
EChetty moved this task from Next Up to In Progress on the Data-Catalog board.
EChetty moved this task from In Progress to Next Up on the Data-Catalog board.
EChetty moved this task from Next Up to In Progress on the Data-Catalog board.
BTullis renamed this task from Upgrade DataHub V0.8.36 to Upgrade DataHub V0.8.38. (Jun 10 2022, 3:42 PM)
BTullis updated the task description.

I have started work on this and I am following the guidelines here: https://wikitech.wikimedia.org/wiki/Analytics/Systems/DataHub/Upgrading

Get update from upstream

git remote update linkedin-github

Push master branch to Gerrit's master branch

git push origin linkedin-github/master:master

Push tags to Gerrit

git push origin --tags

Rebase our changes against v0.8.38

git rebase -i v0.8.38

Fixed one small merge conflict in docker/mysql-setup/init.sh

vi  docker/mysql-setup/init.sh
git add docker/
git rebase --continue

Push the updated wmf branch.

git push --force-with-lease
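
For reference, the steps above can be collected into a small helper. This is only a sketch, not part of the documented procedure: the `run` and `upgrade_datahub` functions and the `DRY_RUN` variable are hypothetical names introduced here, and it assumes the wmf branch is checked out with `origin` pointing at Gerrit and `linkedin-github` at upstream, as in the commands above.

```shell
#!/usr/bin/env bash
# Sketch only: wraps the manual upgrade steps listed above.
# Assumes the wmf branch is checked out, with remotes named
# "origin" (Gerrit) and "linkedin-github" (upstream).
set -euo pipefail

# Hypothetical helper: print each command, and execute it unless DRY_RUN=1.
run() {
  echo "+ $*"
  if [ "${DRY_RUN:-0}" != "1" ]; then
    "$@"
  fi
}

# Hypothetical wrapper: takes the upstream tag to rebase onto, e.g. v0.8.38.
upgrade_datahub() {
  local tag="$1"
  run git remote update linkedin-github               # get update from upstream
  run git push origin linkedin-github/master:master   # sync Gerrit's master branch
  run git push origin --tags                          # push upstream tags to Gerrit
  run git rebase -i "$tag"                            # rebase our changes onto the tag
  run git push --force-with-lease                     # push the updated wmf branch
}
```

Running `DRY_RUN=1 upgrade_datahub v0.8.38` prints the commands without executing them. If the rebase stops on a conflict, as it did here in docker/mysql-setup/init.sh, the function exits before the final push. The conflict then needs to be resolved by hand, followed by `git rebase --continue` and the push.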

Change 804611 had a related patch set uploaded (by Btullis; author: Btullis):

[analytics/datahub@wmf] Release v0.8.38 of DataHub using WMF customization

https://gerrit.wikimedia.org/r/804611

I have now made a small change to the README-WMF.md file so that it triggers the build pipeline:
https://gerrit.wikimedia.org/r/c/analytics/datahub/+/804611

Change 804611 merged by jenkins-bot:

[analytics/datahub@wmf] Release v0.8.38 of DataHub using WMF customization

https://gerrit.wikimedia.org/r/804611

Change 805826 had a related patch set uploaded (by Btullis; author: Btullis):

[operations/deployment-charts@master] Update the version of the datahub containers that is deployed

https://gerrit.wikimedia.org/r/805826

Change 805826 merged by jenkins-bot:

[operations/deployment-charts@master] Update the version of the datahub containers that is deployed

https://gerrit.wikimedia.org/r/805826

Change 805839 merged by jenkins-bot:

[analytics/datahub@wmf] Update the name of the binary used to launch datahub-frontend

https://gerrit.wikimedia.org/r/805839

Change 806232 had a related patch set uploaded (by Btullis; author: Btullis):

[operations/deployment-charts@master] Update the container image used by DataHub 0.8.38

https://gerrit.wikimedia.org/r/806232

Change 806232 merged by jenkins-bot:

[operations/deployment-charts@master] Update the container image used by DataHub 0.8.38

https://gerrit.wikimedia.org/r/806232

Unfortunately, we're getting an error after deployment of 0.8.38.
{F35247060}
I will investigate now.

Change 806381 had a related patch set uploaded (by Btullis; author: Btullis):

[operations/deployment-charts@master] Disable the telemetry for DataHub

https://gerrit.wikimedia.org/r/806381

Change 806381 merged by jenkins-bot:

[operations/deployment-charts@master] Disable the telemetry for DataHub

https://gerrit.wikimedia.org/r/806381

Change 806396 had a related patch set uploaded (by Btullis; author: Btullis):

[operations/deployment-charts@master] Disable native authentication for DataHub

https://gerrit.wikimedia.org/r/806396

Change 806396 merged by jenkins-bot:

[operations/deployment-charts@master] Disable native authentication for DataHub

https://gerrit.wikimedia.org/r/806396

Change 806427 had a related patch set uploaded (by Btullis; author: Btullis):

[analytics/datahub@wmf] Update the blubber configuration for the datahub-frontend

https://gerrit.wikimedia.org/r/806427

Change 806427 merged by jenkins-bot:

[analytics/datahub@wmf] Update the blubber configuration for the datahub-frontend

https://gerrit.wikimedia.org/r/806427

Change 806435 had a related patch set uploaded (by Btullis; author: Btullis):

[operations/deployment-charts@master] Update the container used for datahub deployments

https://gerrit.wikimedia.org/r/806435

Change 806435 merged by jenkins-bot:

[operations/deployment-charts@master] Update the container used for datahub deployments

https://gerrit.wikimedia.org/r/806435

This is now working. The issue was caused by a small change to the upstream build process, which we hadn't copied to our blubber pipeline.
As a result, the datahub-frontend container we built was missing the web-react components.

Nice things about this release:

  1. Analytics is working now: https://datahub.wikimedia.org/analytics

{F35249401}

  2. The Glossary is now editable in the UI - This is accessible from the Govern menu at the top right.

{F35249403}