Page MenuHomePhabricator

Set up automated deployment strategy for Product Analytics job files
Closed, ResolvedPublic

Description

While setting up the first Oozie job that will go in the new analytics/wmf-products/jobs repo (T261953), @JAllemandou pointed out that we should set up a system that deploys each new version of the job files in a new timestamped folder in HDFS (as Analytics already does for their jobs). This is important because if we instead overwrite the job files, this could break any in-progress jobs.

Event Timeline

LGoto triaged this task as Low priority.Nov 16 2020, 5:35 PM
LGoto moved this task from Triage to Backlog on the Product-Analytics board.

Change 651794 had a related patch set uploaded (by Neil P. Quinn-WMF; owner: Neil P. Quinn-WMF):
[analytics/wmf-product/jobs@master] Set up and document deployment strategy for jobs

https://gerrit.wikimedia.org/r/651794

nshahquinn-wmf raised the priority of this task from Low to Medium.Jan 6 2021, 3:48 PM

Across Product Analytics, we've identified a fairly significant need for Oozie jobs, so work like this to streamline use of them is now more useful. (On the other hand, it looks like Analytics Engineering has a definite plan to switch to Airflow at some point, but we don't have a lot of information on that yet.)

Change 651794 merged by Neil P. Quinn-WMF:
[analytics/wmf-product/jobs@master] Set up and document deployment strategy for jobs

https://gerrit.wikimedia.org/r/651794

Change 658578 had a related patch set uploaded (by Neil P. Quinn-WMF; owner: Neil P. Quinn-WMF):
[analytics/wmf-product/jobs@master] Improve documentation on deploying Oozie jobs

https://gerrit.wikimedia.org/r/658578

Change 658578 merged by Neil P. Quinn-WMF:
[analytics/wmf-product/jobs@master] Improve documentation on deploying Oozie jobs

https://gerrit.wikimedia.org/r/658578