Upgrade Refinery Jobs to Spark 3
Closed, Resolved (Public)

Description

This task is to update refinery jobs to Spark 3.

As a follow-up task, the Spark 3 jobs need to be tested, with the exception of dynamic allocation (the expectation is that it will succeed). There should be no change in behavior. Most Spark jobs don't have unit tests because there isn't sufficient test data.

Event Timeline

Change 656897 had a related patch set uploaded (by Ottomata; author: Joal):

[analytics/refinery/source@master] [WIP] Update to spark-3 and scala-2.12

https://gerrit.wikimedia.org/r/656897

odimitrijevic renamed this task from "Upgrade to Spark 3" to "Upgrade Refinery Jobs to Spark 3". (Sep 20 2021, 4:36 PM)
odimitrijevic triaged this task as Medium priority.
odimitrijevic updated the task description.
odimitrijevic moved this task from Incoming to Smart Tools for Better Data on the Analytics board.

Hm, I think this task is also about installing and supporting Spark 3 in favor of Spark 2, with the eventual goal of removing Spark 2. This means making sure everything works with conda and jupyter and airflow.

We thought it would be better to create a parent task for this one and all you mention. Doing it now.

Change 798662 had a related patch set uploaded (by Joal; author: Joal):

[analytics/refinery/source@master] Bump version to 0.2.0-SNAPSHOT to release 0.2.0

https://gerrit.wikimedia.org/r/798662

Change 798662 merged by Joal:

[analytics/refinery/source@master] Bump version to 0.2.0-SNAPSHOT to release 0.2.0

https://gerrit.wikimedia.org/r/798662

Ottomata claimed this task.

I think this task is long done! Please reopen if I am incorrect.