Page MenuHomePhabricator

Antoine_Quhen (aqu)
User

Today

  • Clear sailing ahead.

Tomorrow

  • Clear sailing ahead.

Wednesday

  • Clear sailing ahead.

User Details

User Since
Jan 4 2022, 1:16 PM (73 w, 5 d)
Availability
Available
LDAP User
Unknown
MediaWiki User
AQuhen (WMF) [ Global Accounts ]

Recent Activity

Tue, May 30

Antoine_Quhen moved T318346: Add Python Linter Checks to CI from Next Up to In Progress on the Data Pipelines (Sprint 13) board.
Tue, May 30, 4:08 PM · Data Pipelines (Sprint 13), Data-Engineering-Planning

Mon, May 29

Antoine_Quhen added a comment to T326570: Migrate custom gitlab runner that runs Dockerfiles to releng's new production infra.

https://gitlab.wikimedia.org/repos/releng/gitlab-trusted-runner/-/merge_requests/27/commits
https://gitlab.wikimedia.org/repos/releng/gitlab-trusted-runner/-/merge_requests/26/commits

Mon, May 29, 3:26 PM · Data Pipelines
Antoine_Quhen updated the task description for T326570: Migrate custom gitlab runner that runs Dockerfiles to releng's new production infra.
Mon, May 29, 3:07 PM · Data Pipelines
Antoine_Quhen updated the task description for T326570: Migrate custom gitlab runner that runs Dockerfiles to releng's new production infra.
Mon, May 29, 3:07 PM · Data Pipelines
Antoine_Quhen claimed T318346: Add Python Linter Checks to CI.
Mon, May 29, 6:13 AM · Data Pipelines (Sprint 13), Data-Engineering-Planning

Thu, May 25

Antoine_Quhen renamed T336718: Write data to Iceberg formatted tables (mediawiki.page_content_change) from Write data to Iceberg formatted tables to Write data to Iceberg formatted tables (mediawiki.page_content_change).
Thu, May 25, 4:06 PM · Data Pipelines (Sprint 13)
Antoine_Quhen moved T336718: Write data to Iceberg formatted tables (mediawiki.page_content_change) from In Progress to Next Up on the Data Pipelines (Sprint 13) board.
Thu, May 25, 3:29 PM · Data Pipelines (Sprint 13)
Antoine_Quhen placed T336718: Write data to Iceberg formatted tables (mediawiki.page_content_change) up for grabs.
Thu, May 25, 3:28 PM · Data Pipelines (Sprint 13)
Antoine_Quhen claimed T336718: Write data to Iceberg formatted tables (mediawiki.page_content_change).
Thu, May 25, 3:28 PM · Data Pipelines (Sprint 13)
Antoine_Quhen closed T335917: Update Sqoop for externallinks table changes, a subtask of T312666: Remove duplication in externallinks table, as Resolved.
Thu, May 25, 11:08 AM · Patch-For-Review, MediaWiki-Page-derived-data, Performance-Team (Radar), DBA
Antoine_Quhen closed T335917: Update Sqoop for externallinks table changes as Resolved.
Thu, May 25, 11:08 AM · Data Pipelines (Sprint 13), Data-Engineering
Antoine_Quhen moved T335917: Update Sqoop for externallinks table changes from Ready to Deploy to Done on the Data Pipelines (Sprint 13) board.
Thu, May 25, 11:07 AM · Data Pipelines (Sprint 13), Data-Engineering
Antoine_Quhen moved T336798: Fix druid_load_pageviews_daily_aggregated_monthly from Ready to Deploy to Done on the Data Pipelines (Sprint 13) board.
Thu, May 25, 11:07 AM · Data Pipelines (Sprint 13)

Wed, May 24

Antoine_Quhen moved T335917: Update Sqoop for externallinks table changes from In Review to Ready to Deploy on the Data Pipelines (Sprint 13) board.
Wed, May 24, 4:03 PM · Data Pipelines (Sprint 13), Data-Engineering
Antoine_Quhen added a comment to T330236: Event partitions missing since 2023-02-21T10:00 for stream without events (canary events not produced?).

There is a second problem hidden behind the missing Scala lib: the Guava version mismatch between the one provided by Hadoop and the one included in eventutilities.

Wed, May 24, 2:55 PM · Data Pipelines, Data-Engineering-Planning, Event-Platform Value Stream
Antoine_Quhen added a comment to T335917: Update Sqoop for externallinks table changes.

Squooping test is conclusive and the patch could be merged right now.

Wed, May 24, 1:53 PM · Data Pipelines (Sprint 13), Data-Engineering
Antoine_Quhen created P48501 Sqooping test following schema update.
Wed, May 24, 1:52 PM · Data Pipelines
Antoine_Quhen added a comment to T330236: Event partitions missing since 2023-02-21T10:00 for stream without events (canary events not produced?).

Refinery-source does not ship Scala anymore because it was included in wikihadoop, which is not included anymore.
https://archiva.wikimedia.org/#artifact-dependencies/org.wikimedia/wikihadoop/0.3-wmf1

Wed, May 24, 9:01 AM · Data Pipelines, Data-Engineering-Planning, Event-Platform Value Stream

Tue, May 23

Antoine_Quhen added a project to T330236: Event partitions missing since 2023-02-21T10:00 for stream without events (canary events not produced?): Data Pipelines.
Tue, May 23, 4:36 PM · Data Pipelines, Data-Engineering-Planning, Event-Platform Value Stream
Antoine_Quhen moved T333004: Setup config to allow lineage instrumentation from In Review to Blocked/Paused on the Data Pipelines (Sprint 13) board.

Thanks all for the reviews. Even if the dag is working, it could be great to decide the single source of truce for our datasets metadata? Right now, its located in:

  • airflow-dags/../dataset.yml
  • airflow-dags/../..._dag.py
  • DataHub
Tue, May 23, 4:00 PM · Data Pipelines (Sprint 13), Patch-For-Review, Data-Engineering-Planning

Mon, May 22

Antoine_Quhen changed the status of T335917: Update Sqoop for externallinks table changes from Open to In Progress.
Mon, May 22, 9:07 PM · Data Pipelines (Sprint 13), Data-Engineering
Antoine_Quhen changed the status of T335917: Update Sqoop for externallinks table changes, a subtask of T312666: Remove duplication in externallinks table, from Open to In Progress.
Mon, May 22, 9:07 PM · Patch-For-Review, MediaWiki-Page-derived-data, Performance-Team (Radar), DBA
Antoine_Quhen created P48460 Update mediawiki.externallinks migration script.
Mon, May 22, 9:03 PM · Data-Engineering
Antoine_Quhen added a comment to T335917: Update Sqoop for externallinks table changes.

Do you know if there is a DB with the new schema version? It would be cool to have a place to test the import.

Mon, May 22, 8:57 PM · Data Pipelines (Sprint 13), Data-Engineering
Antoine_Quhen closed T325266: Replace refinery-source Guava caches by Caffeine as Resolved.
Mon, May 22, 12:44 PM · Event-Platform Value Stream, Data-Engineering-Planning
Antoine_Quhen closed T325266: Replace refinery-source Guava caches by Caffeine, a subtask of T327072: Java Prep for Webrequest Load, as Resolved.
Mon, May 22, 12:44 PM · Patch-For-Review, Data Pipelines (sprint 10)

Tue, May 16

Antoine_Quhen claimed T335917: Update Sqoop for externallinks table changes.
Tue, May 16, 10:44 AM · Data Pipelines (Sprint 13), Data-Engineering
Antoine_Quhen closed T326193: Airflow upgrade (refactor deb creation + version bump + switch to PostgreSQL) as Resolved.
Tue, May 16, 10:42 AM · Data Pipelines
Antoine_Quhen added a parent task for T336745: Split Cassandra Airflow dags by dataset: T336739: Post Oozie -> Airflow migration refactorings.
Tue, May 16, 10:17 AM · Data Pipelines
Antoine_Quhen added a subtask for T336739: Post Oozie -> Airflow migration refactorings: T336745: Split Cassandra Airflow dags by dataset.
Tue, May 16, 10:17 AM · Data Pipelines
Antoine_Quhen created T336745: Split Cassandra Airflow dags by dataset.
Tue, May 16, 10:17 AM · Data Pipelines
Antoine_Quhen added a parent task for T336744: Harmonize tags across Airflow dags: T336739: Post Oozie -> Airflow migration refactorings.
Tue, May 16, 10:06 AM · Data Pipelines
Antoine_Quhen added a subtask for T336739: Post Oozie -> Airflow migration refactorings: T336744: Harmonize tags across Airflow dags.
Tue, May 16, 10:06 AM · Data Pipelines
Antoine_Quhen created T336744: Harmonize tags across Airflow dags.
Tue, May 16, 10:06 AM · Data Pipelines
Antoine_Quhen moved T336739: Post Oozie -> Airflow migration refactorings from Backlog to Epics on the Data Pipelines board.
Tue, May 16, 9:58 AM · Data Pipelines
Antoine_Quhen added a parent task for T336741: Make sure all partitions sensors are using the Dataset helpers: T336739: Post Oozie -> Airflow migration refactorings.
Tue, May 16, 9:58 AM · Data Pipelines
Antoine_Quhen added a subtask for T336739: Post Oozie -> Airflow migration refactorings: T336741: Make sure all partitions sensors are using the Dataset helpers.
Tue, May 16, 9:58 AM · Data Pipelines
Antoine_Quhen created T336741: Make sure all partitions sensors are using the Dataset helpers.
Tue, May 16, 9:57 AM · Data Pipelines
Antoine_Quhen added a parent task for T336738: Refactor our existing Airflow dags to use EasyDAG & DagProperties: T336739: Post Oozie -> Airflow migration refactorings.
Tue, May 16, 9:52 AM · Data Pipelines
Antoine_Quhen added a subtask for T336739: Post Oozie -> Airflow migration refactorings: T336738: Refactor our existing Airflow dags to use EasyDAG & DagProperties.
Tue, May 16, 9:52 AM · Data Pipelines
Antoine_Quhen created T336739: Post Oozie -> Airflow migration refactorings.
Tue, May 16, 9:51 AM · Data Pipelines
Antoine_Quhen updated the task description for T336738: Refactor our existing Airflow dags to use EasyDAG & DagProperties.
Tue, May 16, 9:48 AM · Data Pipelines
Antoine_Quhen created T336738: Refactor our existing Airflow dags to use EasyDAG & DagProperties.
Tue, May 16, 9:47 AM · Data Pipelines

Mon, May 15

Antoine_Quhen added a comment to T333004: Setup config to allow lineage instrumentation.

Here is a standardized version of the first iteration for easy use by ppl without knowledge of DataHub: https://gitlab.wikimedia.org/repos/data-engineering/airflow-dags/-/merge_requests/386

Mon, May 15, 2:15 PM · Data Pipelines (Sprint 13), Patch-For-Review, Data-Engineering-Planning

Thu, May 11

Antoine_Quhen added a comment to T333004: Setup config to allow lineage instrumentation.

Some propositions for an immediate and more useful next step:

Thu, May 11, 3:30 PM · Data Pipelines (Sprint 13), Patch-For-Review, Data-Engineering-Planning
Antoine_Quhen moved T333004: Setup config to allow lineage instrumentation from In Progress to In Review on the Data Pipelines (Sprint 12) board.
Thu, May 11, 1:50 PM · Data Pipelines (Sprint 13), Patch-For-Review, Data-Engineering-Planning
Antoine_Quhen added a comment to T333004: Setup config to allow lineage instrumentation.

https://gitlab.wikimedia.org/repos/data-engineering/airflow-dags/-/merge_requests/382

Thu, May 11, 1:45 PM · Data Pipelines (Sprint 13), Patch-For-Review, Data-Engineering-Planning

Wed, May 10

Antoine_Quhen added a comment to T333004: Setup config to allow lineage instrumentation.

Update: I'm emitting metadata to Kafka from an ad-hoc Airflow data lineage task. The configuration is setting up the communication with Kafka and the schema registry, Karapace. Then the metadata is well-fetched by the mce-consumer service on the DataHub side. Now I'm looking to use the detailed version of the data lineage event, containing more information than just the link upstream<>downstream.

Wed, May 10, 8:55 PM · Data Pipelines (Sprint 13), Patch-For-Review, Data-Engineering-Planning

May 4 2023

Antoine_Quhen moved T334101: [Airflow] Migrate mediawiki geoeditors druid loading job from In Review to Ready to Deploy on the Data Pipelines (Sprint 12) board.
May 4 2023, 12:57 PM · Patch-For-Review, Data Pipelines (Sprint 12)

May 3 2023

Antoine_Quhen closed T332707: Auto clean /wmf/data/raw/webrequests_data_loss as Resolved.

I've checked the result on HDFS. It performs as expected.

May 3 2023, 4:50 PM · Data Pipelines (Sprint 12)

May 2 2023

Antoine_Quhen moved T333004: Setup config to allow lineage instrumentation from Next Up to In Progress on the Data Pipelines (Sprint 12) board.
May 2 2023, 2:01 PM · Data Pipelines (Sprint 13), Patch-For-Review, Data-Engineering-Planning
Antoine_Quhen claimed T333004: Setup config to allow lineage instrumentation.
May 2 2023, 2:00 PM · Data Pipelines (Sprint 13), Patch-For-Review, Data-Engineering-Planning

Apr 24 2023

Antoine_Quhen moved T334101: [Airflow] Migrate mediawiki geoeditors druid loading job from Next Up to In Progress on the Data Pipelines (Sprint 12) board.
Apr 24 2023, 4:04 PM · Patch-For-Review, Data Pipelines (Sprint 12)
Antoine_Quhen claimed T334101: [Airflow] Migrate mediawiki geoeditors druid loading job.
Apr 24 2023, 9:19 AM · Patch-For-Review, Data Pipelines (Sprint 12)
Antoine_Quhen moved T332707: Auto clean /wmf/data/raw/webrequests_data_loss from In Review to Ready to Deploy on the Data Pipelines (Sprint 11) board.
Apr 24 2023, 9:11 AM · Data Pipelines (Sprint 12)

Apr 14 2023

Antoine_Quhen moved T332707: Auto clean /wmf/data/raw/webrequests_data_loss from In Progress to In Review on the Data Pipelines (Sprint 11) board.
Apr 14 2023, 8:20 AM · Data Pipelines (Sprint 12)

Apr 13 2023

Antoine_Quhen updated the task description for T332707: Auto clean /wmf/data/raw/webrequests_data_loss.
Apr 13 2023, 3:19 PM · Data Pipelines (Sprint 12)
Antoine_Quhen created T334678: webrequest / webrequest raw quality check .
Apr 13 2023, 3:19 PM · Data Pipelines
Antoine_Quhen added a comment to T332707: Auto clean /wmf/data/raw/webrequests_data_loss.

OK to separate the migration from this task.

Apr 13 2023, 2:27 PM · Data Pipelines (Sprint 12)
Antoine_Quhen added a comment to T327073: Write Airflow DAG to move the webrequest load job to airflow..

Bug: There is an extra systemd check making sure SUCCESS files are generated:
https://github.com/wikimedia/operations-puppet/blob/fc98a524be9be65935b8d80b506ca33af5d442b2/modules/profile/manifests/analytics/refinery/job/data_check.pp#L27

Apr 13 2023, 2:26 PM · Data Pipelines (Sprint 11), Patch-For-Review
Antoine_Quhen updated the task description for T332707: Auto clean /wmf/data/raw/webrequests_data_loss.
Apr 13 2023, 10:00 AM · Data Pipelines (Sprint 12)

Apr 12 2023

Antoine_Quhen moved T327073: Write Airflow DAG to move the webrequest load job to airflow. from Ready to Deploy to Done on the Data Pipelines (Sprint 11) board.
Apr 12 2023, 10:36 AM · Data Pipelines (Sprint 11), Patch-For-Review

Apr 11 2023

Antoine_Quhen added a comment to T333001: Setup for allowing Airflow deployment via Git Repository.

I like idea A because the conda env encapsulates all needed libs.

Apr 11 2023, 5:24 PM · Data Pipelines (Sprint 12)
Antoine_Quhen updated the task description for T334493: anlytics/refinery deployment broken at refinery-deploy-to-hdfs.
Apr 11 2023, 4:55 PM · Patch-For-Review, Data Pipelines (Sprint 12)
Antoine_Quhen created T334493: anlytics/refinery deployment broken at refinery-deploy-to-hdfs.
Apr 11 2023, 4:54 PM · Patch-For-Review, Data Pipelines (Sprint 12)
Antoine_Quhen moved T332707: Auto clean /wmf/data/raw/webrequests_data_loss from Next Up to In Progress on the Data Pipelines (Sprint 11) board.
Apr 11 2023, 8:55 AM · Data Pipelines (Sprint 12)

Apr 7 2023

Antoine_Quhen closed T333923: Monthly pageview stats for March 2023 missing as Resolved.

I can confirm that now the data looks good in both:

Apr 7 2023, 7:36 AM · Data-Engineering, Data-Engineering-Wikistats

Apr 6 2023

Antoine_Quhen added a comment to T333923: Monthly pageview stats for March 2023 missing.

The data has been regenerated and should be pushed automatically to the web endpoint at 5 am UTC.

Apr 6 2023, 7:39 PM · Data-Engineering, Data-Engineering-Wikistats
Antoine_Quhen added a comment to T333923: Monthly pageview stats for March 2023 missing.

https://gitlab.wikimedia.org/repos/data-engineering/airflow-dags/-/merge_requests/352

Apr 6 2023, 3:58 PM · Data-Engineering, Data-Engineering-Wikistats

Apr 3 2023

Antoine_Quhen updated the task description for T333001: Setup for allowing Airflow deployment via Git Repository.
Apr 3 2023, 4:33 PM · Data Pipelines (Sprint 12)

Mar 31 2023

Antoine_Quhen added a comment to T327073: Write Airflow DAG to move the webrequest load job to airflow..

Optionally, some updates to the java code: https://gerrit.wikimedia.org/r/c/analytics/refinery/source/+/904778

Mar 31 2023, 1:08 PM · Data Pipelines (Sprint 11), Patch-For-Review
Antoine_Quhen updated the task description for T327073: Write Airflow DAG to move the webrequest load job to airflow..
Mar 31 2023, 8:37 AM · Data Pipelines (Sprint 11), Patch-For-Review

Mar 30 2023

Antoine_Quhen updated the task description for T327073: Write Airflow DAG to move the webrequest load job to airflow..
Mar 30 2023, 1:10 PM · Data Pipelines (Sprint 11), Patch-For-Review

Mar 29 2023

Antoine_Quhen updated the task description for T327073: Write Airflow DAG to move the webrequest load job to airflow..
Mar 29 2023, 4:04 PM · Data Pipelines (Sprint 11), Patch-For-Review
Antoine_Quhen moved T327073: Write Airflow DAG to move the webrequest load job to airflow. from In Progress to In Review on the Data Pipelines (sprint 10) board.
Mar 29 2023, 4:04 PM · Data Pipelines (Sprint 11), Patch-For-Review

Mar 24 2023

Antoine_Quhen updated the task description for T327073: Write Airflow DAG to move the webrequest load job to airflow..
Mar 24 2023, 11:34 AM · Data Pipelines (Sprint 11), Patch-For-Review

Mar 21 2023

Antoine_Quhen updated the task description for T332707: Auto clean /wmf/data/raw/webrequests_data_loss.
Mar 21 2023, 2:22 PM · Data Pipelines (Sprint 12)
Antoine_Quhen updated the task description for T332707: Auto clean /wmf/data/raw/webrequests_data_loss.
Mar 21 2023, 2:20 PM · Data Pipelines (Sprint 12)
Antoine_Quhen created T332707: Auto clean /wmf/data/raw/webrequests_data_loss.
Mar 21 2023, 2:19 PM · Data Pipelines (Sprint 12)

Mar 17 2023

Antoine_Quhen added a comment to T332031: Upgrade platform_eng Airflow instance to 2.5.1.

No history was lost. Some dags have been renamed: https://gitlab.wikimedia.org/repos/data-engineering/airflow-dags/-/commit/760f31789ee20f3e6e263fa4733ff51202fa52a0

Mar 17 2023, 9:23 AM · Structured-Data-Backlog (Current Work)

Mar 10 2023

Antoine_Quhen updated the task description for T326193: Airflow upgrade (refactor deb creation + version bump + switch to PostgreSQL).
Mar 10 2023, 8:30 AM · Data Pipelines

Mar 7 2023

Antoine_Quhen moved T305842: Migrate the referrer job from Next Up to In Progress on the Data Pipelines (Sprint 11) board.
Mar 7 2023, 5:04 PM · Data Pipelines (sprint 10)

Mar 6 2023

Antoine_Quhen moved T327073: Write Airflow DAG to move the webrequest load job to airflow. from In Progress to In Review on the Data Pipelines (Sprint 11) board.

Here is the Airflow Job: https://gitlab.wikimedia.org/repos/data-engineering/airflow-dags/-/merge_requests/260

Mar 6 2023, 5:04 PM · Data Pipelines (Sprint 11), Patch-For-Review

Mar 3 2023

Antoine_Quhen updated the task description for T327073: Write Airflow DAG to move the webrequest load job to airflow..
Mar 3 2023, 2:42 PM · Data Pipelines (Sprint 11), Patch-For-Review
Antoine_Quhen updated the task description for T327073: Write Airflow DAG to move the webrequest load job to airflow..
Mar 3 2023, 9:47 AM · Data Pipelines (Sprint 11), Patch-For-Review
Antoine_Quhen moved T327072: Java Prep for Webrequest Load from In Review to Ready to Deploy on the Data Pipelines (Sprint 11) board.
Mar 3 2023, 9:43 AM · Patch-For-Review, Data Pipelines (sprint 10)

Feb 28 2023

Antoine_Quhen added a comment to T327073: Write Airflow DAG to move the webrequest load job to airflow..

In the description, I've added a list of jobs that look like dependencies.

Feb 28 2023, 9:58 AM · Data Pipelines (Sprint 11), Patch-For-Review
Antoine_Quhen updated the task description for T327073: Write Airflow DAG to move the webrequest load job to airflow..
Feb 28 2023, 9:57 AM · Data Pipelines (Sprint 11), Patch-For-Review

Feb 23 2023

Antoine_Quhen added a comment to T326193: Airflow upgrade (refactor deb creation + version bump + switch to PostgreSQL).

Note: we should create a new branch, main_airflow_2_5_1, in airflow-dags to deploy the code only one the instances which are migrated. Hopefully, it shouldn't take long. Then when the migration process is finished we could merge to main and deploy from main.

Feb 23 2023, 3:53 PM · Data Pipelines
Antoine_Quhen moved T330199: Migrate virtual page view from Oozie to Airflow from Next Up to Next Up on the Data Pipelines (Sprint 11) board.
Feb 23 2023, 3:46 PM · Data Pipelines (sprint 10)
Antoine_Quhen edited projects for T330199: Migrate virtual page view from Oozie to Airflow, added: Data Pipelines (Sprint 11); removed Data Pipelines.
Feb 23 2023, 3:46 PM · Data Pipelines (sprint 10)
Antoine_Quhen claimed T330199: Migrate virtual page view from Oozie to Airflow.
Feb 23 2023, 3:44 PM · Data Pipelines (sprint 10)
Antoine_Quhen claimed T327073: Write Airflow DAG to move the webrequest load job to airflow..
Feb 23 2023, 3:42 PM · Data Pipelines (Sprint 11), Patch-For-Review
Antoine_Quhen moved T327073: Write Airflow DAG to move the webrequest load job to airflow. from Next Up to In Progress on the Data Pipelines (Sprint 11) board.
Feb 23 2023, 3:42 PM · Data Pipelines (Sprint 11), Patch-For-Review
Antoine_Quhen edited projects for T327073: Write Airflow DAG to move the webrequest load job to airflow., added: Data Pipelines (Sprint 11); removed Data Pipelines.
Feb 23 2023, 3:42 PM · Data Pipelines (Sprint 11), Patch-For-Review
Antoine_Quhen assigned T330154: Check if new airflow package is usable on both buster and bullseye to BTullis.
Feb 23 2023, 3:41 PM · Data Pipelines (Sprint 11)
Antoine_Quhen updated Other Assignee for T330154: Check if new airflow package is usable on both buster and bullseye, added: BTullis.
Feb 23 2023, 3:40 PM · Data Pipelines (Sprint 11)

Feb 21 2023

Antoine_Quhen added a subtask for T326193: Airflow upgrade (refactor deb creation + version bump + switch to PostgreSQL): T330154: Check if new airflow package is usable on both buster and bullseye.
Feb 21 2023, 8:14 PM · Data Pipelines
Antoine_Quhen added a parent task for T330154: Check if new airflow package is usable on both buster and bullseye: T326193: Airflow upgrade (refactor deb creation + version bump + switch to PostgreSQL).
Feb 21 2023, 8:14 PM · Data Pipelines (Sprint 11)
Antoine_Quhen created T330199: Migrate virtual page view from Oozie to Airflow.
Feb 21 2023, 5:43 PM · Data Pipelines (sprint 10)