Page MenuHomePhabricator

Image suggestions data pipeline onboarding request
Closed, ResolvedPublic

Description

Draft merge request at https://gitlab.wikimedia.org/repos/generated-data-platform/datapipelines/-/merge_requests/51. marked as ready to be merged

From https://www.mediawiki.org/wiki/Platform_Engineering_Team/Data_Value_Stream/Data_Pipeline_Onboarding/#Onboarding:

Event Timeline

mfossati changed the status of subtask T307899: Data pipeline grooming from Open to In Progress.May 9 2022, 4:33 PM
mfossati changed the task status from Open to In Progress.May 17 2022, 10:48 AM

Waiting for the patch to be merged before closing this

Note that code review comes from Data Platform.

@CBogen this is still open since code review & merge hasn't happened yet

Closing this: Data Platform has onboarded the pipeline.

NOTE: while the Airflow DAG was reviewed by Data Platform, the pyspark components were reviewed @Cparle and me.