Mjolnir CLI helpers provide the HivePartition class for managing datasets. Additionally, they include helper methods for filtering input data based on date and time partitions.
This logic should be refactored into discolytics.
Mjolnir CLI helpers provide the HivePartition class for managing datasets. Additionally, they include helper methods for filtering input data based on date and time partitions.
This logic should be refactored into discolytics.
A few extra complications:
ebernhardson opened https://gitlab.wikimedia.org/repos/data-engineering/airflow-dags/-/merge_requests/1246
WIP: search: Simplify mjolnir dag
ebernhardson opened https://gitlab.wikimedia.org/repos/search-platform/discolytics/-/merge_requests/51
Draft: Add support to HivePartition for features needed by Mjolnir
ebernhardson opened https://gitlab.wikimedia.org/repos/search-platform/mjolnir/-/merge_requests/15
Draft: Migrate to discolytics HivePartition
dcausse merged https://gitlab.wikimedia.org/repos/search-platform/discolytics/-/merge_requests/51
Add support to HivePartition for features needed by Mjolnir
ebernhardson merged https://gitlab.wikimedia.org/repos/search-platform/mjolnir/-/merge_requests/15
Migrate to discolytics HivePartition
ebernhardson merged https://gitlab.wikimedia.org/repos/data-engineering/airflow-dags/-/merge_requests/1246
search: Simplify mjolnir dag
This has now been fully deployed, but we will want to make sure it runs to completion this week. It typically starts thursday at 00:00 and runs for 20-30h.