Parent task for ongoing work to create an event_sanitized database using iceberg tables.
Description
Description
Status | Subtype | Assigned | Task | ||
---|---|---|---|---|---|
Open | None | T311743 [Iceberg] Epic: Icebergify event_sanitized database | |||
Open | None | T311737 [Iceberg] Migrate event_sanitized_iceberg to event_sanitized | |||
Open | None | T225751 Consider renaming event and event_sanitized Hive databases | |||
Duplicate | None | T236794 Find a strategy to mitigate small-files handling for long-term kept events | |||
Open | JAllemandou | T311739 [Iceberg] Update Refine Sanitize to insert into Iceberg tables | |||
Resolved | xcollazo | T311738 [Iceberg] Debianize and install iceberg support for Spark, Presto, and optionally Hive | |||
Resolved | Ottomata | T311525 Upgrade to latest PrestoDB and enable iceberg support | |||
Declined | Stevemunene | T324011 SPIKE: Spin up a Test Trino instance (Evaluate Trino) | |||
Resolved | xcollazo | T335721 Add support for Iceberg in Spark | |||
Open | xcollazo | T336012 Add support for Iceberg to the Spark Docker Image |
Event Timeline
Comment Actions
Relevant slack discussion: https://app.slack.com/client/E012JBDTTHA/CSV483812
We could take advantage of this migration to delete some unused data and implement a smaller default retention policy than all-time.