User Story
As a Data Engineering SRE, I need to setup and install Apache Iceberg on the Analytics cluster
Why?
So that the team can begin migrating datasets to Iceberg
Success Criteria
- Apache Iceberg installed and team can being migration work
- Presto: https://phabricator.wikimedia.org/T311525 ( Presto 0.273.3 is compiled against, and the release tarball includes, Iceberg 0.13.1.)
- Spark
We want to use this as an opportunity to discourage users from using Hive MapReduce engine. We have chosen not to support iceberg in Hive MR, forcing users to use Spark or Presto to query iceberg tables.