This should be a quick spike to understand how hard it would be to replicate MediaWiki Stream Enrichment with PyFlink. The goals are to:
- Run a read-only Python implementation of MediaWiki Stream Enrichment on YARN (https://gitlab.wikimedia.org/-/snippets/42).
- Collect resource allocation and latency metrics for a long-running PyFlink job.
- Help inform integration paths with the upcoming Flink catalog (https://phabricator.wikimedia.org/T322022).
- Help collect requirements for https://phabricator.wikimedia.org/T322125