Background/Goal
//This task contains some notes/WIP on behaviour experienced with Flink wmf images. Needs grooming/investigation to turn it into a feature request.
As a developer I would like to run pytest based CI jobs atop the Flink wmf image.
Current status
I came across some potential issues with PYTHONPATH (pyflink cannot be found) and missing python deps (protobuf, grpc) when trying to run pytest inside a flink wmf container.
Manually installing the python deps (pip install protobugf), and setting up a Python path along the lines of https://gitlab.wikimedia.org/-/snippets/55 fixes part of these issues.
When pyflink is provided (pip install pyflink or via eventutulities-python[provided]) pytest executes correctly.
I was also unable to replicate with a vanilla python dist (might be dealing with stale envs on my end though).
Note: a similar issue with missing deps (protobuf) has been reported when running Flink datastream examples via Helm chart. The issue was not encountered with other examples (datagen) or mediawiki-stream-enrichment. Both cases might be pulling in transient deps.
Key Tasks/Dependencies
Acceptance Criteria
- pyflink examples and tests work with base flink image