**//As a data scientist, I need wmfdata to access MariaDB replicas when it is used in a notebook executed on the cluster so that I can schedule the notebook as a data pipeline through Airflow.
//**
In the Product Analytics ETL modernization sync-up on 26 June 2023 ([[ https://docs.google.com/document/d/1sXveksUzjoBfwSeoK6Q07vHdS2MgmsmcXeRG9YQ3MYM/edit | notes ]]) we identified that in the [[ https://github.com/wikimedia/wmfdata-python/blob/2ae3b559898f40d84493d475c0e2a83969b65985/wmfdata/mariadb.py | current wmfdata-python MariaDB module ]]:
- It checks POSIX group membership to determine which cnf to retrieve username & password from for connecting
- It uses the `analytics-mysql` executable to determine which host & port to use for connecting (after parsing output)
**To make it usable on the cluster**:
- [] Need a way of specifying which cnf to use (e.g. if we store the mysql password on HDFS and need to read it as `analytics-product` system user)
- [] Need a way of retrieving host & port info