The following tables will need to be loaded for the metrics planned as of now in T366044. The tables are:
- cx_translators
- cx_translations
- cx_corpora
The idea is to have Spark jobs (similar to sqoop) that will simply fetch and load to the destination tables, and have separate pipelines for necessary aggregations.