Documentation: Wikidata Concepts Monitor ETL job.
The 'Wikidata Concepts Monitor ETL' job was created by Goran Milovanovic a few years ago and produces results that were originally visible here (the site is currently broken).
The job is scheduled to run regularly on our Hadoop cluster and uses Spark with R.
We are currently migrating all our jobs from Spark 2 to Spark 3, and this job still uses Spark 2.
In this task we should:
- Identify the components that need to be migrated to Spark 3
- Perform the migration
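
As a starting point for the first step, the job's source tree could be scanned for references to Spark 2. The following is only a minimal sketch: the search patterns, the file extensions, and the function name find_spark2_references are assumptions made for illustration and are not taken from the job's actual code, so they would need adjusting once the real repository layout is known.

```python
import re
from pathlib import Path

# Hypothetical patterns that may indicate a Spark 2 dependency.
# These are guesses; adjust them to whatever the real codebase uses.
SPARK2_PATTERNS = [
    re.compile(r"spark2", re.IGNORECASE),   # e.g. a "spark2"-prefixed command or path
    re.compile(r"spark[-_/. ]?2\.\d"),      # e.g. "spark-2.4" in a path or version string
]

# File extensions to scan (assumed: R, Python and shell scripts).
EXTENSIONS = {".R", ".r", ".py", ".sh"}


def find_spark2_references(root):
    """Return (path, line number, stripped line) for each line matching a pattern."""
    hits = []
    for path in sorted(Path(root).rglob("*")):
        if not path.is_file() or path.suffix not in EXTENSIONS:
            continue
        text = path.read_text(encoding="utf-8", errors="replace")
        for lineno, line in enumerate(text.splitlines(), start=1):
            if any(p.search(line) for p in SPARK2_PATTERNS):
                hits.append((str(path), lineno, line.strip()))
    return hits
```

Calling find_spark2_references on a checkout of the job's repository returns a list of candidate locations to review by hand. A text search like this only surfaces explicit mentions; it will not detect API calls whose behaviour changed between Spark 2 and Spark 3.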