Goal:
Migrate the wikidata_json_entity jobs to Airflow:
Job Details:
Input | Processing | Output |
Dumps | Spark | Hive Table |
Success Criteria:
- Have the 1 Weekly Job Migrated (14 Days)
NOTE: this job is extremely similar to the new job @Snwachukwu is writing, and also to mediawiki/wikitext 2 jobs - We should take advantage of that and build a DAG factory for dumps processing.