This will need to be updated to be compatible with the categorylinks schema changes.
Definition of done:
- cl_target_id and cl_collation_id added to Sqoop's categorylinks table definition in python/refinery/sqoop.py
- Columns cl_to and cl_collation dropped from Sqoop's categorylinks table definition
- Table collation added to Sqoop
- Table collation added to Sqoop's Puppet script invocation definition
- Table categorylinks CREATE HQL script updated with two new columns added
- Table categorylinks CREATE HQL script updated with two columns dropped
- Table wmf_raw.wikimedia_categorylinks updated in Hive with two new columns (ALTER TABLE ADD COLUMNS...)
- Table wmf_raw.wikimedia_collation created in Hive
- Table wmf_raw.wikimedia_collation added to mediawiki history load DAG
- Table wmf_raw.wikimedia_collation added to bin/refinery-drop-mediawiki-snapshots script
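For the Hive ALTER step above, a minimal sketch of generating the statement might look like the following. The helper name `build_add_columns_hql`, the `bigint` column types, and the comment strings are assumptions for illustration only; they are not taken from the refinery code and should be checked against the actual MediaWiki schema before use.

```python
def build_add_columns_hql(table, columns):
    """Build a Hive ALTER TABLE ... ADD COLUMNS statement.

    `columns` is a list of (name, hive_type, comment) tuples.
    Hypothetical helper, not part of the refinery code base.
    """
    if not columns:
        raise ValueError("at least one column is required")
    column_defs = ",\n".join(
        f"    `{name}` {hive_type} COMMENT '{comment}'"
        for name, hive_type, comment in columns
    )
    return f"ALTER TABLE {table} ADD COLUMNS (\n{column_defs}\n);"


# Column names come from the checklist; types and comments are assumed.
NEW_COLUMNS = [
    ("cl_target_id", "bigint", "Assumed: id of the link target"),
    ("cl_collation_id", "bigint", "Assumed: id of the collation row"),
]

hql = build_add_columns_hql("wmf_raw.wikimedia_categorylinks", NEW_COLUMNS)
print(hql)
```

Hive's ADD COLUMNS appends the new columns after the existing ones (and before any partition columns), so the updated CREATE script would most likely need to list them last to keep the table definition and the altered table in the same column order.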