Section-Topics already has an optional filter that handles media links. It can replace one section alignment component, namely the script that extracts section images.
This is an opportunity to remove duplicate behavior and consolidate shared logic.
Tasks
- merge more fine-grained logic from section alignment's article_images.py into section topics' handle_media
- remove article_images.py from the section alignment pipeline
- remove the corresponding task in the section alignment DAG
- make sure section alignment's recommendation.py takes section topics' image dataset as input in the section alignment DAG
- update tests
- merge section-alignment suggestions into section topics
- remove the section-alignment DAG
- add section alignment suggestions to section topics' DAG
- update image suggestions DAG