Let section alignment consume section topics output
Open, Needs Triage, Public

Description

Section topics already has an optional filter that handles media links. It can replace one section alignment component, namely the script that extracts section images.
This is an opportunity to remove duplicate behavior and consolidate shared logic.

NOTE: this ticket accounts for one section alignment input, while the model that learns alignments is a separate task.

Tasks

  • merge the more fine-grained logic from section alignment's article_images.py into section topics' handle_media
  • remove article_images.py from the section alignment pipeline
  • remove the corresponding task in the section alignment DAG
  • make sure section alignment's recommendation.py takes section topics' image dataset as input in the section alignment DAG

Event Timeline

Hi @mfossati, one question: will you need research support for this task?

Hey @Miriam, no. We might ping @MunizaA in case we need help with the section alignment code.

Given that no research work is needed for now, I'm going to remove the task from our backlog. Please add the Research tag back if you need the team's help with a specific component of it. Thanks!