
[Investigation]Create References Parser PoC [Timebox 10 days]
Closed, Resolved · Public · 5 Estimated Story Points

Description

As a WME engineer, I want to validate the selected approach and understand its complexities in more detail. To do so, I want to build a first attempt at a references parser.

TODO

  • Agree with Product on the cardinality of input pages and on the edge cases to cover
  • Parser should work on the 6 SC languages
  • Extract cite web citations and references
  • Make it configurable per language: both the input and the configuration are variable
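
As a rough illustration of the last two items, here is a minimal sketch of a per-language configurable extractor. It assumes the input is raw wikitext (the task does not fix the input format) and that each language maps to a list of local template names equivalent to cite web. The `CITE_WEB_TEMPLATES` mapping, the chosen languages, and all function names are hypothetical and only for illustration; they are not the PoC's actual design.

```python
# Hypothetical per-language configuration: local names of the "cite web"
# template. Illustrative only, not a vetted list for the 6 SC languages.
CITE_WEB_TEMPLATES = {
    "en": ["cite web"],
    "es": ["cita web"],
}


def find_templates(wikitext):
    """Return the inner text of every top-level {{...}} block, tracking nesting."""
    bodies, depth, start, i = [], 0, 0, 0
    while i < len(wikitext) - 1:
        pair = wikitext[i:i + 2]
        if pair == "{{":
            if depth == 0:
                start = i + 2
            depth += 1
            i += 2
        elif pair == "}}" and depth > 0:
            depth -= 1
            if depth == 0:
                bodies.append(wikitext[start:i])
            i += 2
        else:
            i += 1
    return bodies


def split_params(body):
    """Split a template body on '|' while ignoring pipes inside nested {{ }} or [[ ]]."""
    parts, current, depth, i = [], [], 0, 0
    while i < len(body):
        pair = body[i:i + 2]
        if pair in ("{{", "[["):
            depth += 1
            current.append(pair)
            i += 2
        elif pair in ("}}", "]]") and depth > 0:
            depth -= 1
            current.append(pair)
            i += 2
        elif body[i] == "|" and depth == 0:
            parts.append("".join(current))
            current = []
            i += 1
        else:
            current.append(body[i])
            i += 1
    parts.append("".join(current))
    return parts


def extract_cite_web(wikitext, lang):
    """Return one dict of named parameters per cite web template found for `lang`."""
    names = {name.lower() for name in CITE_WEB_TEMPLATES.get(lang, [])}
    citations = []
    for body in find_templates(wikitext):
        parts = split_params(body)
        template_name = parts[0].strip().lower().replace("_", " ")
        if template_name not in names:
            continue
        params = {}
        for part in parts[1:]:
            key, sep, value = part.partition("=")
            if sep:  # positional (unnamed) parameters are skipped in this sketch
                params[key.strip()] = value.strip()
        citations.append(params)
    return citations
```

This sketch only sees templates at the top level of the page text, so a cite web template nested inside another template is missed. That is one kind of edge case the PoC would need to document per language and template.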

Acceptance Criteria

  • Parser PoC that extracts at least cite web across languages and exports the results to a file
  • Includes what is parsed successfully and what is not, per article
  • Includes what is detected and what is not, per article
  • Document edge cases by language and template
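
One possible shape for the per-article output is sketched below. It assumes a reference counts as "detected" when a known template name was matched, and as "parsed" when a set of required fields is present. Both definitions, the `REQUIRED_FIELDS` tuple, the record layout, and the JSON Lines export format are assumptions made for illustration; the task does not specify any of them.

```python
import json

# Assumption: a citation counts as parsed only if these fields are non-empty.
# Real field names differ by language and template.
REQUIRED_FIELDS = ("url", "title")


def summarize_article(title, lang, references):
    """Classify each reference of one article as detected/undetected and parsed/unparsed.

    `references` is a list of dicts shaped like
    {"template": <matched template name or None>, "params": {...}}.
    """
    report = {
        "article": title,
        "lang": lang,
        "detected": [],
        "undetected": [],
        "parsed": [],
        "unparsed": [],
    }
    for index, ref in enumerate(references):
        if ref.get("template") is None:
            report["undetected"].append(index)
            continue
        report["detected"].append(index)
        params = ref.get("params", {})
        missing = [field for field in REQUIRED_FIELDS if not params.get(field)]
        if missing:
            report["unparsed"].append({"ref": index, "missing": missing})
        else:
            report["parsed"].append(index)
    return report


def export_reports(reports, path):
    """Write one JSON object per article, one per line (JSON Lines)."""
    with open(path, "w", encoding="utf-8") as handle:
        for report in reports:
            handle.write(json.dumps(report, ensure_ascii=False) + "\n")
```

Keeping the indices of undetected and unparsed references, rather than only counts, makes it easier to trace each miss back to the article and list it as an edge case.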

Event Timeline

REsquito-WMF renamed this task from Create References HTML Parser PoC to [Investigation]Create References HTML Parser PoC [Time slot for 10 days/ weeks]. Nov 11 2024, 2:28 PM
REsquito-WMF renamed this task from [Investigation]Create References HTML Parser PoC [Time slot for 10 days/ weeks] to [Investigation]Create References Parser PoC [Time slot for 10 days/ weeks]. Nov 11 2024, 2:30 PM
REsquito-WMF updated the task description.
REsquito-WMF set the point value for this task to 5. Nov 11 2024, 2:45 PM
JArguello-WMF raised the priority of this task from Medium to High. Nov 11 2024, 3:25 PM
JArguello-WMF renamed this task from [Investigation]Create References Parser PoC [Time slot for 10 days/ weeks] to [Investigation]Create References Parser PoC [Timebox 10 days]. Nov 13 2024, 3:15 PM