Acceptance Criteria:
- Investigate whether we can do the following: format text (e.g. add appropriate templates, headings, text formatting, proper formatting for poems, etc.) using hOCR/ALTO
We also looked into this in T286347, and it was too hard! :(
There is some stuff that we can do, though: see T250185 (Make Wikisource-OCR handle paragraphs better)
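For the paragraph part, here is a rough sketch of what reading paragraph structure out of hOCR could look like. It relies only on the standard hOCR class name `ocr_par` and on Python's built-in `html.parser`. The names `HocrParagraphs` and `hocr_to_wikitext` are made up for illustration. This is not how Wikisource-OCR currently works, and it assumes the engine emits well-formed, XHTML-style hOCR.

```python
from html.parser import HTMLParser

# Elements that never have a closing tag; ignored for depth tracking.
VOID_TAGS = {"br", "hr", "img", "meta", "link", "input"}


class HocrParagraphs(HTMLParser):
    """Collect the text of each hOCR paragraph (elements with class "ocr_par").

    Hypothetical helper for illustration only.
    """

    def __init__(self):
        super().__init__()
        self.paragraphs = []  # finished paragraphs, in document order
        self._words = None    # words of the paragraph being read, or None
        self._depth = 0       # nesting depth inside the current paragraph

    def handle_starttag(self, tag, attrs):
        if tag in VOID_TAGS:
            return
        classes = (dict(attrs).get("class") or "").split()
        if self._words is None:
            if "ocr_par" in classes:
                # Entering a paragraph: start collecting its words.
                self._words = []
                self._depth = 1
        else:
            self._depth += 1

    def handle_endtag(self, tag):
        if tag in VOID_TAGS or self._words is None:
            return
        self._depth -= 1
        if self._depth == 0:
            # The paragraph element itself was closed.
            if self._words:
                self.paragraphs.append(" ".join(self._words))
            self._words = None

    def handle_data(self, data):
        if self._words is not None:
            self._words.extend(data.split())


def hocr_to_wikitext(hocr):
    """Return the paragraphs separated by blank lines, as wikitext expects."""
    parser = HocrParagraphs()
    parser.feed(hocr)
    parser.close()
    return "\n\n".join(parser.paragraphs)
```

Headings, templates and poem formatting would need more than this (hOCR has no reliable markers for them in typical engine output), which is probably where the "too hard" part comes in.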