[Epic] Add Transkribus support to Wikimedia OCR
Open, Needs Triage · Public · Feature

Description

Transkribus is a system for structural and text analysis of digitised text documents. The Wikisource Loves Manuscripts project aims to integrate Transkribus OCR with the Wikisource proofreading workflow, initially for Balinese Wikisource and later for more languages. This task is the Epic for tracking the project.

The work has two main parts:

  • Add Transkribus to the OCR tool at https://ocr.wmcloud.org
  • Add Transkribus to the on-wiki UI on Wikisources (in the Wikisource extension)

Event Timeline

Restricted Application added a subscriber: Aklapper.