Add kraken OCR engine to Wikimedia OCR
Open, Needs TriagePublicFeature
Actions

Assigned To

None

Authored By

	sweil
	Aug 28 2023, 8:27 AM

Description

Wikimedia OCR currently uses the free Tesseract OCR engine (which only supports printed text) and the commercial Google and Transkribus OCR engines.

The free kraken OCR engine supports printed and handwritten text. Like Tesseract, kraken is used in the OCR-D project for OCR of historic prints. It is much slower than Tesseract, but sometimes gets better results and would be the only available non-commercial OCR engine for handwritings.

I suggest to start with my free models for German print and German handwriting (they are not limited to German, but can be used with other languages which use Latin script as well), but there exist many more models, for example for Arabic or Hebrew script.

I already have implemented a prototype and sent a draft pull request for Wikimedia OCR.

Related Objects
Search...

Status	Subtype	Assigned	Task
Open	Feature	None	T345055 Add kraken OCR engine to Wikimedia OCR
Resolved		sweil	T346413 Install Kraken OCR (and web service) on a new Wikisource VPS
Resolved		None	T346854 Increase quota for wikisource project (for new OCR service)

Event Timeline

sweil created this task.Aug 28 2023, 8:27 AM

Restricted Application added a subscriber: Aklapper. · View Herald TranscriptAug 28 2023, 8:28 AM

sweil updated the task description. (Show Details)Aug 28 2023, 8:28 AM

sweil updated the task description. (Show Details)

MusikAnimal removed a project: Community-Tech.Aug 30 2023, 2:08 PM

Restricted Application added a project: Community-Tech. · View Herald TranscriptAug 30 2023, 2:08 PM

MusikAnimal removed a project: Community-Tech.Aug 30 2023, 2:08 PM

Samwilson moved this task from Backlog to Kraken on the Wikimedia OCR board.Sep 15 2023, 5:38 AM

The current implementation offers 3 different models for the text recognition.
Is there a need for non Latin scripts as well? Which ones? Arabic? Hebrew? Others?

Kraken also supports different models for the segmentation (region and line detection).
The segmentation model should be selectable from the web interface and the API, too.

In T345055#9174507, @sweil wrote:

Kraken also supports different models for the segmentation (region and line detection).
The segmentation model should be selectable from the web interface and the API, too.

The implementation now supports different segmentation models for kraken, too. Currently either default or ubma_segmentation can be selected.

sweil closed subtask T346413: Install Kraken OCR (and web service) on a new Wikisource VPS as Resolved.Sep 28 2023, 11:37 AM

Add kraken OCR engine to Wikimedia OCROpen, Needs TriagePublicFeatureActions

Description

Related ObjectsSearch...

Event Timeline

Add kraken OCR engine to Wikimedia OCR
Open, Needs TriagePublicFeature
Actions

Related Objects
Search...