Wikimedia kraken OCR now does not have any Arabic models, Please add these Arabic models:
Description
Description
Event Timeline
Comment Actions
Hi @hubaishan
I would like to take the task. I have few questions regarding this task. I have downloaded the repo from (https://github.com/wikimedia/wikimedia-ocr)
- Kraken isn't integrated yet — should implementing the full Kraken engine (not just the Arabic models) be in scope here?
- Is Kraken already installed on the Toolforge server?
- Should both Zenodo models (all_arabic_scripts.mlmodel + arabic_best.mlmodel) be added, or just the Arabic-specific one?
- Should RTL mode be enabled by default for Arabic?
Thanks!