Hello all, there was been concern around the new Transcribe Text [Extract Text] UI in Wikisource and we wanted to address concerns and document all of the options for moving forward. This phabricator ticket will outline rationale for all so that you each have a view into options for next steps. Please vote on your preference after you consider the pros and cons.
As we understand them, the concerns around the new UI include:
- Breaking workflows, the new Extract Text button placement is outside of the toolbar; which is a place established contributors associate with editing
- Inability to turn off the new UI
- Since the new Extract Text button affords all contributors the option to Transcribe Text when they proofread documents, folks want the ability to turn it off.
Before we lay out all the options, I wanted to restate underlying goals of the improvements:
- Improved speed and efficiency of underlying engines
- Accessibility for new Wikisource contributors, allowing new contributors to understand what OCR is and how to use it in their proofreading
- Empowering all users with the ability to transcribe without having to know about it as a special gadget -- removing the technical learning curve and "insider" knowledge about transcription tools
**OPTION A
**
- Transcribe Text button overlayed on document image that is to be transcribed
{F34531813}
Pros:
- Less technical copy than OCR (which is an acronym foreign to non-technical contributors)
- A 1-click transcription button (defaults to Tesseract, but users can change preference)
- Button overlayed on document, making it clear that this will apply to the document image, not the proofread box
Cons
- Breaks existing workflows
- Inability to be turned off
- Could potentially block document text (edge case)
[[ https://www.figma.com/file/vmaNXY952auqX6nw2PCh0b/Wikisource-OCR?node-id=593%3A258 | **OPTION B
]]**
- Transcribe Text button inside the toolbar
** ALTERNATIVE OPTION **
- Leave our changes in option A as is, and give people the ability to turn off the new UI in user preferences
** Open questions **
Since we believe there is value in giving everyone the ability to transcribe, we should remove duplicate gadgets to remove tech debt. Shoul