[EPIC] Support "second-try" transliteration or wrong-keyboard searches (aka N.O.R.M.)
Open, High, Public

Assigned To

None

Authored By

	TJones
	Sep 19 2024, 3:46 PM

Description

Normalizing Orthographic Re-Mapper (aka N.O.R.M.)

Build out the necessary infrastructure to support various kinds of text-mapping "second-try" searches, including "DWIM"-style wrong-keyboard searches (e.g., accidentally typing Russian/Cyrillic text with the keyboard set to a US/Latin layout) and transliterated searches (e.g., typing Georgian or Hindi in Latin script).
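As a rough illustration of what a "second-try" search could look like, here is a minimal Python sketch. It assumes the second attempt is triggered when the original query returns too few results; the ticket does not specify the trigger. The names `second_try_search`, `search`, `remappers`, and `min_results` are hypothetical and do not correspond to any existing CirrusSearch API.

```python
from typing import Callable, List, Sequence, Tuple

def second_try_search(
    query: str,
    search: Callable[[str], List[str]],
    remappers: Sequence[Callable[[str], str]],
    min_results: int = 1,
) -> Tuple[str, List[str]]:
    """Run the query as typed; if it finds too little, retry with remapped text.

    Assumption: "too little" means fewer than min_results hits.
    Returns the query that was finally used and its results.
    """
    results = search(query)
    if len(results) >= min_results:
        return query, results
    for remap in remappers:
        candidate = remap(query)
        if candidate == query:
            continue  # this mapping changed nothing, so a retry is pointless
        retry = search(candidate)
        if len(retry) >= min_results:
            return candidate, retry
    # No remapping helped; fall back to the original (possibly empty) results.
    return query, results


# Toy stand-ins for a real search backend and a real text mapper.
toy_index = {"аристотель": ["Аристотель"]}

def toy_search(q: str) -> List[str]:
    return toy_index.get(q.lower(), [])

def toy_remap(q: str) -> str:
    return {"Fhbcnjntkm": "Аристотель"}.get(q, q)

used_query, hits = second_try_search("Fhbcnjntkm", toy_search, [toy_remap])
```

Passing the mappers in as a list is one way to let wrong-keyboard and transliteration mappings share the same retry logic, which is the kind of shared infrastructure this epic describes.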

A good place to start is replicating the Russian and Hebrew DWIM gadget's autocomplete results enhancement, and then extending that breadth-first to Georgian and Hindi transliteration in autocomplete, or depth-first into full-text results.

wrong keyboard tickets:

transliteration tickets:

Note: Naming is hard. DWIM ("do what I mean") is/was an on-wiki gadget that supported wrong-keyboard searches on Russian and Hebrew wikis. However, it sounds a little too much like DYM ("did you mean"), our query reformulation suggestion feature. We've used second-chance and second-try in the past to refer to a number of related approaches that are a superset of what is under consideration here. Hence "N.O.R.M.", the Normalizing Orthographic Re-Mapper, which would be a shared infrastructure that would allow us to convert both Fhbcnjntkm to Аристотель ("Aristotle") on Russian wikis and devanagari ka itihas to देवनागरी का इतिहास ("history of Devanagari") on Hindi wikis in a variety of useful ways.
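The wrong-keyboard example above (Fhbcnjntkm to Аристотель) can be reproduced with a simple key-position mapping. The sketch below assumes the standard US QWERTY and Russian ЙЦУКЕН layouts; the function name `remap_wrong_keyboard` is hypothetical and this is not how the DWIM gadget or any planned N.O.R.M. component is implemented.

```python
# Keys in the same physical positions on the two layouts (unshifted).
QWERTY = "qwertyuiop[]asdfghjkl;'zxcvbnm,.`"
JCUKEN = "йцукенгшщзхъфывапролджэячсмитьбюё"

# The same keys with Shift held on a US layout.
QWERTY_SHIFTED = 'QWERTYUIOP{}ASDFGHJKL:"ZXCVBNM<>~'

LOWER = dict(zip(QWERTY, JCUKEN))
UPPER = dict(zip(QWERTY_SHIFTED, JCUKEN.upper()))
LATIN_TO_CYRILLIC = str.maketrans({**LOWER, **UPPER})

def remap_wrong_keyboard(text: str) -> str:
    """Map text typed on a US layout to what the same keys give in Russian."""
    return text.translate(LATIN_TO_CYRILLIC)

print(remap_wrong_keyboard("Fhbcnjntkm"))  # Аристотель
```

Note that this mapping also rewrites punctuation such as `,` and `.` (to б and ю), so a real implementation would need to decide when a query is plausibly a wrong-keyboard query before applying it.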

Previous on-wiki write-ups:

Typing on the Wrong Keyboard—Russian and English
DWIM as API
Hindi Wikipedia Zero Results Queries (includes unsuccessful transliterated queries)

Related Objects

Status	Subtype	Assigned	Task
Open		None	T375215 [EPIC] Support "second-try" transliteration or wrong-keyboard searches (aka N.O.R.M.)
Open		None	T138958 Detect "wrong keyboard" queries for Russian/American keyboards on EN/RU Wikipedias
Resolved		TJones	T213931 Update TextCat with wrong-keyboard models
Declined		TJones	T213935 Revert changes to TextCat that add dependency on autoload.php
Resolved		Smalyshev	T213936 Deploy new version of TextCat
Resolved		TJones	T216083 Update required version of TextCat in CirrusSearch
Open		None	T155104 Detect "wrong keyboard" queries for Hebrew/American keyboards on EN/HE Wikipedias
Open	Feature	None	T127003 Transliterate Latin or Cyrillic script searches to Georgian script on Georgian wikis
Open	Feature	None	T297761 Create a Latin-to-Devanagari transliteration second-chance search for Hindi wikis

Event Timeline

TJones created this task. Sep 19 2024, 3:46 PM

Restricted Application added a subscriber: Aklapper. Sep 19 2024, 3:46 PM

TJones moved this task from needs triage to [epic] on the Discovery-Search board. Sep 19 2024, 3:46 PM

TJones added subtasks: T138958: Detect "wrong keyboard" queries for Russian/American keyboards on EN/RU Wikipedias, T155104: Detect "wrong keyboard" queries for Hebrew/American keyboards on EN/HE Wikipedias, T127003: Transliterate Latin or Cyrillic script searches to Georgian script on Georgian wikis, T297761: Create a Latin-to-Devanagari transliteration second-chance search for Hindi wikis.

TJones renamed this task from [EPIC] Create infrastructure to support "second-try" transliteration or wrong-keyboard searches (aka N.O.R.M.) to [EPIC] Support "second-try" transliteration or wrong-keyboard searches (aka N.O.R.M.). Sep 19 2024, 3:49 PM

TJones triaged this task as High priority.

TJones updated the task description. Sep 19 2024, 4:30 PM
