Maniphest T191535

Evaluate need for Myanmar Zawgyi encoding detection/transliteration in search
Closed, Declined · Public

Assigned To

None

Authored By

	TJones
	Apr 5 2018, 2:54 PM

Description

Before considering some form of Zawgyi detection and transliteration for Myanmar-language wikis, we should:

- get a sense of the frequency of Zawgyi-encoded queries
- get a sense of the accuracy of Google’s detection library on short (i.e., query-length) strings
- evaluate available transliteration tools and transliteration complexity
- maybe evaluate other detection tools that would be more convenient to implement (like TextCat)
- evaluate detection and transliteration on non-Myanmar text, too
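
As a rough illustration of the first step (estimating how often queries are Zawgyi-encoded), here is a minimal sketch. It is not Google's detection library and not TextCat; it is a crude heuristic written for this note. The function names `looks_like_zawgyi` and `zawgyi_share` are hypothetical, and both rules are assumptions: that a token starting with U+1031 or U+103B suggests Zawgyi's visual ordering, and that code points U+1060 to U+1097 suggest Zawgyi glyph variants. The second rule would misfire on text in languages that legitimately use that range (e.g., Shan or Karen), which is one reason to evaluate on non-Myanmar text as well.

```python
import re

# Assumption: Zawgyi stores U+1031 and U+103B in visual order, *before* the
# consonant, while standard Unicode stores them after it. A token that starts
# with either one is therefore treated as a Zawgyi signal.
_LEADING_DEPENDENT = re.compile(r"(?:^|\s)[\u1031\u103b]")

# Assumption: Zawgyi reuses U+1060..U+1097 for Burmese glyph variants. In
# standard Unicode this range belongs to other languages (Mon, Karen, Shan),
# so this rule is only plausible for Burmese-language queries.
_EXTENDED_RANGE = re.compile(r"[\u1060-\u1097]")

# Any character from the Myanmar block.
_MYANMAR_BLOCK = re.compile(r"[\u1000-\u109f]")


def looks_like_zawgyi(query: str) -> bool:
    """Return True if the query matches either heuristic Zawgyi signal."""
    return bool(_LEADING_DEPENDENT.search(query) or _EXTENDED_RANGE.search(query))


def zawgyi_share(queries) -> float:
    """Fraction of Myanmar-script queries that look Zawgyi-encoded."""
    myanmar = [q for q in queries if _MYANMAR_BLOCK.search(q)]
    if not myanmar:
        return 0.0
    return sum(looks_like_zawgyi(q) for q in myanmar) / len(myanmar)


# The word "Myanmar" in standard Unicode order and in Zawgyi order.
unicode_query = "\u1019\u103c\u1014\u103a\u1019\u102c"
zawgyi_query = "\u103b\u1019\u1014\u1039\u1019\u102c"
share = zawgyi_share([unicode_query, zawgyi_query, "hello"])
```

A heuristic like this would only give a first estimate of frequency; measuring a real detector's accuracy on query-length strings would still need a labeled sample.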

I've also written up more details, adapted from a previous email conversation about this, in my notes on MediaWiki.

Event Timeline

TJones created this task. Apr 5 2018, 2:54 PM

Restricted Application added a subscriber: Aklapper. Apr 5 2018, 2:54 PM

TJones added a project: Discovery-ARCHIVED. Apr 5 2018, 2:57 PM

TJones edited projects, added Discovery-Search; removed Discovery-ARCHIVED.

• EBjune triaged this task as Low priority. Apr 5 2018, 5:09 PM

• EBjune moved this task from needs triage to search-icebox on the Discovery-Search board.

• santhosh subscribed. Apr 9 2018, 4:43 AM

TJones moved this task from search-icebox to Language Stuff on the Discovery-Search board. Jan 29 2019, 7:16 PM

Ninjastrikers subscribed. May 17 2019, 6:03 PM

Closing out low/lowest priority tasks that are over 6 months old and have had no activity within the last 6 months, in order to clean out the backlog of tickets we will not be addressing in the near term. Please feel free to reopen if you think a ticket is important, but bear in mind that given current priorities and resourcing, the Search team is unlikely to pick up these tasks for the indefinite future. We hope that the requested changes have either been addressed or made irrelevant by work the team has done or is doing (e.g., upgrading Elasticsearch to a newer version will solve various ES-related problems), or will be subsumed by future work in a more generalized way.

Re-opening tasks and removing from team workboard per IRC feedback given yesterday and discussion with MPham.

MPhamWMF added a project: Discovery-ARCHIVED. Aug 2 2022, 1:38 PM

Closing. This might be done as part of a focused improvement to specific language analysis. Documentation is on the wiki (see the link in the description) if we need access to it.
