https://bn.wikipedia.org/wiki/%E0%A6%AC%E0%A6%BF%E0%A6%B6%E0%A7%87%E0%A6%B7:%E0%A6%AC%E0%A6%BF%E0%A6%B7%E0%A7%9F%E0%A6%AC%E0%A6%B8%E0%A7%8D%E0%A6%A4%E0%A7%81_%E0%A6%85%E0%A6%A8%E0%A7%81%E0%A6%AC%E0%A6%BE%E0%A6%A6 shows a message saying that the special page does not exist. The same happens for Special:CXStats.
- mediawiki/extensions/ContentTranslation (master): Normalize special page aliases (bn) to a form MediaWiki can understand
- mediawiki/extensions/ContentTranslation (master): Revert "ContentTranslation.alias.php translations for Bengali"
Copying from chat to permanent storage.
Basically, the UtfNormal\Validator::cleanUp normalization call forces everything in MediaWiki into NFC. The exception is content from i18n files, which is trusted and not normalized. When we got most of these translations via translatewiki.net, the normalization was applied automatically there, but manual submissions bypass it.
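The mismatch is easy to reproduce. Here is a minimal sketch, assuming the standalone wikimedia/utfnormal Composer package (the library that provides UtfNormal\Validator); inside MediaWiki the class is already loaded:

```php
<?php
// Minimal sketch, assuming wikimedia/utfnormal is installed via Composer.
use UtfNormal\Validator;

require_once __DIR__ . '/vendor/autoload.php';

$precomposed = "\u{09DF}";         // য় as one code point, as in the i18n file
$decomposed  = "\u{09AF}\u{09BC}"; // য followed by nukta

// U+09DF is a Unicode composition exclusion, so NFC *decomposes* it.
// cleanUp() is what MediaWiki applies to user input such as page titles:
var_dump( Validator::cleanUp( $precomposed ) === $decomposed ); // bool(true)

// The raw i18n value is trusted and never normalized, so the byte-level
// comparison with the normalized title fails:
var_dump( $precomposed === $decomposed ); // bool(false)
```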
- one possible fix is to call $wgContLang->normalize() on input from PHP files (see the first sketch after this list)
- a bigger question is whether NFC is the right thing for MediaWiki to use
- apply mine or Amir's patch in the short term (remember to rebuild the l10n cache, e.g. with maintenance/rebuildLocalisationCache.php)
- consider adding a safeguard for input from i18n files to avoid or catch this type of error
- could also add a unit test that verifies all i18n file contents (see the second sketch after this list)... maybe a better tradeoff than slowing down runtime performance
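Here is a minimal sketch of the first option above, normalizing aliases as they are loaded from a PHP file. The normalizeAliases() helper and where it would be hooked in are hypothetical; only $wgContLang->normalize() is existing MediaWiki API:

```php
<?php
// Minimal sketch: NFC-normalize alias data read from a *.alias.php file.
// normalizeAliases() is a hypothetical helper.

/**
 * @param array $aliases Map of canonical special page name => list of
 *  localized aliases, as found in e.g. $specialPageAliases['bn'].
 * @return array The same structure with every alias normalized.
 */
function normalizeAliases( array $aliases ) {
	global $wgContLang;
	foreach ( $aliases as $canonical => $localized ) {
		$aliases[$canonical] = array_map(
			[ $wgContLang, 'normalize' ],
			$localized
		);
	}
	return $aliases;
}

// Hypothetical usage at load time:
// $specialPageAliases['bn'] = normalizeAliases( $specialPageAliases['bn'] );
```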
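And a sketch of the unit test idea, assuming a plain PHPUnit structure test can require the alias file directly; the class name and relative path are hypothetical:

```php
<?php
// Sketch of a PHPUnit structure test: every alias shipped in the alias
// file must already equal its NFC-normalized form, so that loading it
// unnormalized is safe.

use PHPUnit\Framework\TestCase;
use UtfNormal\Validator;

class AliasNormalizationTest extends TestCase {

	public function testAliasesAreNfcNormalized() {
		$specialPageAliases = [];
		require __DIR__ . '/../ContentTranslation.alias.php';

		foreach ( $specialPageAliases as $lang => $pages ) {
			foreach ( $pages as $canonical => $aliases ) {
				foreach ( $aliases as $alias ) {
					$this->assertSame(
						Validator::cleanUp( $alias ),
						$alias,
						"Alias '$alias' for $canonical ($lang) is not in NFC"
					);
				}
			}
		}
	}
}
```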
Additional information: MediaWiki interpreted the character য় (U+09DF) as য (U+09AF) followed by nukta (U+09BC). This is because U+09DF is on the Unicode composition exclusion list, so NFC normalization decomposes it instead of keeping the precomposed form.
Using Amir's patch will revert the translation, but the problem will recur if the strings are translated manually again.