Problems with lemmas having LTR marks
Closed, ResolvedPublic

Description

Lemmas that include Unicode character U+200E (LTR Mark) still generate at least
2 problems:

(1) Such lemmas cannot be moved to their correct equivalent. Test case: create
http://en.wikipedia.org/w/index.php?title=LTRTest%E2%80%8E&action=edit and then
try to move it to http://en.wikipedia.org/w/index.php?title=LTRTest
MediaWiki incorrectly reports that source and target are the same lemma and
refuses to move the page.

(2) Such lemmas have an incorrect wgPagename and wgTitle listed in the HTML
source (i.e. the lemma variant without the trailing LTR Mark). This might be
connected to (1).
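
Both symptoms would be consistent with the LTR mark being dropped somewhere
during title handling. The following is only a minimal Python sketch of that
idea, not MediaWiki's actual code: normalize_title is a hypothetical helper,
and the assumption that the mark is stripped before the comparison is mine,
not something confirmed in this report.

```python
from urllib.parse import unquote

LRM = "\u200e"  # U+200E LEFT-TO-RIGHT MARK, percent-encoded as %E2%80%8E


def normalize_title(raw: str) -> str:
    """Hypothetical normalization: percent-decode, then drop LTR marks."""
    return unquote(raw).replace(LRM, "")


source = "LTRTest%E2%80%8E"  # title from the test case, with trailing LTR mark
target = "LTRTest"           # intended title

# The decoded strings really are different titles ...
print(unquote(source) == unquote(target))

# ... but they collapse to the same string once the mark is stripped, which
# would explain a "same lemma" error on move.
print(normalize_title(source) == normalize_title(target))
```

If the comparison is done on the stripped form while the stored title keeps
the mark, the page can exist under the marked name yet never be movable to the
clean one.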


Version: unspecified
Severity: normal
OS: Windows XP
Platform: PC

Details

Reference
bz7939

Event Timeline

bzimport raised the priority of this task to Medium. Nov 21 2014, 9:24 PM
bzimport set Reference to bz7939.
bzimport added a subscriber: Unknown Object (MLST).

Define "lemmas" in this context please.

Oops, sorry. "Lemmas" here means page names, though I have tested the "move"
bug only with article pages. For some reason, users on de.wikipedia quite
frequently create pages with incorrect trailing LTR marks. Today's examples
(from the delete log):

http://de.wikipedia.org/w/index.php?title=Special:Log/delete&page=CFL_Baureihe_850%E2%80%8E

http://de.wikipedia.org/w/index.php?title=Special:Log/delete&page=CP_Baureihe_1200%E2%80%8E

http://de.wikipedia.org/w/index.php?title=Special:Log/delete&page=J%C5%BD_642%E2%80%8E

http://de.wikipedia.org/w/index.php?title=Special:Log/delete&page=JZ_642%E2%80%8E

It would be better if one could move them to the right name, preserving the
edit history.
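
To illustrate the kind of cleanup meant here, a rough Python sketch that only
computes which pages would need moving and to which name. plan_moves is a
hypothetical helper, the titles are the decoded forms of the delete-log
examples above, and no actual move is performed.

```python
LRM = "\u200e"  # U+200E LEFT-TO-RIGHT MARK


def plan_moves(titles):
    """Return (old, new) pairs for titles that end in one or more LTR marks."""
    moves = []
    for title in titles:
        cleaned = title.rstrip(LRM)  # strip trailing marks only
        if cleaned != title:
            moves.append((title, cleaned))
    return moves


titles = [
    "CFL_Baureihe_850" + LRM,
    "CP_Baureihe_1200" + LRM,
    "J\u017d_642" + LRM,
    "JZ_642",  # already clean, so no move is planned for it
]

for old, new in plan_moves(titles):
    print(repr(old), "->", repr(new))
```

Printing with repr makes the otherwise invisible mark show up as an escape
sequence in the old title.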

*** This bug has been marked as a duplicate of 3696 ***