Author: pablo
Description:
In the wa: wikipedia I changed the linktrail to also
include uppercase A-Z, so that the trail can have upercase too, eg it is:
'linktrail' => '/^([a-zåâêîôûçéèA-ZÅÂÊÎÔÛÇÉÈ]+)(.*)$/sD',
since then, I saw a very strange behaviour,
try the following wikicode on a page in the wa:
wikipedia (just "preview" will do):
*[[aa]]<nowiki>bb</nowiki>
*[[aa]]bb
*[[aa]]<nowiki>BB</nowiki>
*[[aa]]BB
*[[aa]]<nowiki>êê</nowiki>
*[[aa]]êê
*[[aa]]<nowiki>ÊÊ</nowiki>
*[[aa]]ÊÊ
as you can see on the lines without the nowiki part
the trail is properly included in the clickable link,
both lower and upercase, and accented letters as well.
However, the lines with nowiki render very strangely,
instead of "aabb" with "aa" clickable and "bb" not,
it renders as
"aaNaodW29-nowiki49a5b674f36af500000001" with
"aaNaodW" clickable and the rest not.
note that if I put a character not in the linktrail between the "]" and the "<"
then it displays fine (but if the character is in Latin extension set A it
displays wrongly, actually it dispalys as 2 U+FFFD, one linked to the clickable
string, and one not...), see:
*[[aa]]x<nowiki>bb</nowiki> (ascii char, displays ok)
*[[aa]]ə<nowiki>bb</nowiki> (above U+00FF, displays ok)
*[[aa]]ö<nowiki>BB</nowiki> (between U+007F and U+00FF, displays wrong for that
char)
the problem with letters in U+007F-U+00FF don't
depend on the nowiki, so maybe there are two problems, see:
*[[aa]]öbb
Now, go to fr: wikipedia and test the same set of wikicode samples; the
rendering is ok, but trail with uppercase unaccented letters is ignored (trail
with uppercase accented letters is recognized); the linktrail in fr: is the
following:
'/^([a-zàâçéèêîôûäëïöüùÇÉÂÊÎÔÛÄËÏÖÜÀÈÙ]+)(.*)$/sD'
the main difference with the wa: linktrail version
is that I added tha "A-Z" as it doesn't make sense
to have upercase accented chars but not the unaccented letters (I also removed
several accented
letters unused in walloon and added "åÅ"):
'/^([a-zåâêîôûçéèA-ZÅÂÊÎÔÛÇÉÈ]+)(.*)$/sD'
could the "A-Z" be the reason of the rendering problem?
but I don't understand how it can be.
there is clearly a parsing problem just after the closing "]" when a nowiki or a
char in latin1 set
is involved
Version: 1.6.x
Severity: normal