
wgRelevantPageName is missing on Chinese Wikipedia
Closed, ResolvedPublic

Description

https://www.mediawiki.org/wiki/Talk:Sandbox?action=history has wgRelevantPageName, but https://zh.wikipedia.org/wiki/Wikipedia_talk:Flow_tests?action=history does not.

It's a PHP issue (it's in the HTML source for one, but not the other).

This breaks the history page (at least) due to VE requiring it.

Event Timeline

WTF, there is a whole lot of stuff missing from view-source:https://zh.wikipedia.org/wiki/Wikipedia_talk:Flow_tests?action=history. The entire blob of exported wg variables isn't there.

Change 249022 had a related patch set uploaded (by Catrope):
DesktopArticleTarget.init: Tolerate missing wgRelevantPageName

https://gerrit.wikimedia.org/r/249022

Change 249040 had a related patch set uploaded (by Krinkle):
DesktopArticleTarget.init: Tolerate missing wgRelevantPageName

https://gerrit.wikimedia.org/r/249040

Change 249022 merged by jenkins-bot:
DesktopArticleTarget.init: Tolerate missing wgRelevantPageName

https://gerrit.wikimedia.org/r/249022

Change 249040 merged by jenkins-bot:
DesktopArticleTarget.init: Tolerate missing wgRelevantPageName

https://gerrit.wikimedia.org/r/249040

This appears to happen because we use non-multibyte-aware truncation for summaries (I think this is a revision summary, not a topic summary, but I'm not 100% sure), so we end up truncating the summary halfway through a Unicode codepoint and appending "..." to it. This sometimes produces invalid Unicode sequences.
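The failure mode above is easy to reproduce. A minimal sketch (in Python, since the affected code is PHP; the summary string is made up for illustration) of how byte-oriented truncation corrupts multibyte text:

```python
# Each CJK character is 3 bytes in UTF-8, so any byte offset that is not a
# multiple of 3 lands inside a codepoint.
summary = "历史记录摘要"            # hypothetical revision summary
raw = summary.encode("utf-8")       # 18 bytes

truncated = raw[:7] + b"..."        # naive byte truncation cuts through 记

try:
    truncated.decode("utf-8")
    valid = True
except UnicodeDecodeError:
    valid = False

print(valid)  # False: the truncated bytes are no longer valid UTF-8
```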

This invalid Unicode sequence ends up in the wgFlowData variable that's exported to JavaScript, and causes json_encode() to barf. So json_encode( $this->getJSVars() ) returns false, which turns into an empty string when other things are concatenated to it, so the result is that the entire JSVars blob is just dropped on the floor.
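The silent-disappearance part is the nasty bit: PHP's json_encode() returns false rather than throwing, and false coerces to "" under string concatenation. A rough Python sketch of that behavior (php_style_json_encode is a hypothetical stand-in, not the real function):

```python
import json

def php_style_json_encode(value):
    # Mimic PHP's json_encode(): return False (not an exception) when the
    # input contains invalid UTF-8 bytes.
    try:
        if isinstance(value, bytes):
            value = value.decode("utf-8")
        return json.dumps(value)
    except UnicodeDecodeError:
        return False

bad = "历史".encode("utf-8")[:4]        # invalid UTF-8: cut mid-codepoint
encoded = php_style_json_encode(bad)    # -> False

# false . "..." in PHP behaves like "" . "...", so the exported-variables
# blob silently vanishes from the page HTML:
script = "var wgFlowData = " + ("" if encoded is False else encoded) + ";"
print(script)  # var wgFlowData = ;
```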

The revision that breaks this particular page is sqs6skdk7uu952ab; I'll see if I can fix that one in the DB and also fix the truncation code itself.

...but somehow the API does manage to output JSON for this content: the invalid Unicode gets replaced with \ufffd: https://zh.wikipedia.org/w/api.php?action=flow&submodule=view-topic-history&page=Topic:Sqs6skdav48d3xzn&vthformat=wikitext&format=json

...which happens because ApiResult runs everything through Language::normalize(), which cleans this up.

It looks like the "summary" isn't in the DB at all, but is generated at view time by calling Flow\Parsoid\Utils::htmlToPlaintext(), which calls Language::truncate(). That's the core utility for truncating strings in a Unicode-aware way, so I'd be somewhat surprised if there was a bug in that function.
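For reference, codepoint-aware truncation to a byte budget is not hard to do correctly. A hypothetical sketch (in Python; truncate() here is illustrative, not the actual Language::truncate() implementation) of the invariant such a function must maintain: never cut inside a multibyte sequence.

```python
def truncate(text, max_bytes, ellipsis="..."):
    # Truncate to at most max_bytes bytes of UTF-8, appending an ellipsis,
    # while only ever dropping whole characters.
    raw = text.encode("utf-8")
    if len(raw) <= max_bytes:
        return text
    budget = max_bytes - len(ellipsis.encode("utf-8"))
    out, used = [], 0
    for ch in text:
        n = len(ch.encode("utf-8"))
        if used + n > budget:
            break
        out.append(ch)
        used += n
    return "".join(out) + ellipsis

result = truncate("历史记录摘要", 10)
print(result)  # whole characters only, e.g. two CJK chars plus "..."
# Round-tripping proves the output is still valid UTF-8:
assert result.encode("utf-8").decode("utf-8") == result
```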

Change 249050 had a related patch set uploaded (by Catrope):
Language::truncate(): don't chop up multibyte characters when input contains newlines

https://gerrit.wikimedia.org/r/249050

Change 249051 had a related patch set uploaded (by Krinkle):
Language::truncate(): don't chop up multibyte characters when input contains newlines

https://gerrit.wikimedia.org/r/249051

Change 249050 merged by jenkins-bot:
Language::truncate(): don't chop up multibyte characters when input contains newlines

https://gerrit.wikimedia.org/r/249050

Change 249051 merged by jenkins-bot:
Language::truncate(): don't chop up multibyte characters when input contains newlines

https://gerrit.wikimedia.org/r/249051

Krinkle subscribed.

Checked in beta - topic titles, summaries with CJK and updates to them displayed correctly in History.

Screenshot attached: Screen Shot 2015-10-30 at 5.23.23 PM.png (232 KB)