Page MenuHomePhabricator

A template is ignored by Parsoid because Parsoid considers the template string as part of an URL
Open, MediumPublic

Description

http://en.wikipedia.org/wiki/Bieszk%C3%B3w_G%C3%B3rny

A template is ignored by Parsoid because Parsoid considers the template string as part of an URL.

wikitext:

(http://www.twittercells.com){{Infobox settlement
| name = Bieszków Górny
...
}}

The {{Infobox settlement|...}} template has no data-mw counterpart in rdf, and is ignored by the template parser. TieredParser extracts the template successfully.

Event Timeline

Bianjiang created this task.Jan 4 2016, 6:06 PM
Bianjiang raised the priority of this task from to Needs Triage.
Bianjiang updated the task description. (Show Details)
Bianjiang added a project: Parsoid.
Bianjiang added subscribers: Aklapper, Bianjiang.
Restricted Application added a subscriber: StudiesWorld. · View Herald TranscriptJan 4 2016, 6:06 PM
Arlolra added a subscriber: Arlolra.Jan 4 2016, 8:41 PM

A minimum test case could be http://transcribe.com/one{{echo|>}} (where > isn't a valid char for urllink)

We probably need a solution like T48811 where we inspect the expanded attribute (href) for invalid chars, and then pop the rest of the tokens out of the link.

jmadler added a subscriber: jmadler.Jan 6 2016, 5:11 AM
Arlolra triaged this task as Medium priority.Jan 6 2016, 5:42 PM
Arlolra set Security to None.
LGoto moved this task from Needs Triage to Backlog on the Parsoid board.Sat, Feb 15, 9:42 PM
Restricted Application added a subscriber: Liuxinyu970226. · View Herald TranscriptSat, Feb 15, 9:42 PM