Page MenuHomePhabricator

Remove line feed characters in title (and possible other fields?)
Open, LowPublic0 Estimated Story Points

Description

There are line feed characters in title on pages from this website, which causes template errors.

http://www.juntadeandalucia.es/presidencia/portavoz/gobierno/114816/susana/diaz/destaca/denominacion/origen/montilla/moriles/sinonimo/riqueza/calidad

Remove line feed characters from all fields (maybe except for 'abstract' field?).

Event Timeline

Elitre created this task.May 12 2015, 6:49 PM
Elitre raised the priority of this task from to Needs Triage.
Elitre updated the task description. (Show Details)
Elitre added a project: Citoid.
Elitre added a subscriber: Elitre.
Restricted Application added a subscriber: Aklapper. · View Herald TranscriptMay 12 2015, 6:49 PM
Mvolz moved this task from Backlog to Site specific issues on the Citoid board.Sep 30 2015, 6:05 PM
Mvolz renamed this task from Results of a test with 10 random .es URLs on the beta cluster to Remove line feed characters in title (and possible other fields?).Sep 19 2016, 3:52 PM
Mvolz triaged this task as Low priority.
Mvolz updated the task description. (Show Details)
Restricted Application added a project: VisualEditor. · View Herald TranscriptSep 19 2016, 3:52 PM
Mvolz added a subscriber: Mvolz.Sep 19 2016, 3:53 PM

Re-checked all of these, all have since been resolved except the one I've edited in the description. (The date format is in ISO which is the most compatible format across languages)

Jdforrester-WMF set the point value for this task to 0.
Mvolz updated the task description. (Show Details)
Restricted Application added a subscriber: TerraCodes. · View Herald TranscriptOct 10 2016, 2:41 PM
Mvolz moved this task from Site specific issues to Service on the Citoid board.Oct 28 2016, 3:11 PM

I've tried to reproduce this issue, but the URL in the task description (http://www.juntadeandalucia.es/presidencia/portavoz/gobierno/114816/susana/diaz/destaca/denominacion/origen/montilla/moriles/sinonimo/riqueza/calidad) now gives a 404 error, and so Citoid can't generate anything from it. The URL in the duplicate task I just merged (http://www.superheromoviesnews.com/2014/02/x-men-producer-lauren-shuler-donner.html) is also a 404.

I tested with a temporary page I hosted and this is still a problem. You can reproduce with a page containing this HTML:

<title>Newline
test</title>