Page MenuHomePhabricator

CommonsMetadata should remove simple HTML wrapping
Closed, ResolvedPublic

Description

Where possible, metadata values should be plain text, not HTML (Wikidata does not support HTML values). There is not much we can do with arbitrary HTML, but a few simple values should be recognized, e.g. <p>text</p>, <ul><li>text</li></ul> etc. (Also, maybe a list with multiple elements could be turned into a multivalued field?)


Version: unspecified
Severity: minor

Details

Reference
bz57848

Event Timeline

bzimport raised the priority of this task from to Needs Triage.
bzimport set Reference to bz57848.
Tgr created this task.Dec 2 2013, 5:01 PM

Change 120948 had a related patch set uploaded by Gergő Tisza:
Clean parsed HTML

https://gerrit.wikimedia.org/r/120948

Change 120948 merged by jenkins-bot:
Clean parsed HTML

https://gerrit.wikimedia.org/r/120948

Tgr added a comment.Mar 27 2014, 3:44 PM

<p> is cleaned now; <ul> seems too complex to be worth it.

Gilles moved this task from Untriaged to Done on the Multimedia board.Dec 4 2014, 10:11 AM
Gilles triaged this task as Unbreak Now! priority.
Gilles lowered the priority of this task from Unbreak Now! to Needs Triage.Dec 4 2014, 11:23 AM