Page MenuHomePhabricator

CommonsMetadata should remove simple HTML wrapping
Closed, ResolvedPublic


Where possible, metadata values should be plain text, not HTML (Wikidata does not support HTML values). There is not much we can do with arbitrary HTML, but a few simple values should be recognized, e.g. <p>text</p>, <ul><li>text</li></ul> etc. (Also, maybe a list with multiple elements could be turned into a multivalued field?)

Version: unspecified
Severity: minor



Event Timeline

bzimport raised the priority of this task from to Needs Triage.
bzimport set Reference to bz57848.
Tgr created this task.Dec 2 2013, 5:01 PM

Change 120948 had a related patch set uploaded by Gergő Tisza:
Clean parsed HTML

Change 120948 merged by jenkins-bot:
Clean parsed HTML

Tgr added a comment.Mar 27 2014, 3:44 PM

<p> is cleaned now; <ul> seems too complex to be worth it.

Gilles moved this task from Untriaged to Done on the Multimedia board.Dec 4 2014, 10:11 AM
Gilles triaged this task as Unbreak Now! priority.
Gilles lowered the priority of this task from Unbreak Now! to Needs Triage.Dec 4 2014, 11:23 AM