Page MenuHomePhabricator

Media file metadata should be sanitized to be valid UTF-8 before inclusion in page
Open, LowPublic

Description

One problem caused by this is that double-underscore magic words don't work if the metadata is invalid; see T117066 for details.

Event Timeline

Tgr created this task.Oct 29 2015, 11:28 PM
Tgr raised the priority of this task from to Needs Triage.
Tgr updated the task description. (Show Details)
Tgr added a project: MediaWiki-File-management.
Tgr added a subscriber: Tgr.
Restricted Application added a project: Multimedia. · View Herald TranscriptOct 29 2015, 11:28 PM
Restricted Application added subscribers: Steinsplitter, Aklapper. · View Herald Transcript
Tgr updated the task description. (Show Details)Oct 29 2015, 11:28 PM
Tgr set Security to None.
Restricted Application added a subscriber: Matanya. · View Herald TranscriptOct 29 2015, 11:29 PM

Specificly, it looks like the Exif code isn't validating things. Some of the other metadata pieces run UtfNormal\Validator::quickIsNFCVerify() which removes invalid utf-8.

MarkTraceur triaged this task as Low priority.Dec 21 2015, 9:41 PM
MarkTraceur added a subscriber: MarkTraceur.
Restricted Application added a project: Commons. · View Herald TranscriptDec 21 2015, 9:41 PM
zhuyifei1999 moved this task from Incoming to Backlog on the Commons board.Jan 2 2016, 6:43 AM
MarkTraceur moved this task from Untriaged to Triaged on the Multimedia board.Dec 6 2016, 4:13 PM
Restricted Application added a subscriber: Poyekhali. · View Herald TranscriptDec 6 2016, 4:13 PM