Provide EPUB sanitizer


Author: stf

EPUB is a open format for E-Books. Even though it is not really easy to create, its xml-based design enables a broad use. I expect a lot of wikimedia-related epubs, e.g. from wikipedia, wikisource or wikibook pages, which would be nice to store right in the projects near by its source.

Version: unspecified
Severity: enhancement

bzimport added a subscriber: Unknown Object (MLST).
bzimport set Reference to bz17858.
bzimport created this task.Via LegacyMar 8 2009, 11:09 AM
bzimport added a comment.Via ConduitMar 20 2009, 5:49 PM

jeluf wrote:

EPUB is a ZIP file containing (X)HTML files. We should not distribute these without sanitizing them first. Even though Javascript is not part of the EPUB specification, we can't be sure that browser plugins properly disable the browser's Javascript engine.

> changed bug summary, keywords, product

brion added a comment.Via ConduitMar 20 2009, 5:54 PM

Might be interesting, but as noted would need some special support for inline reading and sanitation etc.

Bawolff added a comment.Via ConduitMay 22 2010, 1:30 AM

There exists a tool to validate such files at which might be useful here.

Bawolff added a comment.Via ConduitJan 9 2012, 7:56 PM

I'm resetting the priority field. You really shouldn't be touching those unless you're a developer, and you definitely shouldn't mess with them without an explanation as to why.

Gilles added a project: Multimedia.Via WebNov 24 2014, 3:37 PM

Add Comment