
Add tracking category for pages that contain errors flagged by HTMLTidy
Closed, Resolved (Public)

Description

The parser is aware if HTMLTidy found errors or not. If HTMLTidy finds (and auto-corrects) errors, these pages should be auto-categorized. Presumably the HTML can be fixed by humans or bots or something.

I guess there could be gradations, depending on how specific HTMLTidy is. That is, Category:HTMLTidy found very serious errors; Category:HTMLTidy found mildly serious errors; etc.


Version: unspecified
Severity: enhancement

Details

Reference
bz40607

Event Timeline

bzimport raised the priority of this task to Low. Nov 22 2014, 12:48 AM
bzimport set Reference to bz40607.
bzimport added a subscriber: Unknown Object (MLST).

If bots can fix them, why aren't they also able to find them?
And if tidy auto-corrects them, why should humans bother "fixing" them?

(In reply to comment #1)

If bots can fix them, why aren't they also able to find them?

HTMLTidy is being run server-side. It's vastly more convenient for the server to auto-categorize the pages. Then a bot will be able to easily find them. :-)
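A rough sketch of what the bot side could look like once such a category exists, using the standard `list=categorymembers` query of the MediaWiki API. The category name and the endpoint below are made up for illustration; no such tracking category is defined in this task.

```python
from urllib.parse import urlencode

# Assumptions for illustration only: the endpoint and the category name
# are hypothetical and not taken from this task.
API_URL = "https://en.wikipedia.org/w/api.php"
CATEGORY = "Category:Pages with HTMLTidy errors"


def build_query(category, cont=None, limit=50):
    """Build a categorymembers query URL, optionally resuming from a continue token."""
    params = {
        "action": "query",
        "list": "categorymembers",
        "cmtitle": category,
        "cmlimit": str(limit),
        "format": "json",
    }
    if cont:
        # Token returned by a previous response under continue.cmcontinue
        params["cmcontinue"] = cont
    return API_URL + "?" + urlencode(params)


def titles_from_response(data):
    """Extract page titles from a decoded JSON response."""
    members = data.get("query", {}).get("categorymembers", [])
    return [m["title"] for m in members]
```

A bot would fetch `build_query(CATEGORY)`, decode the JSON, pass it to `titles_from_response`, and repeat with the continue token until none is returned.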

And if tidy auto-corrects them, why should humans bother "fixing" them?

Graceful deprecation.

Some HTML errors are probably caused by MediaWiki itself rather than by those pesky humans, especially in the table generation code. That is only a guess, though; I could be wrong.

There is an easy trick to test the output without HTMLTidy applied: use Special:ExpandTemplates. I did so with https://de.wikipedia.org/wiki/Spezial:Vorlagen_expandieren?wpInput={{Hilfe:Tabellen}} (en.wikipedia's help page on tables is not transcludable), and judging by the validator.w3.org results, it doesn't look like MediaWiki's table parsing generates broken HTML.
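For anyone who wants to repeat the check on other pages, here is a small sketch that builds the same kind of Special:ExpandTemplates link with the braces properly percent-encoded. The helper name is made up; the only parameter used is `wpInput`, as in the link above.

```python
from urllib.parse import quote, urlencode


def expand_templates_url(base, special_page, template):
    """Return a Special:ExpandTemplates URL that transcludes `template`.

    `special_page` is the localized page name, e.g. "Spezial:Vorlagen_expandieren"
    on de.wikipedia.
    """
    wikitext = "{{" + template + "}}"
    # Keep ":" and "/" readable in the page title; urlencode() escapes the braces.
    return f"{base}/wiki/{quote(special_page, safe='/:')}?" + urlencode({"wpInput": wikitext})


url = expand_templates_url(
    "https://de.wikipedia.org", "Spezial:Vorlagen_expandieren", "Hilfe:Tabellen"
)
```

The resulting page can then be saved and run through validator.w3.org as described.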

Izno claimed this task.
Izno subscribed.

I think we can resolve this based on the existence of MediaWiki-extensions-Linter.