
Less expensive redirect article handling by dumpHTML
Closed, DeclinedPublic

Description

Currently, redirect articles are rendered as simple HTML pages/files which
simply perform an HTTP refresh to the destination article page/file.
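
For illustration, a redirect stub of this kind might look like the output of the sketch below. The function name `redirect_stub` and the exact markup are assumptions made for this example; the HTML that dumpHTML actually emits is not shown in this report.

```python
from html import escape


def redirect_stub(target_href: str) -> str:
    """Return a minimal HTML page that immediately refreshes to target_href.

    Hypothetical helper: it only illustrates the "HTTP refresh" technique,
    not the markup dumpHTML really writes.
    """
    href = escape(target_href, quote=True)
    return (
        "<!DOCTYPE html>\n"
        "<html><head>\n"
        f'<meta http-equiv="refresh" content="0;url={href}">\n'
        "</head><body>\n"
        # Fallback link for clients that ignore the refresh directive.
        f'<a href="{href}">{href}</a>\n'
        "</body></html>\n"
    )
```

One such file is written per redirect, which is why redirects add to the total file count of a dump.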

I modified the script so that links no longer point to the redirect page,
but directly to the destination page/file.
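
The idea can be sketched roughly as follows. This is not the attached patch: the names `resolve_redirect` and `shortcut_links` and the plain dictionary of redirects are assumptions for the example. The helper also follows chains of redirects and stops on loops, both of which this description reports as handled; leaving a looping link untouched is an assumption about what "handled" means.

```python
from typing import Dict, List, Optional


def resolve_redirect(title: str, redirects: Dict[str, str]) -> Optional[str]:
    """Follow redirects from title to its final destination.

    Returns the final non-redirect title, or None if a loop is detected.
    """
    seen = set()
    current = title
    while current in redirects:
        if current in seen:
            return None  # redirect loop, e.g. A -> B -> A
        seen.add(current)
        current = redirects[current]
    return current


def shortcut_links(links: List[str], redirects: Dict[str, str]) -> List[str]:
    """Rewrite each link to point directly at its final destination.

    Links caught in a redirect loop are left unchanged (an assumption).
    """
    rewritten = []
    for link in links:
        target = resolve_redirect(link, redirects)
        rewritten.append(link if target is None else target)
    return rewritten


# Hypothetical data: a two-step chain and a two-page loop.
redirects = {
    "Paname": "Paris (ville)",
    "Paris (ville)": "Paris",
    "A": "B",
    "B": "A",
}
print(shortcut_links(["Paname", "Lyon", "A"], redirects))
# prints ['Paris', 'Lyon', 'A']
```

Once every link has been rewritten this way, nothing points at the redirect stubs any more, so they no longer need to be generated.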

This is enabled with the option --shortcut-redirects.

Chains of redirects ("queued redirects") and redirect loops ("recursive
redirects") are handled.

This is a first step towards reducing the huge number of HTML pages generated
by dumpHTML.

On the French Wikipedia, it reduces the number of files by more than 10%.


Version: unspecified
Severity: enhancement

Details

Reference
bz7416

Event Timeline

bzimport raised the priority of this task from to Low. Nov 21 2014, 9:22 PM
bzimport set Reference to bz7416.
bzimport added a subscriber: Unknown Object (MLST).

Created attachment 2417
diff

This would make the go button not work.

This would break the Go button for redirects only.

For people building an offline reader this is not an issue: they don't use the
default dumpHTML skin, and they have their own search engine.

For these people the option remains fully useful.

No idea how current this code is. But this could go in as a feature that could be enabled by passing an appropriate parameter. It shouldn't be the default behavior but I can see why some people would find it useful.

sumanah wrote:

Emmanuel, would you mind updating and submitting your patch directly into our new Git source control system?

https://www.mediawiki.org/wiki/Git/Workflow

You can do this by getting and using "developer access" if you do not have it already:

https://www.mediawiki.org/wiki/Developer_access

Thanks.

Aklapper lowered the priority of this task from Low to Lowest. Apr 2 2015, 2:46 PM
Nemo_bis raised the priority of this task from Lowest to Low. Apr 3 2015, 10:19 PM
Nemo_bis set Security to None.
Aklapper subscribed.

DumpHTML has been unmaintained and broken for many years, and it is being archived. Declining this task per T280185.