download tool issue with Cyrillic encoding in filenames (wget)
Closed, InvalidPublic


Author: a1

Description: tool do not recognize Cyrillic in names of files. For example it writes "Р%9FамС%8FС%82РЅРёРє_Р·Р°С%82опленнС%8BРј_РєРѕС%80аблС%8FРј_РІ_СеваС%81С%82ополе"
instead of "Памятник затопленным кораблям в Севастополе.JPG" Please, fix it.

Version: unspecified
Severity: normal


bzimport set Reference to bz40844.
bzimport added a subscriber: Unknown Object (MLST).
bzimport created this task.Oct 7 2012, 7:40 PM

As answered in the mailing list, that's a wget problem.

The list generated by my tool correctly uses:

The problem seems to lie in wget when extracting to a local filename.

If you are using *nix with a utf-8 filesystem, pass the
--restrict-file-names=nocontrol parameter to wget.

If you're using Windows you will end up with utf-8 encoded filenames, so
you'd need another script to decode them to the format used by Windows.

Andrij: Does comment 1 help?

a1 wrote:

Unfortunately no. I could not understand how could i "pass the
--restrict-file-names=nocontrol parameter to wget".

Andrij, you would add that inside download.bat

I could try downloading the category for you if that helps.

I reported the problem upstream This should be fixed at wget level.

Does this bug belongs to this bugzilla?

Andrij: Toolserver issues should be filed at

Closing as "INVALID" simply because this bug database is not the place where this report should be, but not because the report itself is invalid.

Restricted Application added a subscriber: StudiesWorld. · View Herald TranscriptJan 28 2016, 6:13 PM

Add Comment