Page MenuHomePhabricator

PDF URL encoding problem
Open, Needs TriagePublic

Description

When I am trying to upload https://archive.org/details/dli.ministry.28083 as PDF to Commons, it always returns the error.

Screenshot Capture - 2021-05-18 - 17-54-48.png (244×1 px, 24 KB)

An error occurred: Client error: GET https://archive.org/download/dli.ministry.28083/14476.1666-Bharatavarsera%20Itihasa%20a%20Concise%20History%20of%20India.pdf resulted in a 404 Not Found response:

Event Timeline

This seems to be a URL encoding issue, in that the files on the IA actually have encoded entities in them. This means we're requesting

https://archive.org/download/dli.ministry.28083/14476.1666-Bharatavarsera%20Itihasa%20a%20Concise%20History%20of%20India.pdf

but should actually be requesting

https://archive.org/download/dli.ministry.28083/14476.1666-Bharatavarsera%2520Itihasa%2520a%2520Concise%2520History%2520of%2520India.pdf
Samwilson renamed this task from An error occurred: Client error: `GET https://archive.org/download to PDF URL encoding problem.Jun 16 2021, 11:45 PM
Samwilson updated the task description. (Show Details)