Page MenuHomePhabricator

Large file upload request for 0399CHRO
Closed, ResolvedPublic

Description

Could this large file be uploaded from [2] please? It remains impossible for PDFs of this size to be uploaded by volunteers through the API or via project upload UIs, despite being well under the 4GB limit that supposedly applies. Currently this file is the largest example of IA book upload which is repeatedly failing.[3] The content is a book scan from 1879. A suitable filename would be "Chronological history of plants (IA 0399CHRO).pdf"

  1. Description https://archive.org/details/0399CHRO/page/n5/mode/2up
  2. PDF source https://archive.org/download/0399CHRO/0399CHRO.pdf
  3. IA report https://commons.wikimedia.org/wiki/User_talk:F%C3%A6/IA_books/residuals

Event Timeline

Fae created this task.Sun, Oct 4, 11:55 AM
Restricted Application added a project: Internet-Archive. · View Herald TranscriptSun, Oct 4, 11:55 AM
Restricted Application added a subscriber: Aklapper. · View Herald Transcript
Jeff_G added a subscriber: Jeff_G.Mon, Oct 12, 2:16 PM

No tokens found. :(

Kizule added a subscriber: Kizule.Fri, Oct 16, 9:16 AM

What about using bigChunkedUpload.js script?

Fae added a comment.Sat, Oct 17, 11:53 AM

What about using bigChunkedUpload.js script?

No, this does not work. Nor do any of the established local methods, including Pywikibot upload from a command line, nor even running a custom script to vary the chunk size, which appears to make no difference to likely outcome.

Urbanecm added a subscriber: Urbanecm.

I'll do it. Could you please provide a txt file having the description for Wikimedia Commons (ie. including the Information template and a valid license template)? Thanks.

Restricted Application added a project: User-Urbanecm. · View Herald TranscriptSat, Oct 17, 11:59 AM
Urbanecm triaged this task as Medium priority.Sat, Oct 17, 11:59 AM
Fae added a comment.Sat, Oct 17, 12:09 PM

For https://archive.org/download/0399CHRO/0399CHRO.pdf:

Filename is
File:Chronological history of plants- mans record of his own existence illustrated through their names, uses, and companionship (IA 0399CHRO).pdf

Starting image page text:

Thanks. Downloading the file to the maintenance server.

Mentioned in SAL (#wikimedia-operations) [2020-10-17T13:22:22Z] <Urbanecm> [urbanecm@mwmaint2001 ~/uploads]$ mwscript importImages.php --wiki=commonswiki --comment-ext=txt --user=Fæ . # T264529

Urbanecm closed this task as Resolved.Sat, Oct 17, 1:22 PM

File is live.

[urbanecm@mwmaint2001 ~/uploads]$ mwscript importImages.php --wiki=commonswiki --comment-ext=txt --user=Fæ .
Importing Files

Importing Chronological history of plants (IA 0399CHRO).pdf...done.

Found: 1
Added: 1
[urbanecm@mwmaint2001 ~/uploads]$