
For PDL, download and stream the PDF if available
Closed, Resolved, Public

Description

IMPORTANT: Make sure to read the Outreachy participant instructions and communication guidelines thoroughly before commenting on this task. This space is for project-specific questions, so avoid asking questions about getting started, setting up Gerrit, etc. When in doubt, ask your question on Zulip first!

For Panjab Digital Library (PDL), we currently parse all of a document's images and combine them into a single PDF before streaming it to the Internet Archive. However, for some books, newsletters, and magazines, this whole process can be avoided because the PDF can be downloaded directly. For example, check this book: it has a Download PDF option, which can be used to download the PDF and automate the upload without parsing each image separately.
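The idea could be sketched roughly as follows. This is only a minimal illustration, not the tool's actual code: the function name `stream_pdf_if_available`, the `opener` parameter, and the example URL are hypothetical, and how the direct PDF link is found for a given PDL item is not covered here. The sketch tries the direct download, checks that the response really starts with the PDF signature (`%PDF-`), and reports failure so the caller can fall back to the existing image-by-image path.

```python
import io
import shutil
import urllib.request
from typing import BinaryIO, Callable

PDF_MAGIC = b"%PDF-"  # every PDF file starts with these bytes


def stream_pdf_if_available(
    pdf_url: str,
    dest: BinaryIO,
    opener: Callable = urllib.request.urlopen,  # injectable so it can be tested offline
) -> bool:
    """Copy the PDF at pdf_url into dest.

    Returns True if a PDF was streamed, False if the download failed or the
    response was not a PDF (the caller should then use the image pipeline).
    """
    try:
        with opener(pdf_url) as response:
            head = response.read(len(PDF_MAGIC))
            if head != PDF_MAGIC:
                # Probably an HTML error page or a missing file: nothing is written.
                return False
            dest.write(head)
            # Copy the rest in chunks instead of loading the whole file in memory.
            shutil.copyfileobj(response, dest)
            return True
    except OSError:
        # urllib.error.URLError and HTTPError are subclasses of OSError.
        return False


if __name__ == "__main__":
    # Offline demo with a fake opener; the URL is a placeholder, not a real PDL link.
    fake_opener = lambda url: io.BytesIO(b"%PDF-1.4 example bytes")
    buffer = io.BytesIO()
    ok = stream_pdf_if_available("https://example.org/book.pdf", buffer, opener=fake_opener)
    print("direct PDF available:", ok)
```

When the function returns `False`, the existing behaviour (parsing each image and combining them into a PDF) would remain the fallback, so items without a Download PDF option are unaffected.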