Page MenuHomePhabricator

Create workflow for handling article image files
Closed, ResolvedPublic

Description

Looks like the articles in the document repository consist of separate pages (png files) per article.

If there's no pdf/djvu versions of the articles (can you confirm @Ambrosiani ?), they will have to be created as part of the upload.

Preferably Djvu as that's heavily used by wikisource: https://commons.wikimedia.org/wiki/Help:DjVu
https://commons.wikimedia.org/wiki/Help:Creating_a_DjVu_file

See also https://en.wikisource.org/wiki/User:GrafZahl/How_to_digitalise_works_for_Wikisource

Event Timeline

We got an entry point to the API last week so that it is possible to get the paths of all the pages linked to an article. It requires an authentication header that is described in the e-mail thread Fataburenmetadata som xml och csv.

This means we now can access the image files we need. Next steps will be