
need extractor for archive.org books
Closed, Duplicate · Public · 8 Estimated Story Points

Description

When I paste a URL like

https://archive.org/details/minutesofcommitt571newy

or

https://archive.org/stream/cu31924028853327

into Citoid, it should detect that these are books and extract the publisher, author, and other bibliographic information from archive.org.

I'm not sure if this helps, but some of this information can probably be retrieved from Open Library (for which there is already an extractor); e.g. http://openlibrary.org/ia/minutesofcommitt571newy for the first link above.
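
To illustrate the mapping suggested above, here is a minimal sketch (not Citoid code) that pulls the item identifier out of an archive.org /details/ or /stream/ URL and builds the corresponding openlibrary.org/ia/ URL. The function names are hypothetical, and it assumes the identifier is always the first path segment after /details/ or /stream/, which holds for the two example links but has not been checked against other archive.org URL shapes.

```python
import re
from urllib.parse import urlparse

# Assumption: the item identifier is the first path segment after
# /details/ or /stream/, as in the two example URLs in this task.
_IA_PATH = re.compile(r"^/(?:details|stream)/([^/?#]+)")


def archive_org_identifier(url):
    """Return the archive.org item identifier, or None if the URL does not match."""
    parsed = urlparse(url)
    host = (parsed.hostname or "").lower()
    if host != "archive.org" and not host.endswith(".archive.org"):
        return None
    match = _IA_PATH.match(parsed.path)
    return match.group(1) if match else None


def open_library_url(url):
    """Build the openlibrary.org/ia/ URL for an archive.org item URL (hypothetical helper)."""
    identifier = archive_org_identifier(url)
    if identifier is None:
        return None
    return "http://openlibrary.org/ia/" + identifier
```

An extractor could try a lookup like this first and hand the resulting URL to the existing Open Library extractor; whether Open Library actually has a record for a given item would still need to be checked at request time.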

[This could be a SoC project,

Event Timeline

Restricted Application added a project: Internet-Archive. · Mar 27 2016, 7:56 PM
Restricted Application added a subscriber: Aklapper.
Restricted Application added a project: VisualEditor. · Mar 27 2016, 9:24 PM
Jdforrester-WMF triaged this task as Medium priority. · Apr 12 2016, 7:22 PM
Jdforrester-WMF set the point value for this task to 8.
Jdforrester-WMF moved this task from To Triage to Freezer on the VisualEditor board.
Mvolz moved this task from Backlog to IO Tasks on the Citoid board. · Jul 29 2016, 3:03 PM
Mvolz moved this task from IO Tasks to Site specific issues on the Citoid board. · Oct 28 2016, 3:36 PM