Page MenuHomePhabricator

Implement Internet Archive BookReader in Commons & Wikisource
Open, LowestPublic

Description

Proposed in Community-Wishlist-Survey-2016. Received 26 support votes, and ranked #65 out of 265 proposals. View full proposal with discussion and votes here.

Original Proposal

Problem: When we view a scanned book in PDF or DjVu format in Commons or Wikisource, its always single-page view with no option to go for previous or next page in that preview. Every-time we go to the next page, we need to click on the drop down menu of pages. See [[ File:The Bird Book.djvu | this book ]] in Commons for example. For book readers, its more like viewing images than reading books. This not only creates difficulty in reading, but also in identifying missing, duplicated pages etc. if the file needs to be corrected.

Who would benefit: Commons & Wikisource editors and readers

Proposed solution: Implement the open-source Internet Archive BookReader in all wikis specially Commons and Wikisource. For the same book in Internet Archive, see the difference. It has features like single view, double-page view, thumbnail view, zoom and a wide variety of other features.

Proposer: @Bodhisattwa (talk) 13:15, 15 November 2016 (UTC)

Event Timeline

Restricted Application added a subscriber: Aklapper. · View Herald Transcript

Umm, why not use Magnus's book2scrolltool at Toollabs?
https://tools.wmflabs.org/book2scroll/index.html?lang=en&file=A_review_of_the_state_of_the_question_respecting_the_admission_of_dissenters_to_the_universities.djvu&startpage=1

No waiting, just do it now, no phabricator required.

It is used at English Wikisource at this time on every Index: page and has been there for years.

Aklapper triaged this task as Lowest priority.Dec 26 2016, 11:14 AM

I think this proposal is more about adding a better PDF and Djvu viewer directly in the file-view page on Commons, so readers don't have to go to a separate site. book2scroll is brilliant, but isn't integrated in the same place that the files live.

Of course, there's perhaps the option to make a prominent link to book2scroll on PDF and Djvu pages on Commons and Wikisource, and to add better navigation back to these places from book2scroll (because otherwise people get left just relying on their back button and perhaps being confused about what they're looking at).

One thing I personally like about this proposal is that it's making use of an external component to which we may then contribute, to the benefit of the IA (and others). :-)

(Oh, and happy new year everyone.)

A few quick thoughts:

I'm not sure if this is helpful, but I was the designer on this project at IA and would be happy to help out with any design work on this project (there are a few features related to search in particular that were designed and are not currently implemented).

While Commons isn't specifically intended for reading books, a nice way to do so would be a definite plus (and the IA's tool is awesome). Perhaps it could be setup to load instead of MediaViewer when looking at multipage files.

I'm not sure if this is helpful, but I was the designer on this project at IA and would be happy to help out with any design work on this project (there are a few features related to search in particular that were designed and are not currently implemented).

Thanks This our most needed wish.

@jayantanth glad to hear. I'm at Hackathon and happy to help out!

@Alex_brollo created a gadget in Italian Wikisource with similar feature (see for example). Although the gadget needs to be localised for other Wikisource projects.

Thanks Bodhisattwa for mention.
Yes, the gadget is vaguely inspired to IA viewer - with the deep difference that it shows nsPage html coming from wikisource digitalization. It shows djvu/pdf OCR for "red pages" (here an example). The gadget is on active development, using a "down-top" approach - t.i. adding new features to basic ones. Presently it needs a "chronology of navigation" and a "search inside the whole book" tool.
The gadget has some dependencies from other it.wikisource scripts - I'll try to import them just to make easier its localization.

Could anyone please specify estimated time period for doing the project for a beginner in wikimedia. Also the languages that would be required in its implementation.
I would be interested in taking up the project most probably. I would also like if I can continue the same in upcoming GSOC if possible.
REPLY ASAP. PEACE.

I see a possible relationship between the idea of implementing a "book
viewer" into Commons and the proposal of simplifying - as much as possible

  • uploading of books into wikisource. Both ideas underline a central role

of Commons into the work flow related to an old, but important, kind of
"media", the *book. *

Really, I sometimes think that Commons could be considered the
international, multi-language *library*, while wikisource projects could be
considered specialized *digital typographies*. I realize that's an odd
idea.... but I like to share it anyway. An excellent book viewer running
into Commons, showing alternatively scan images (as IA viewer does) or
digitalized version of pages, could be a first step.

Alex

Alex

@Himanshuc3 As we currently don't have mentors and clear directions for this project, I recommend you keep an eye on the MediaWiki GSoC page that we will continue to update with projects in the days to come: https://www.mediawiki.org/wiki/Google_Summer_of_Code/2018

Just to mention it here too, take a look to it.source "book viewer", vaguele inspired (but very different) from IA Viewer. Simply follow this link:

it.source book viewer

Simply adding ?vis=true to any it.source Index page, you'll run the viewer for that Index page. For unexisting pages, the viewer displays the OCR layer. There's a Search option (binoculars icon), but it runs for exixting pages only.

By now, the viewer runs only inside it.source, but I think that it could be scaled and generalized - or simply used as a suggestion.

Removing the Possible-Tech-Projects tag as we are planning to kill it soon! This project does not seem to fit in the Outreach-Programs-Projects category in its current state, so I am not adding that tag right now!