
Investigate what data is available about a source's use across the open web
Open, Medium · Public · Spike


See the parent design proposal. This task is to investigate what information we have, or can derive, about a source from non-Wikimedia data providers.

This might include:

  • Source entries in universal library catalogues or other repositories (like the Internet Archive)
  • Availability of PDF copies or other media on other sites
  • Availability through programs such as The Wikipedia Library

This one is pretty open-ended.

Event Timeline

JMinor triaged this task as Medium priority. Mar 30 2020, 6:24 PM
JMinor created this task.
Restricted Application added a project: Internet-Archive. Mar 30 2020, 6:24 PM
LGoto removed cmadeo as the assignee of this task. Mar 30 2020, 6:26 PM

Possible sources to explore:

  • OCLC (which runs WorldCat) covers many of the external data needs for books.
  • arXiv might be helpful for academic articles
  • Wikisource
  • HathiTrust
  • Europeana
  • DPLA (dp.la)
  • Project Gutenberg
  • Open Library / Internet Archive
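
As a starting point for the spike, here is a rough sketch of what a lookup against one of these providers might look like, using the Open Library Books API with an ISBN. The helper names (build_lookup_url, summarize_record, fetch_record) are made up for illustration, and the fields read from the response ("title", "url", "ebooks") are an assumption about what the jscmd=data output contains; they should be checked against a live response before relying on them.

```python
import json
import urllib.parse
import urllib.request
from typing import Optional

OPEN_LIBRARY_BOOKS_API = "https://openlibrary.org/api/books"


def build_lookup_url(isbn: str) -> str:
    """Build a Books API URL for one ISBN (hyphens and spaces are stripped)."""
    cleaned = isbn.replace("-", "").replace(" ", "")
    query = urllib.parse.urlencode(
        {"bibkeys": f"ISBN:{cleaned}", "format": "json", "jscmd": "data"}
    )
    return f"{OPEN_LIBRARY_BOOKS_API}?{query}"


def summarize_record(payload: dict, isbn: str) -> Optional[dict]:
    """Reduce a Books API response to the few fields we care about.

    Returns None when the provider has no entry for this ISBN.
    The field names below are assumptions about the response shape.
    """
    cleaned = isbn.replace("-", "").replace(" ", "")
    record = payload.get(f"ISBN:{cleaned}")
    if record is None:
        return None
    return {
        "title": record.get("title"),
        "url": record.get("url"),
        # A non-empty "ebooks" list is treated as "some digital copy exists".
        "has_ebook": bool(record.get("ebooks")),
    }


def fetch_record(isbn: str, timeout: float = 10.0) -> Optional[dict]:
    """Query Open Library over the network and summarize the result."""
    with urllib.request.urlopen(build_lookup_url(isbn), timeout=timeout) as resp:
        payload = json.load(resp)
    return summarize_record(payload, isbn)
```

Calling fetch_record("0-451-52653-8") would return either a small summary dict or None. The same pattern (build a query, reduce the response to a few availability signals) could be repeated per provider to compare coverage, though each provider's API and terms of use would need to be checked separately.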