
Add copyright metadata to extracted PDF
Open, Low, Public

Description

The "Extract to PDF" link (*) generates a PDF of a wiki page and includes copyright info at the end of the PDF. However, it doesn't add machine-readable copyright info to the file's embedded metadata (the PDF's document properties, its equivalent of EXIF metadata).

(*) Currently, "Extract to PDF" is implemented using ElectronPdfService, though https://github.com/msokk/electron-render-service indicates that this service is deprecated. This task applies to whichever PDF service is used.

How to reproduce

  1. Go to: https://en.wikipedia.org/wiki/Main_Page
  2. Click "Download as PDF"
  3. Download the PDF
  4. Check the file properties (or use another way to view the metadata of the PDF file)
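Step 4 can also be done from a script. The following is only a rough sketch, not a real PDF parser: it scans the raw bytes of the downloaded file for literal-string entries of the document information dictionary (such as /Author) and for an uncompressed XMP packet. The function names `scan_pdf_info` and `has_xmp_packet` are made up for illustration, and the scan will miss entries that are hex-encoded, UTF-16 encoded, or stored in compressed object streams.

```python
import re
import sys

# Standard keys of the PDF document information dictionary.
INFO_KEYS = ("Title", "Author", "Subject", "Keywords", "Creator", "Producer")


def scan_pdf_info(data: bytes) -> dict:
    """Return literal-string Info entries found in raw PDF bytes.

    Crude by design: ignores hex strings, escaped parentheses, and
    anything stored inside compressed object streams.
    """
    found = {}
    for key in INFO_KEYS:
        pattern = rb"/" + key.encode("ascii") + rb"\s*\(([^)]*)\)"
        match = re.search(pattern, data)
        if match:
            found[key] = match.group(1).decode("latin-1")
    return found


def has_xmp_packet(data: bytes) -> bool:
    """Report whether an uncompressed XMP packet appears in the file."""
    return b"<x:xmpmeta" in data


if __name__ == "__main__":
    # Usage: python check_pdf.py Main_Page.pdf
    with open(sys.argv[1], "rb") as fh:
        pdf_bytes = fh.read()
    print("Info entries:", scan_pdf_info(pdf_bytes))
    print("XMP packet present:", has_xmp_packet(pdf_bytes))
```

If neither an Author entry nor an XMP packet shows up, that is consistent with the problem described here, although a dedicated PDF tool gives a more reliable answer.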

(Example use case: https://commons.wikimedia.org/wiki/File:Singularit%C3%A4tentheorie_(Osnabr%C3%BCck_2019)Vorlesung10.pdf, although in this case the PDF was generated using LaTeX.)

Acceptance criteria:
The PDF metadata should indicate:

License: Creative Commons Attribution-ShareAlike License
Author: Wikipedia, the free encyclopedia

(or a list of authors, a link to the history page, or any other reasonable machine-readable copyright info)
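One possible shape for such machine-readable info is an XMP packet carrying Dublin Core and Creative Commons properties. The sketch below builds one with Python's standard library. It is an illustration under assumptions, not what any PDF service currently emits: the helper names `q` and `build_xmp` are hypothetical, and the license URL in the usage example assumes CC BY-SA 4.0, a version this task does not specify. Embedding the result in a PDF (the `<?xpacket?>` wrapper and the Metadata stream) is not covered.

```python
import xml.etree.ElementTree as ET

# Namespaces used by XMP, Dublin Core, and Creative Commons metadata.
NS = {
    "x": "adobe:ns:meta/",
    "rdf": "http://www.w3.org/1999/02/22-rdf-syntax-ns#",
    "dc": "http://purl.org/dc/elements/1.1/",
    "cc": "http://creativecommons.org/ns#",
    "xmpRights": "http://ns.adobe.com/xap/1.0/rights/",
}
for prefix, uri in NS.items():
    ET.register_namespace(prefix, uri)

XML_LANG = "{http://www.w3.org/XML/1998/namespace}lang"


def q(prefix: str, name: str) -> str:
    """Build a namespace-qualified tag name (hypothetical helper)."""
    return "{%s}%s" % (NS[prefix], name)


def build_xmp(author: str, license_name: str, license_url: str) -> str:
    """Return an XMP document with author and license properties."""
    root = ET.Element(q("x", "xmpmeta"))
    rdf = ET.SubElement(root, q("rdf", "RDF"))
    desc = ET.SubElement(rdf, q("rdf", "Description"), {q("rdf", "about"): ""})

    # dc:creator is an ordered list of names.
    seq = ET.SubElement(ET.SubElement(desc, q("dc", "creator")), q("rdf", "Seq"))
    ET.SubElement(seq, q("rdf", "li")).text = author

    # dc:rights is a language alternative; "x-default" marks the default text.
    alt = ET.SubElement(ET.SubElement(desc, q("dc", "rights")), q("rdf", "Alt"))
    ET.SubElement(alt, q("rdf", "li"), {XML_LANG: "x-default"}).text = license_name

    # Point at the license itself so tools can resolve it.
    ET.SubElement(desc, q("cc", "license"), {q("rdf", "resource"): license_url})
    ET.SubElement(desc, q("xmpRights", "WebStatement")).text = license_url
    return ET.tostring(root, encoding="unicode")


if __name__ == "__main__":
    print(build_xmp(
        "Wikipedia, the free encyclopedia",
        "Creative Commons Attribution-ShareAlike License",
        "https://creativecommons.org/licenses/by-sa/4.0/",  # assumed version
    ))
```

A list of authors or a link to the page history could be carried the same way, for example as additional `rdf:li` entries under `dc:creator`.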