Page MenuHomePhabricator

Store Pdf extracted text in a structured table instead of img_metadata
Closed, DeclinedPublic

Description

Same as T32906 for djvu

Event Timeline

aaron raised the priority of this task from to Needs Triage.
aaron updated the task description. (Show Details)
aaron added a subscriber: aaron.
Nemo_bis added a subscriber: Nemo_bis.

A current very useful feature is the CirrusSearch indexes the text content of files; may need some adaptations if the text is moved elsewhere.

Restricted Application added subscribers: Steinsplitter, Matanya. · View Herald Transcript

A current very useful feature is the CirrusSearch indexes the text content of files; may need some adaptations if the text is moved elsewhere.

CirrusSearch just calls methods in the MediaHandler class. It should be possibly to transparently change this without affecting Cirrus or anything else depending on where the text is stored