Languageseeker
User

Projects

User does not belong to any projects.

User Details

User Since
Oct 22 2020, 5:10 AM (183 w, 6 d)
Availability
Available
LDAP User
Unknown
MediaWiki User
Languageseeker [ Global Accounts ]

Recent Activity

Jun 2 2022

Pamputt awarded T266209: Anki Plugin for Lingua Libre a Like token.
Jun 2 2022, 8:09 AM · Lingua-Libre-Legacy

May 30 2022

Languageseeker closed T266209: Anki Plugin for Lingua Libre as Resolved.

Please, see the extension at Lingua Libre and Forvo Audio Downloader.

May 30 2022, 3:39 AM · Lingua-Libre-Legacy

May 17 2022

Languageseeker added a comment to T266209: Anki Plugin for Lingua Libre.

Ok, I semi-figured it out.

May 17 2022, 5:29 AM · Lingua-Libre-Legacy
Languageseeker added a comment to T266209: Anki Plugin for Lingua Libre.

I've begun working on this: I've written the Anki part in Python and tested it by downloading audio from Forvo. The code is available on GitHub. I'm trying to reuse the code from the LinguaLibre bot, but I'm having trouble writing the SPARQL function. I would like to send a query to LinguaLibre containing a term and a language that would return all the available pronunciations recorded by native speakers, the user name of each speaker, and their learning place [or whatever works as the best proxy for accent]. Can anyone help with writing this query function?

May 17 2022, 3:42 AM · Lingua-Libre-Legacy

Feb 25 2022

Languageseeker added a comment to T302542: Do not export SVGs as PNGs (if EReaders' SVG support is more broadly available nowadays).

@Samwilson Kobo, Kindle, and Koreader support SVG now. Seems to have mostly happened between 2017 and 2019.

Feb 25 2022, 12:51 AM · WS Export

Feb 24 2022

Languageseeker created T302542: Do not export SVGs as PNGs (if EReaders' SVG support is more broadly available nowadays).
Feb 24 2022, 11:27 PM · WS Export

Jan 29 2022

Languageseeker added a comment to T294352: Batch Upload for The Smart Set.

I think the page number is present in the TEI data at https://repository.library.brown.edu/studio/item/bdr:471142/TEI/ as <pb n=""/>. It would require a script to parse the TEI and convert the <pb> elements into a list of page numbers.
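
The script described above could look roughly like the following. This is a minimal sketch, not the approach actually used on the ticket: the function name `page_numbers` and the sample document are hypothetical, and it assumes the page breaks are plain `<pb n="..."/>` elements, with or without the standard TEI namespace.

```python
import xml.etree.ElementTree as ET


def page_numbers(tei_xml: str) -> list[str]:
    """Return the n attribute of every <pb> element, in document order."""
    root = ET.fromstring(tei_xml)
    numbers = []
    for element in root.iter():
        # Compare local names so this works with or without a namespace prefix.
        local_name = element.tag.rsplit("}", 1)[-1]
        if local_name == "pb":
            # A missing n attribute is reported as an empty string.
            numbers.append(element.get("n", ""))
    return numbers


# Hypothetical sample input, not taken from the Brown repository.
sample = """<TEI xmlns="http://www.tei-c.org/ns/1.0"><text><body>
<pb n="1"/><p>First page.</p>
<pb n="2"/><p>Second page.</p>
<pb n=""/><p>Unnumbered plate.</p>
</body></text></TEI>"""

print(page_numbers(sample))
```

Empty `n` values are kept rather than dropped so that the list stays aligned with the scan sequence.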

Jan 29 2022, 8:13 PM · User-Inductiveload

Jan 19 2022

Languageseeker created T299613: The Winter's wreath.
Jan 19 2022, 11:20 PM · User-Inductiveload

Jan 13 2022

Languageseeker added a comment to T298279: Batch Upload for The Dial.

Good to know. I'll make use of this formula in the future. Sheet updated.

Jan 13 2022, 1:54 PM · User-Inductiveload
Languageseeker added a comment to T298851: A Few Books from HT.

Thank you. I'll keep your notes in mind. Congratulations on having less time soon. :)

Jan 13 2022, 2:14 AM · User-Inductiveload

Jan 12 2022

Languageseeker added a comment to T298279: Batch Upload for The Dial.

I think I fixed all the issues. I chose 2126 as the safe date to move the files to Commons because I assume that everyone who wrote in 1926 would have been dead + 70 by then.

Jan 12 2022, 5:07 PM · User-Inductiveload

Jan 9 2022

Languageseeker created T298851: A Few Books from HT.
Jan 9 2022, 6:44 PM · User-Inductiveload

Jan 7 2022

Languageseeker added a comment to T294352: Batch Upload for The Smart Set.

I tried to look into the page number and couldn't figure it out. Sorry.

Jan 7 2022, 12:02 AM · User-Inductiveload
Languageseeker added a comment to T298279: Batch Upload for The Dial.

Updated to include 1926 volumes. Also 1926 volumes of Atlantic monthly, Blackwood, and Strand.

Jan 7 2022, 12:01 AM · User-Inductiveload

Jan 3 2022

Languageseeker added a comment to T298493: PD 1926.

Fixed Author - Title column mix-up

Jan 3 2022, 9:05 PM · User-Inductiveload
Languageseeker created T298493: PD 1926.
Jan 3 2022, 7:40 PM · User-Inductiveload

Dec 24 2021

Languageseeker added a comment to T294352: Batch Upload for The Smart Set.

Here is the file for the MJP. Turns out it was pretty easy to scrape the data. Hurray for well-designed sites.

Dec 24 2021, 6:55 PM · User-Inductiveload
Languageseeker added a comment to T294352: Batch Upload for The Smart Set.
Dec 24 2021, 5:57 PM · User-Inductiveload

Dec 23 2021

Languageseeker added a comment to T294352: Batch Upload for The Smart Set.

Keep going. The spreadsheet and IA are not wrong. It's the publisher. I think they were trying out a British edition for a bit. Some of the months have two different TOCs, like the albums of the 1960s.

Dec 23 2021, 11:49 PM · User-Inductiveload
Languageseeker added a comment to T294352: Batch Upload for The Smart Set.

The Sim set has TOCs. There's going to be some untangling to do with the number weirdness. For example Sims Volume 58 N 3 is the same as HT V 59 N 3 (July 1919), but September 1919 is different. The Sims set skips V64 without any missing months.

Dec 23 2021, 11:42 PM · User-Inductiveload
Languageseeker added a comment to T294352: Batch Upload for The Smart Set.

The discrepancy seems to come from the fact that some of the SIM set was published in the UK.

Dec 23 2021, 11:31 PM · User-Inductiveload
Languageseeker added a comment to T294352: Batch Upload for The Smart Set.

Looks perfect!

Dec 23 2021, 11:26 PM · User-Inductiveload
Languageseeker added a comment to T294352: Batch Upload for The Smart Set.

fixed

Dec 23 2021, 10:39 PM · User-Inductiveload
Languageseeker created T298279: Batch Upload for The Dial.
Dec 23 2021, 10:32 PM · User-Inductiveload
Languageseeker added a comment to T294352: Batch Upload for The Smart Set.

Unexpectedly got some time today. Here is the list of all the issues of the Smart Set currently available. It seems that Volume 64, Issues 2-4 are either missing or were never issued. Volume 64 and Volume 65 both start with January 1921, but have different TOCs.

Dec 23 2021, 2:45 PM · User-Inductiveload

Dec 22 2021

Languageseeker added a comment to T298173: Batch Upload for Lippincott's Monthly Magazine.

Fixed.

Dec 22 2021, 12:24 AM · User-Inductiveload

Dec 21 2021

Languageseeker updated the task description for T298173: Batch Upload for Lippincott's Monthly Magazine.
Dec 21 2021, 11:38 PM · User-Inductiveload
Languageseeker created T298173: Batch Upload for Lippincott's Monthly Magazine.
Dec 21 2021, 11:28 PM · User-Inductiveload
Languageseeker added a comment to T294352: Batch Upload for The Smart Set.

Thank you for fixing this. I'll make sure to keep this in mind next time. Happy Holidays! :)

Dec 21 2021, 2:13 PM · User-Inductiveload

Oct 28 2021

Languageseeker closed T294182: 5 Books as Resolved.

Perfect. Thank you!

Oct 28 2021, 7:27 PM · User-Inductiveload
Languageseeker closed T291334: Batch Upload for The Atlantic Monthly as Resolved.

Thank you. It's perfect.

Oct 28 2021, 7:26 PM · User-Inductiveload

Oct 26 2021

Languageseeker created T294352: Batch Upload for The Smart Set.
Oct 26 2021, 2:08 PM · User-Inductiveload

Oct 23 2021

Languageseeker created T294182: 5 Books.
Oct 23 2021, 8:37 PM · User-Inductiveload

Oct 5 2021

Languageseeker added a comment to T291805: Batch Upload for Blackwood's Magazine.

Fixed a few broken links, added No Commons

Oct 5 2021, 6:13 PM · User-Inductiveload

Oct 4 2021

Languageseeker added a comment to T291334: Batch Upload for The Atlantic Monthly.

Thank you! Sorry about that. It looks as if the company changed its name a few times. New file uploaded.

Oct 4 2021, 11:43 PM · User-Inductiveload

Sep 27 2021

Inductiveload awarded T291805: Batch Upload for Blackwood's Magazine a Love token.
Sep 27 2021, 10:44 AM · User-Inductiveload
Languageseeker created T291805: Batch Upload for Blackwood's Magazine.
Sep 27 2021, 4:24 AM · User-Inductiveload

Sep 20 2021

Languageseeker added a comment to T291334: Batch Upload for The Atlantic Monthly.

@Inductiveload I would leave the failed pages until the server side upload can be done. My experience is that users are more likely to proofread when the scans are already in place. Periodicals are particularly challenging to process so I don’t mind waiting a little bit for a server side upload.

Sep 20 2021, 6:54 PM · User-Inductiveload

Sep 19 2021

Languageseeker added a comment to T291334: Batch Upload for The Atlantic Monthly.

@Inductiveload I completely understand. That bug essentially breaks batch uploading. I've encountered it many times and it's extremely painful. For all intents and purposes, uploading is broken. I think it might be worth leaving a note on your batch upload page so that other users will understand the situation.

Sep 19 2021, 11:19 PM · User-Inductiveload

Sep 18 2021

Languageseeker created T291334: Batch Upload for The Atlantic Monthly.
Sep 18 2021, 9:58 PM · User-Inductiveload

Sep 14 2021

Languageseeker added a comment to T290900: Server side upload to enwikisource (multiple DJVU files ~200MB each).

Pointing out that in the URL for the batch upload "download" is misspelled. https://archive.org/dowload should be https://archive.org/download .

Sep 14 2021, 12:19 PM · Server-side-upload-request, User-Inductiveload, Internet-Archive

Sep 13 2021

Languageseeker added a comment to T290816: Batch Upload for The Strand.

Of course, take your time. This seems far more complicated than I envisioned. Thank you for doing this.

Sep 13 2021, 11:07 PM · User-Inductiveload
Languageseeker added a comment to T290816: Batch Upload for The Strand.

Here are the remaining volumes for The Strand from the IA. I already uploaded them as PDFs before I noticed that the files are missing an OCR layer. Would it be possible to create an OCRed DJVU with uncompressed images so that they can easily be cropped?

Sep 13 2021, 1:21 PM · User-Inductiveload

Sep 12 2021

Languageseeker updated subscribers of T290816: Batch Upload for The Strand.

@Aklapper I posted it on @Inductiveload's user board per https://en.wikisource.org/wiki/User:Inductiveload/Requests/Batch_uploads. Could you please restore it?

Sep 12 2021, 12:19 PM · User-Inductiveload
Languageseeker added a comment to T290816: Batch Upload for The Strand.


Forgot the vollist in the previous version.

Sep 12 2021, 5:03 AM · User-Inductiveload
Languageseeker created T290816: Batch Upload for The Strand.
Sep 12 2021, 5:01 AM · User-Inductiveload

Jun 12 2021

Languageseeker added a comment to T166138: Please add Petit Formal Script to the UniversalLanguageSelector.

Is it possible to merge this patch?

Jun 12 2021, 12:27 PM · All-and-every-Wikisource, Privacy Engineering, UniversalLanguageSelector

Apr 28 2021

Languageseeker created T281407: DB Error when attempting to go to user watchlist.
Apr 28 2021, 7:42 PM · SRE, Commons

Apr 27 2021

Languageseeker added a comment to T281195: ProofreadPage: allow access to proofread page index stats from templates/modules.

Could we store these as tags that are updated whenever a Page is saved?

Apr 27 2021, 12:05 AM · User-Inductiveload, ProofreadPage

Apr 26 2021

Languageseeker added a comment to T281019: Please Upload large files to Commons.

That’s not the same file. 002 uploaded while 010 failed. It seems that the error occurs during the publishing stage.

Apr 26 2021, 1:13 PM · SRE, Wikimedia-Site-requests, Internet-Archive
Languageseeker added a comment to T281019: Please Upload large files to Commons.

I got a 503 error with web.archive.org/web/20150905070709if_/http://www.quartos.org/quarto_images/ham-1625-22278x-fol-c01/ham-1625-22278x-fol-c01-010.tif

Apr 26 2021, 12:15 PM · SRE, Wikimedia-Site-requests, Internet-Archive

Apr 25 2021

Languageseeker added a comment to T269518: IA Upload: Permit duplicate IA identifier if of a different format.

The IA tool did not warn me when creating duplicates yesterday, leading to duplicate indexes. I caught them by chance, but I want to flag this as an issue. If we want to permit the creation of duplicate files, albeit in different formats, then the warning needs to be in place and require confirmation to override.

Apr 25 2021, 6:40 PM · IA Upload, Community-Tech
Languageseeker added a comment to T281019: Please Upload large files to Commons.

I don’t think it’s the IA because I tried uploading these files via Pattypan and kept getting a myriad of errors. Failure seems almost certain and success a random occurrence. That’s why I suggested trying to upload the file many times.

Apr 25 2021, 4:27 PM · SRE, Wikimedia-Site-requests, Internet-Archive
Languageseeker added a comment to T281019: Please Upload large files to Commons.

As a shot in the dark, would it be possible to try with something other than wget?

Apr 25 2021, 3:57 PM · SRE, Wikimedia-Site-requests, Internet-Archive
Languageseeker added a comment to T281019: Please Upload large files to Commons.

I'm actually not surprised that the c01 files are failing. For some reason, SRE does not seem to like the TIF files from the British Library. However, I've been able to get them to upload by repeatedly trying. Would it be possible to make a script that attempts to upload each file around 100 times and see if it goes through? That's how I got some of them onto Commons.
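
A retry loop of the kind suggested above might be sketched as follows. The names `upload_with_retries` and `flaky_upload` are hypothetical, and the actual upload step (wget, Pattypan, or a server-side script) is deliberately left as a caller-supplied function, since the ticket does not say how it would be done.

```python
import time
from typing import Callable


def upload_with_retries(
    upload: Callable[[], bool],
    max_attempts: int = 100,
    delay_seconds: float = 0.0,
) -> int:
    """Call upload() until it returns True; return the number of attempts used.

    Raises RuntimeError if every attempt fails.
    """
    for attempt in range(1, max_attempts + 1):
        try:
            if upload():
                return attempt
        except Exception:
            # Assumption: any error from the upload step counts as a failed try.
            pass
        time.sleep(delay_seconds)
    raise RuntimeError(f"upload failed after {max_attempts} attempts")


# Stand-in for a real upload: fails twice, then succeeds.
calls = {"count": 0}


def flaky_upload() -> bool:
    calls["count"] += 1
    return calls["count"] >= 3


print(upload_with_retries(flaky_upload))
```

For real uploads, a non-zero `delay_seconds` would avoid hammering the server between attempts.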

Apr 25 2021, 3:52 PM · SRE, Wikimedia-Site-requests, Internet-Archive

Apr 24 2021

Languageseeker added a comment to T281018: Please Upload Large Files to Commons.

I think that a few might have escaped the list. Can you also add these please:

Apr 24 2021, 7:40 PM · Internet-Archive, Wikimedia-Site-requests
Languageseeker added a comment to T281018: Please Upload Large Files to Commons.

Yes, please.

Apr 24 2021, 5:50 PM · Internet-Archive, Wikimedia-Site-requests
Languageseeker added a comment to T281019: Please Upload large files to Commons.

Yes, please.

Apr 24 2021, 5:50 PM · SRE, Wikimedia-Site-requests, Internet-Archive
Languageseeker added a comment to T281020: Upload Large Files to Commons.

Yes, please.

Apr 24 2021, 5:50 PM · Internet-Archive, Wikimedia-Site-requests
Languageseeker created T281020: Upload Large Files to Commons.
Apr 24 2021, 3:28 AM · Internet-Archive, Wikimedia-Site-requests
Languageseeker created T281019: Please Upload large files to Commons.
Apr 24 2021, 3:25 AM · SRE, Wikimedia-Site-requests, Internet-Archive
Languageseeker created T281018: Please Upload Large Files to Commons.
Apr 24 2021, 3:22 AM · Internet-Archive, Wikimedia-Site-requests

Apr 22 2021

Languageseeker added a comment to T278104: Unable to upload to Commons: uploadstash-file-not-found: Key "187kyl5ozj74.xtav8j.51508.djvu" not found in stash.

I've been getting a similar error with Pattypan all day.

Apr 22 2021, 3:31 AM · SRE-swift-storage, User-Inductiveload

Apr 13 2021

Languageseeker created T279975: IA Tool blocked by Commons.
Apr 13 2021, 12:26 AM · Community-Tech, IA Upload

Apr 10 2021

Languageseeker added a comment to T277192: Wikisource: Investigate improving column support for OCR [8H].

There’s also Layout Parser that employs deep learning to analyze, parse, and OCR very complicated layouts.

Apr 10 2021, 5:29 AM · Community-Tech (CommTech-Sprint-1), All-and-every-Wikisource, Wikimedia OCR

Apr 7 2021

Languageseeker created T279537: Ability to link files across Wikiprojects based on Commons metadata Internet Archive ID.
Apr 7 2021, 1:59 PM · StructuredDataOnCommons, Commons, Internet-Archive

Apr 2 2021

Languageseeker updated the task description for T279125: Create an Importer for Hathi Trust.
Apr 2 2021, 3:20 AM · All-and-every-Wikisource
Languageseeker updated the task description for T279125: Create an Importer for Hathi Trust.
Apr 2 2021, 3:17 AM · All-and-every-Wikisource
Languageseeker updated the task description for T279125: Create an Importer for Hathi Trust.
Apr 2 2021, 3:14 AM · All-and-every-Wikisource
Languageseeker created T279125: Create an Importer for Hathi Trust.
Apr 2 2021, 3:13 AM · All-and-every-Wikisource
Languageseeker added a comment to T277768: Wikisource: Investigate adding support for bulk OCR to Wikimedia OCR [16H].

I think it’s important to point out that PDF does not always work, especially for larger files. A single page can range anywhere from 500 KB to over 35 MB. When you factor in the number of pages in a work, you can easily get a PDF that is over 1 GB in size. Currently, only Chunked Uploader can potentially handle that. However, I tried uploading a 1.20 GB PDF over 10 times and failed even with async unchecked. Even going to the stash failed. Now, I’m not calling for removing support for PDF, and I know that this is not entirely in scope for this project. However, if we’re discussing how to bulk store OCR, we also need to make sure that users can upload files to OCR. Even Fæ cannot upload some PDFs from IA. So if we are to support bulk OCR, we’d either need to compress PDFs to death like IA does, improve Commons upload to robustly support files of several GB even when dealing with failed or unreliable connections, or develop a container as Xover commented on. A container is a larger project, but one that can have widespread benefits. For example, it would be possible to upload the front and back of a coin as a single entry.

Apr 2 2021, 12:34 AM · Community-Tech (CommTech-Sprint-1), Wikimedia OCR, All-and-every-Wikisource

Mar 31 2021

Languageseeker added a comment to T278623: Create a Section for Numerically Sequencing Images on Index ns.

@Xover and @Inductiveload Thank you both for your feedback and comments. I'm glad that there is a way to reduce the size of the PDF generated from Hathi Trust images. However, I feel like there are two separate issues being discussed.

Mar 31 2021, 9:00 PM · ProofreadPage
Languageseeker added a comment to T277768: Wikisource: Investigate adding support for bulk OCR to Wikimedia OCR [16H].

There are two major and related questions to answer here. First, when should the OCR tool be run? Second, how should the result be stored? While it’s tempting to run the OCR on the Index page, OCRing an entire book takes a considerable amount of time during which the user cannot edit, wasting valuable user time and potentially resulting in the user leaving. Furthermore, it’s not actually necessary to wait that long because OCR can be performed earlier. When OCR can be performed depends on how the individual scans that make up a book are stored. These are the major options that I can think of:

Mar 31 2021, 8:00 PM · Community-Tech (CommTech-Sprint-1), Wikimedia OCR, All-and-every-Wikisource
Languageseeker added a comment to T278623: Create a Section for Numerically Sequencing Images on Index ns.

@Inductiveload While I agree that individual files make managing the files a real pain, it's probably the only way to do it. I proposed allowing Commons to accept a book scan as one zip in T277921 and AntiCompositeNumber wrote "Nope nope nope, land of 10,000 nopes."

Mar 31 2021, 2:22 PM · ProofreadPage
Languageseeker added a comment to T278623: Create a Section for Numerically Sequencing Images on Index ns.

I don't think that things will magically begin to work. However, there are multiple cases where an image-based Index would make sense: the Balinese Leaf Project or any index that is suffering from display issues. See the following Phabricator tickets: T224355, T256848, T257025, T184867. Also, see https://en.wikisource.org/wiki/User:Inductiveload/jump_to_file

Mar 31 2021, 12:33 PM · ProofreadPage
Languageseeker added a comment to T277768: Wikisource: Investigate adding support for bulk OCR to Wikimedia OCR [16H].

It occurs to me that storing the OCR text and the Proofread text on Commons with the original image could actually become a very valuable dataset for investigating where OCR fails and help Wikisource at the same time. It would probably look something like this.

Mar 31 2021, 4:03 AM · Community-Tech (CommTech-Sprint-1), Wikimedia OCR, All-and-every-Wikisource
Languageseeker added a comment to T277768: Wikisource: Investigate adding support for bulk OCR to Wikimedia OCR [16H].

We'd also need to figure out how to do this on Index pages comprised of single images, such as https://en.wikisource.org/wiki/Index:Lippincotts_Monthly_Magazine_51 . This should probably be done somewhere on an Index page.

Mar 31 2021, 2:09 AM · Community-Tech (CommTech-Sprint-1), Wikimedia OCR, All-and-every-Wikisource
Languageseeker added a comment to T278623: Create a Section for Numerically Sequencing Images on Index ns.

Ok, I see your point. That was the most minor of quibbles. Your original proposal is great. Hope to see this happen.

Mar 31 2021, 1:15 AM · ProofreadPage
Languageseeker added a comment to T277192: Wikisource: Investigate improving column support for OCR [8H].

I think that for most works, it would be better to make a flexible template to split the pages and allow the user to adjust manually. Generally, there are two major types of column usage in books:

  1. Header - Two Columns - Footer
  2. Header - Two Columns - Image - Two Columns - Image - Two Columns - Footer
Mar 31 2021, 1:10 AM · Community-Tech (CommTech-Sprint-1), All-and-every-Wikisource, Wikimedia OCR

Mar 30 2021

Languageseeker added a comment to T278623: Create a Section for Numerically Sequencing Images on Index ns.

That sounds like a great approach! Probably, to make life a bit easier for users, we should add "[[File: " and "]]" in the code

Mar 30 2021, 11:13 PM · ProofreadPage
Languageseeker added a comment to T278623: Create a Section for Numerically Sequencing Images on Index ns.

There are multiple issues with PDF.

Mar 30 2021, 7:09 PM · ProofreadPage
Languageseeker added a comment to T278623: Create a Section for Numerically Sequencing Images on Index ns.

The issue seems to be that if you create an Index from individual images, many of the scripts and gadgets break because image-based Indexes are handled differently from PDF/DJVU. A simple task such as numbering pages becomes far more difficult.

Mar 30 2021, 11:44 AM · ProofreadPage
Languageseeker added a comment to T278623: Create a Section for Numerically Sequencing Images on Index ns.

The basic issue appears to be that the variable page is never set for images and this is cascading downwards.

Mar 30 2021, 1:08 AM · ProofreadPage

Mar 29 2021

Languageseeker added a comment to T278623: Create a Section for Numerically Sequencing Images on Index ns.

Update: while reading the documentation for ProofreadPage (1), I discovered that you can transclude an image sequence using the format <pages index="Index Name" from="Start Image" to="End Image"/> even if the images are in a different format. This implies that at some point, the code does numerically sequence the image files. However, on the Index ns <pagelist /> does not work. This implies that the Index ns is not aware of, or does not make use of, the numerical sequence that exists.

Mar 29 2021, 1:28 AM · ProofreadPage

Mar 27 2021

Languageseeker updated the task description for T278623: Create a Section for Numerically Sequencing Images on Index ns.
Mar 27 2021, 2:47 PM · ProofreadPage
Languageseeker updated the task description for T278623: Create a Section for Numerically Sequencing Images on Index ns.
Mar 27 2021, 2:42 PM · ProofreadPage
Languageseeker updated subscribers of T278623: Create a Section for Numerically Sequencing Images on Index ns.
Mar 27 2021, 2:35 PM · ProofreadPage
Languageseeker created T278623: Create a Section for Numerically Sequencing Images on Index ns.
Mar 27 2021, 2:32 PM · ProofreadPage

Mar 25 2021

Languageseeker added a comment to T278184: Add ability to Download PDF of original scan of chapter/article.

Commons is not the right place because this would require two major components that only Wikisource has both:

Mar 25 2021, 1:28 AM · Commons

Mar 24 2021

Languageseeker added a comment to T278179: Allow for Custom Edit Bar for Pages based on Index..

@Inductiveload Templates are great as well and may be easier to do.

Mar 24 2021, 5:28 PM · ProofreadPage
Languageseeker added a comment to T278184: Add ability to Download PDF of original scan of chapter/article.

No, it's downloading a subset of that file. A journal volume can be hundreds of pages, while an article can be less than one.

Mar 24 2021, 1:55 PM · Commons
Languageseeker added a comment to T278179: Allow for Custom Edit Bar for Pages based on Index..

@Inductiveload @Soda In my opinion, it's both characters and templates. Take, for example, the EB11 project, which uses custom templates such as {{EB1911 Fine Print|}}. It would make sense for these templates to be in the Edit Bar of all EB11 projects, but not in a book on English poetry. Also, I would want the characters to be readily accessible, not buried in menus.

Mar 24 2021, 1:13 PM · ProofreadPage
Languageseeker added a comment to T278179: Allow for Custom Edit Bar for Pages based on Index..

This is also a way to address https://meta.wikimedia.org/wiki/Community_Wishlist_Survey_2020/Wikisource/UI_improvements_on_Wikisource

Mar 24 2021, 1:04 PM · ProofreadPage
Languageseeker added a comment to T278179: Allow for Custom Edit Bar for Pages based on Index..

@Inductiveload Yes, your thinking is close to mine. Basically, this would have three components.

Mar 24 2021, 1:03 PM · ProofreadPage
Languageseeker added a comment to T278184: Add ability to Download PDF of original scan of chapter/article.

This can rely on WS Export to perform the task. However, WS Export seems more tailored to exporting proofread text. Instead, I'm asking to create a function to allow for the download of a PDF consisting of scanned images.

Mar 24 2021, 12:40 PM · Commons
Languageseeker added a comment to T278179: Allow for Custom Edit Bar for Pages based on Index..

@Priyanshugupta1909 Of course, you have the assignment.

Mar 24 2021, 12:38 PM · ProofreadPage
Languageseeker added a comment to T278178: Preload OCR on Pages generated from Image file.

@Sandyabhi OCR is the automatic recognition of text from an image. In other words, the usage of a program such as Tesseract to generate a text layer.

Mar 24 2021, 12:37 PM · ProofreadPage

Mar 22 2021

Languageseeker created T278184: Add ability to Download PDF of original scan of chapter/article.
Mar 22 2021, 8:59 PM · Commons
Languageseeker renamed T278178: Preload OCR on Pages generated from Image file from No OCR on Pages generated from Image file to Preload OCR on Pages generated from Image file.
Mar 22 2021, 8:39 PM · ProofreadPage
Languageseeker created T278179: Allow for Custom Edit Bar for Pages based on Index..
Mar 22 2021, 8:08 PM · ProofreadPage