Page MenuHomePhabricator

Samwilson (Sam Wilson)
Software Engineer (Community Tech) & volunteer

Today

  • Clear sailing ahead.

Tomorrow

  • Clear sailing ahead.

Thursday

  • Clear sailing ahead.

User Details

User Since
Jun 5 2015, 5:03 AM (309 w, 4 d)
Availability
Available
IRC Nick
samwilson
LDAP User
Samwilson
MediaWiki User
Samwilson [ Global Accounts ]

Recent Activity

Today

Samwilson added a comment to T282530: Wikisource Indonesia OCR: Not working so well.

What is the title of that page? Can you add a link to it? Thanks!

Tue, May 11, 10:16 AM · Community-Tech, Wikimedia OCR, Wikisource
Samwilson added a comment to T268400: Investigate IA Upload downtime.

It looks like the downtimes began around April 12, and commit 1ba22eb9083f53c1118175648941a702e80b2a15 was the first major commit recently before that, on March 29.

Tue, May 11, 9:26 AM · Community-Tech (Kanban-2020-21-Q4), Spike, IA Upload
Samwilson committed rEMIPe65a41291310: Remove phpcs exclusions (authored by Samwilson).
Remove phpcs exclusions
Tue, May 11, 4:53 AM
Samwilson moved T268400: Investigate IA Upload downtime from Ready 🎬 to In Development 💻 on the Community-Tech (Kanban-2020-21-Q4) board.
Tue, May 11, 2:14 AM · Community-Tech (Kanban-2020-21-Q4), Spike, IA Upload
Samwilson claimed T268400: Investigate IA Upload downtime.
Tue, May 11, 2:14 AM · Community-Tech (Kanban-2020-21-Q4), Spike, IA Upload
Samwilson added a comment to T282459: Wikimedia OCR: Change "upload.beta.wmflabs.org" to "upload.wikimedia.beta.wmflabs.org" in .env.

This can of course always be overridden locally in .env.local.

Tue, May 11, 2:05 AM · Community-Tech (Kanban-2020-21-Q4), Patch-For-Review, Wikimedia OCR
Samwilson moved T282459: Wikimedia OCR: Change "upload.beta.wmflabs.org" to "upload.wikimedia.beta.wmflabs.org" in .env from Ready 🎬 to Review/Feedback 💬 on the Community-Tech (Kanban-2020-21-Q4) board.
Tue, May 11, 2:04 AM · Community-Tech (Kanban-2020-21-Q4), Patch-For-Review, Wikimedia OCR
Samwilson edited projects for T282459: Wikimedia OCR: Change "upload.beta.wmflabs.org" to "upload.wikimedia.beta.wmflabs.org" in .env, added: Patch-For-Review, Community-Tech (Kanban-2020-21-Q4); removed Community-Tech.
Tue, May 11, 2:04 AM · Community-Tech (Kanban-2020-21-Q4), Patch-For-Review, Wikimedia OCR
Samwilson claimed T282459: Wikimedia OCR: Change "upload.beta.wmflabs.org" to "upload.wikimedia.beta.wmflabs.org" in .env.

PR: https://github.com/wikimedia/wikimedia-ocr/pull/31

Tue, May 11, 2:03 AM · Community-Tech (Kanban-2020-21-Q4), Patch-For-Review, Wikimedia OCR
Samwilson added a comment to T282080: Enable new OCR UI on Beta Wikisource.

It's deployed (demo) but there's a CSP error:

Tue, May 11, 1:25 AM · Community-Tech (Kanban-2020-21-Q4), Patch-For-Review, Wikimedia OCR

Yesterday

Samwilson added a comment to T282080: Enable new OCR UI on Beta Wikisource.

I've scheduled this for the European mid-day backport window, ~3 hours from now.

Mon, May 10, 8:30 AM · Community-Tech (Kanban-2020-21-Q4), Patch-For-Review, Wikimedia OCR

Fri, May 7

Samwilson added a comment to T282210: 'Proofread tools' section doesn't load in WikiEditor.

Also: https://www.mediawiki.org/wiki/Extension:WikiEditor/Toolbar_customization#Determining_when_toolbar_load_is_done suggests not to use wikiEditor-toolbar-doneInitialSections.

Fri, May 7, 6:42 AM · Patch-For-Review, ProofreadPage
Samwilson updated the task description for T282210: 'Proofread tools' section doesn't load in WikiEditor.
Fri, May 7, 5:55 AM · Patch-For-Review, ProofreadPage
Samwilson created T282210: 'Proofread tools' section doesn't load in WikiEditor.
Fri, May 7, 5:49 AM · Patch-For-Review, ProofreadPage

Thu, May 6

Samwilson set the point value for T282080: Enable new OCR UI on Beta Wikisource to 1.
Thu, May 6, 5:54 AM · Community-Tech (Kanban-2020-21-Q4), Patch-For-Review, Wikimedia OCR
Samwilson moved T282080: Enable new OCR UI on Beta Wikisource from Ready 🎬 to Review/Feedback 💬 on the Community-Tech (CommTech-Sprint-2021-05-06-to-2021-05-19) board.
Thu, May 6, 5:54 AM · Community-Tech (Kanban-2020-21-Q4), Patch-For-Review, Wikimedia OCR
Samwilson created T282080: Enable new OCR UI on Beta Wikisource.
Thu, May 6, 5:50 AM · Community-Tech (Kanban-2020-21-Q4), Patch-For-Review, Wikimedia OCR
Samwilson added a comment to T282073: Add API endpoint to retrieve supported languages.

https://tesseract-ocr.github.io/tessdoc/Data-Files suggests that these are not duplicates, but e.g. ben is Bengali and should only be listed once.

Thu, May 6, 5:49 AM · Community-Tech (Kanban-2020-21-Q4), Wikimedia OCR
Samwilson added a comment to T282073: Add API endpoint to retrieve supported languages.

I assume we'll want to map Tesseract's supported language list to return ISO 639-1, which is what we use on-wiki.

Thu, May 6, 5:13 AM · Community-Tech (Kanban-2020-21-Q4), Wikimedia OCR
Samwilson updated subscribers of T282060: Support PSM page segmentation option in the UI.

The basic UI for this has already been done by @MusikAnimal (as part of T280213). See it here: https://ocr-test.wmcloud.org/?engine=tesseract

Thu, May 6, 12:33 AM · Design, Community-Tech, Wikimedia OCR
Samwilson added a comment to T282050: Wikisource OCR: add loading state improvements [placeholder].

No, in the MVP T280848 only step 1 is done, the disabling (it actually also disables the button; is that bad?).

Thu, May 6, 12:29 AM · Design, Wikimedia OCR, Community-Tech

Tue, May 4

Samwilson added a comment to T280617: Wikisource OCR: Validate language codes.

Pinging @Samwilson, since I know Harumi is busy with some Editing team work :)

If we wanted to include support for the experimental & mapped languages, would this be possible? Could we create a ticket for it? Any thoughts/concerns?

Tue, May 4, 11:12 PM · Community-Tech (Kanban-2020-21-Q4), Wikimedia OCR
Samwilson reopened T242406: Remove $wgAllowRequiringEmailForResets feature flag [small] as "Open".

The feature flag still needs to be removed.

Tue, May 4, 10:18 PM · Password-Reset-Update, Community-Tech
Samwilson updated the task description for T265854: Remove the $wsexportConfig global.
Tue, May 4, 7:05 AM · WS Export, Technical-Debt
Samwilson committed rGSVT6c2abdf1c363: i18n updates (authored by Samwilson).
i18n updates
Tue, May 4, 6:54 AM
Samwilson added a comment to T265854: Remove the $wsexportConfig global.

PR for ebook-convert: https://github.com/wikimedia/ws-export/pull/364

Tue, May 4, 6:43 AM · WS Export, Technical-Debt
Samwilson updated the task description for T265854: Remove the $wsexportConfig global.
Tue, May 4, 6:42 AM · WS Export, Technical-Debt
Samwilson added a comment to T278444: Wikisource OCR: investigate exposing an API endpoint (link phetools or google-ocr) to fetch the OCR for a page.

This is pretty much done, and is working at https://ocr.wmcloud.org/api.php

Tue, May 4, 6:30 AM · Wikimedia OCR, Wikisource, Community-Tech
Samwilson closed T259687: Wikisource Export: Remove support for Wikilivres as Resolved.

Done.

Tue, May 4, 4:55 AM · Community-Tech, WS Export
Samwilson added a comment to T259687: Wikisource Export: Remove support for Wikilivres.

Sounds sensible. PR: https://github.com/wikimedia/ws-export/pull/363

Tue, May 4, 2:20 AM · Community-Tech, WS Export
Samwilson renamed T259687: Wikisource Export: Remove support for Wikilivres from Wikisource Export: Support Wikilivres to Wikisource Export: Remove support for Wikilivres.
Tue, May 4, 2:19 AM · Community-Tech, WS Export
Samwilson moved T253282: Wikisource ebooks: Investigate using subpages from all pages, not just those with ws-summary from Backlog to Ready to work on on the WS Export board.
Tue, May 4, 12:31 AM · Wikisource, Community-Tech, WS Export
Samwilson moved T250614: Timeout while generating PDF using WSexport from Backlog to Ready to work on on the WS Export board.
Tue, May 4, 12:31 AM · Wikisource, WS Export, Community-Tech
Samwilson moved T252254: Button to send '.mobi' files directly to kindle from Wikisource from Backlog to Discussion needed on the WS Export board.
Tue, May 4, 12:31 AM · Community-Tech, WS Export, Wikisource, Wikimedia-Hackathon-2020

Mon, May 3

Samwilson moved T276887: Wikisource Export: Remove closed Wikisources from language dropdown from Backlog to Ready to work on on the WS Export board.
Mon, May 3, 10:50 PM · WS Export, Community-Tech
Samwilson moved T280306: Wikisource: Add Exports per day to ws-export stats from Backlog to Ready to work on on the WS Export board.
Mon, May 3, 10:50 PM · Wikisource, WS Export, Community-Tech
Samwilson moved T279855: Ws-export: Hide export menu in edit mode from Backlog to Ready to work on on the WS Export board.
Mon, May 3, 10:50 PM · Community-Tech, WS Export
Samwilson moved T278290: WS-export Improve TOC generation documentation from Backlog to Ready to work on on the WS Export board.
Mon, May 3, 10:49 PM · Documentation, WS Export, Community-Tech
Samwilson moved T277435: Include copyright metadata based on Wikidata P6216 from Backlog to Discussion needed on the WS Export board.
Mon, May 3, 10:49 PM · Community-Tech, WS Export
Samwilson moved T277248: WS export download dialog: fire a hook when dialog shown from Backlog to Ready to work on on the WS Export board.
Mon, May 3, 10:49 PM · WS Export, Community-Tech
Samwilson added a comment to T270770: Wikisource Export: display export time data [placeholder].

Given that we're recording the book-generation times but not displaying them anywhere, this ticket seems like it might still be valid. (Not that Community-Tech is going to work on it.)

Mon, May 3, 10:49 PM · Wikisource, WS Export, Community-Tech
Samwilson moved T272763: Wikisource Export: Change the cover image to local Wikisource logo from Design/Product to Ready to work on on the WS Export board.
Mon, May 3, 10:47 PM · Wikisource-Community-User-Group, Community-Tech, WS Export
Samwilson moved T281522: Ws Export: Internal links missing from Backlog to Ready to work on on the WS Export board.
Mon, May 3, 10:47 PM · WS Export, Community-Tech
Samwilson moved T280637: Disable the WS-Export extension on Hebrew Wikisource from Backlog to Discussion needed on the WS Export board.
Mon, May 3, 10:47 PM · Community-Tech, WS Export, Wikimedia-Site-requests
Samwilson added a comment to T281494: SPIKE: Enable Clean up in OCR Proofreading (4hours).

Here's a task about cleaning up Google's structured output into wikitext: T250185: Make Wikisource-OCR handle paragraphs better

Mon, May 3, 7:54 AM · Community-Tech, Wikisource, Wikimedia OCR
Samwilson set the point value for T281129: Wikimedia OCR: "Call to a member function getText() on null" when image has no text to 1.
Mon, May 3, 7:31 AM · Community-Tech (Kanban-2020-21-Q4), Wikimedia OCR
Samwilson claimed T281129: Wikimedia OCR: "Call to a member function getText() on null" when image has no text.

PR: https://github.com/wikimedia/wikimedia-ocr/pull/26

Mon, May 3, 7:31 AM · Community-Tech (Kanban-2020-21-Q4), Wikimedia OCR
Samwilson added a project to T279405: Add support for multiple languages: Community-Tech.

The tool work for this has been done in T280214.

Mon, May 3, 7:20 AM · Community-Tech, Wikimedia OCR
Samwilson added a comment to T281539: Wikisource OCR: can we provide a link to the form page within proofread page?.

It's definitely possible (and pretty simple) to add a link with prefilled image URL and language, as well as a backlink (probably as an extra, new, URL parameter).

Mon, May 3, 6:25 AM · Design, Wikimedia OCR, Community-Tech
Samwilson added a comment to T280231: Add optional asset build process to Toolforge Bundle's deploy script.

I think we need to also install, before building: https://github.com/wikimedia/ToolforgeBundle/pull/51

Mon, May 3, 2:08 AM · Community-Tech (Kanban-2020-21-Q4), Wikimedia OCR, ToolforgeBundle
Samwilson moved T280214: Wikisource OCR: Accept Google options on the API from Product sign-off 🤘 to QA 🐛 on the Community-Tech (Kanban-2020-21-Q4) board.

Better UI is merged (single input with tag-style language codes). Live on the test site (only, so far). @dom_walden, not sure if you want to have another look at this.

Mon, May 3, 2:00 AM · Community-Tech (Kanban-2020-21-Q4), Wikimedia OCR, Wikisource

Fri, Apr 30

Samwilson added a comment to T280848: Implement MVP of OCR in Wikisource extension.

FWIW, my experience with Phe's OCR tool (whose gadget just sets .disable() on #wpTextBox1) is that this is not particularly intuitive for end users.

Fri, Apr 30, 6:15 AM · MW-1.37-notes (1.37.0-wmf.5; 2021-05-11), Patch-For-Review, Wikimedia OCR, Community-Tech (Kanban-2020-21-Q4), Wikisource
Samwilson moved T280848: Implement MVP of OCR in Wikisource extension from In Development 💻 to Review/Feedback 💬 on the Community-Tech (Kanban-2020-21-Q4) board.

Ready for review: https://gerrit.wikimedia.org/r/c/mediawiki/extensions/Wikisource/+/682034 (dependent on https://github.com/wikimedia/wikimedia-ocr/pull/24 ).

Fri, Apr 30, 3:16 AM · MW-1.37-notes (1.37.0-wmf.5; 2021-05-11), Patch-For-Review, Wikimedia OCR, Community-Tech (Kanban-2020-21-Q4), Wikisource

Thu, Apr 29

Samwilson added a comment to T281026: Export words from Wikisource to Wiktionary.

This sounds similar to the 2020 wishlist proposal Insert attestation using Wikisource as a corpus, which references:

Thu, Apr 29, 3:47 AM · Wiktionary, Wikisource
Samwilson added a comment to T280231: Add optional asset build process to Toolforge Bundle's deploy script.

Merged, and released in 1.4.0 (of the bundle).

Thu, Apr 29, 3:25 AM · Community-Tech (Kanban-2020-21-Q4), Wikimedia OCR, ToolforgeBundle
Samwilson moved T280953: Allow images and requests to come from localhost from In Development 💻 to Review/Feedback 💬 on the Community-Tech (Kanban-2020-21-Q4) board.

PR for review: https://github.com/wikimedia/wikimedia-ocr/pull/24

Thu, Apr 29, 3:06 AM · Community-Tech (Kanban-2020-21-Q4), Wikimedia OCR
Samwilson claimed T280953: Allow images and requests to come from localhost.

There are actually two sets of host names (one list and one single one) that should be configurable:

Thu, Apr 29, 2:19 AM · Community-Tech (Kanban-2020-21-Q4), Wikimedia OCR
Samwilson moved T280953: Allow images and requests to come from localhost from Ready 🎬 to In Development 💻 on the Community-Tech (Kanban-2020-21-Q4) board.
Thu, Apr 29, 2:19 AM · Community-Tech (Kanban-2020-21-Q4), Wikimedia OCR
Samwilson edited projects for T280953: Allow images and requests to come from localhost, added: Community-Tech (Kanban-2020-21-Q4); removed Community-Tech.
Thu, Apr 29, 2:19 AM · Community-Tech (Kanban-2020-21-Q4), Wikimedia OCR
Samwilson claimed T280848: Implement MVP of OCR in Wikisource extension.

Can someone walk me through the flow for this, a bit confused about what hosting it on the extension means. Do we have mocks for this extension UI and if so can folks direct me to them?

Thu, Apr 29, 1:23 AM · MW-1.37-notes (1.37.0-wmf.5; 2021-05-11), Patch-For-Review, Wikimedia OCR, Community-Tech (Kanban-2020-21-Q4), Wikisource
Samwilson added a parent task for T280953: Allow images and requests to come from localhost: T280848: Implement MVP of OCR in Wikisource extension.
Thu, Apr 29, 1:07 AM · Community-Tech (Kanban-2020-21-Q4), Wikimedia OCR
Samwilson added a subtask for T280848: Implement MVP of OCR in Wikisource extension: T280953: Allow images and requests to come from localhost.
Thu, Apr 29, 1:07 AM · MW-1.37-notes (1.37.0-wmf.5; 2021-05-11), Patch-For-Review, Wikimedia OCR, Community-Tech (Kanban-2020-21-Q4), Wikisource

Fri, Apr 23

Samwilson added a comment to T280848: Implement MVP of OCR in Wikisource extension.

I've started looking at this and have a basic functioning system with two buttons in the tool bar (one for each engine).

Fri, Apr 23, 6:21 AM · MW-1.37-notes (1.37.0-wmf.5; 2021-05-11), Patch-For-Review, Wikimedia OCR, Community-Tech (Kanban-2020-21-Q4), Wikisource
Samwilson created T280953: Allow images and requests to come from localhost.
Fri, Apr 23, 6:02 AM · Community-Tech (Kanban-2020-21-Q4), Wikimedia OCR
Samwilson closed T280212: Wikisource OCR: Tesseract OCR gadget button to Wikisource extension as Declined.

Closing. Instead of this, we're going to go directly to the new design (in T280848).

Fri, Apr 23, 1:58 AM · Community-Tech (Kanban-2020-21-Q4), Wikimedia OCR, Wikisource

Wed, Apr 21

Samwilson added a comment to T280284: Create prod VPS for Wikimedia OCR.

PR to disable multithreading: https://github.com/wikimedia/wikimedia-ocr/pull/17

Wed, Apr 21, 6:55 AM · Community-Tech (Kanban-2020-21-Q4), Wikimedia OCR

Tue, Apr 20

Samwilson added a comment to T274521: WS Export in hewikisource unreadable: Large fonts, translated (?) markup tags, many square boxes.

The remaining issues here look like they're to do with font support. I've installed the Culmus fonts (as recommended here) and with, for example, Yehuda CLM the above page looks like this:

Tue, Apr 20, 10:13 AM · Parsoid (Tracking), Community-Tech, WS Export
Samwilson claimed T280212: Wikisource OCR: Tesseract OCR gadget button to Wikisource extension.

This will include the Fraktur OCR button for German Wikisource as well.

Tue, Apr 20, 2:44 AM · Community-Tech (Kanban-2020-21-Q4), Wikimedia OCR, Wikisource
Samwilson updated the task description for T280617: Wikisource OCR: Validate language codes.
Tue, Apr 20, 2:00 AM · Community-Tech (Kanban-2020-21-Q4), Wikimedia OCR
Samwilson created T280617: Wikisource OCR: Validate language codes.
Tue, Apr 20, 2:00 AM · Community-Tech (Kanban-2020-21-Q4), Wikimedia OCR

Mon, Apr 19

Samwilson committed rGSVT124cd1e44384: Build assets (authored by Samwilson).
Build assets
Mon, Apr 19, 1:10 AM

Sat, Apr 17

Samwilson moved T277129: Replace our Google Cloud Vision package with the official one from In Development 💻 to Review/Feedback 💬 on the Community-Tech (Kanban-2020-21-Q4) board.

Yes, that's all correct @ifried, sorry I forgot to move it before!

Sat, Apr 17, 12:42 AM · Community-Tech (Kanban-2020-21-Q4), Wikimedia OCR

Fri, Apr 16

Samwilson added a comment to T280213: Wikisource OCR: Accept Tesseract options on the API.

The multiple languages part of this will be dealt with in T280214 (because the lang list is common to both engines). We might want to still do some per-engine verification of the language codes though.

Fri, Apr 16, 6:35 AM · Community-Tech (Kanban-2020-21-Q4), Wikimedia OCR, Wikisource
Samwilson moved T280214: Wikisource OCR: Accept Google options on the API from In Development 💻 to Review/Feedback 💬 on the Community-Tech (Kanban-2020-21-Q4) board.

PR: https://github.com/wikimedia/wikimedia-ocr/pull/16

Fri, Apr 16, 6:34 AM · Community-Tech (Kanban-2020-21-Q4), Wikimedia OCR, Wikisource
Samwilson moved T280209: Create test VPS for Wikimedia OCR from In Development 💻 to Review/Feedback 💬 on the Community-Tech (Kanban-2020-21-Q4) board.
Fri, Apr 16, 4:58 AM · Community-Tech (Kanban-2020-21-Q4), Wikimedia OCR
Samwilson claimed T280214: Wikisource OCR: Accept Google options on the API.
Fri, Apr 16, 4:58 AM · Community-Tech (Kanban-2020-21-Q4), Wikimedia OCR, Wikisource
Samwilson closed T279988: Download PDF option on Odia Wikipedia: Typeface issues in Odia as Resolved.

2.6.2 is released now and includes the new translations: https://ws-export.wmcloud.org/?uselang=or

Fri, Apr 16, 12:55 AM · Community-Tech, WS Export

Thu, Apr 15

Samwilson updated the task description for T280231: Add optional asset build process to Toolforge Bundle's deploy script.
Thu, Apr 15, 9:02 AM · Community-Tech (Kanban-2020-21-Q4), Wikimedia OCR, ToolforgeBundle
Samwilson added a comment to T280209: Create test VPS for Wikimedia OCR.

Test site is up and running, with auto deployment. The documentation has started at https://wikitech.wikimedia.org/wiki/Nova_Resource:Wikisource/Wikimedia_OCR but there's more to be added.

Thu, Apr 15, 9:00 AM · Community-Tech (Kanban-2020-21-Q4), Wikimedia OCR
Samwilson created T280231: Add optional asset build process to Toolforge Bundle's deploy script.
Thu, Apr 15, 8:58 AM · Community-Tech (Kanban-2020-21-Q4), Wikimedia OCR, ToolforgeBundle
Samwilson added a comment to T279988: Download PDF option on Odia Wikipedia: Typeface issues in Odia.

Great, thanks. The new translations should be live within a week.

Thu, Apr 15, 4:10 AM · Community-Tech, WS Export
Samwilson moved T280209: Create test VPS for Wikimedia OCR from Ready 🎬 to In Development 💻 on the Community-Tech (Kanban-2020-21-Q4) board.
Thu, Apr 15, 2:10 AM · Community-Tech (Kanban-2020-21-Q4), Wikimedia OCR
Samwilson added a project to T280209: Create test VPS for Wikimedia OCR: Community-Tech (Kanban-2020-21-Q4).
Thu, Apr 15, 2:10 AM · Community-Tech (Kanban-2020-21-Q4), Wikimedia OCR
Samwilson created T280209: Create test VPS for Wikimedia OCR.
Thu, Apr 15, 12:55 AM · Community-Tech (Kanban-2020-21-Q4), Wikimedia OCR

Wed, Apr 14

Samwilson updated subscribers of T272763: Wikisource Export: Change the cover image to local Wikisource logo.

@Psubhashish: done.

Wed, Apr 14, 10:11 AM · Wikisource-Community-User-Group, Community-Tech, WS Export
Samwilson added a comment to T279988: Download PDF option on Odia Wikipedia: Typeface issues in Odia.

Thanks for reporting these issues!

Wed, Apr 14, 8:58 AM · Community-Tech, WS Export
Samwilson edited projects for T275547: Wikisource OCR: Move Wikimedia OCR gadget to Wikisource extension, added: Wikimedia OCR; removed ProofreadPage.
Wed, Apr 14, 5:59 AM · Wikimedia OCR, Community-Tech (Kanban-2020-21-Q4), Wikisource
Samwilson updated the task description for T275547: Wikisource OCR: Move Wikimedia OCR gadget to Wikisource extension.
Wed, Apr 14, 5:59 AM · Wikimedia OCR, Community-Tech (Kanban-2020-21-Q4), Wikisource
Samwilson added a comment to T280098: IA-upload: Include the OCLC ID in the {{Book}} template.

Good points! And the patch is nice and simple too (look at me, trying to over-complicate things as usual… :-P )

Wed, Apr 14, 3:13 AM · Community-Tech, IA Upload
Samwilson added a comment to T280098: IA-upload: Include the OCLC ID in the {{Book}} template.

I wonder if IA Upload should create a Wikidata item as well, where we could store that stuff?

Wed, Apr 14, 1:16 AM · Community-Tech, IA Upload
Samwilson added a comment to T261568: Bug in French with the gender of the "connecté" adjective.

It looks like the wikimedia/simplei18n package doesn't support {{GENDER}}, but even if it did there doesn't seem to be any way for us to get that information about a user. Do other tools handle this correctly?

Wed, Apr 14, 1:03 AM · IA Upload, Community-Tech
Samwilson closed T269518: IA Upload: Permit duplicate IA identifier if of a different format as Resolved.
Wed, Apr 14, 12:51 AM · IA Upload, Community-Tech
Samwilson closed T280038: IA-upload: filename shown as File:Array on success as Resolved.
Wed, Apr 14, 12:50 AM · IA Upload, Community-Tech
Samwilson added a comment to T280038: IA-upload: filename shown as File:Array on success.

Fixed in https://github.com/wikisource/ia-upload/pull/53

Wed, Apr 14, 12:40 AM · IA Upload, Community-Tech

Tue, Apr 13

Samwilson created T279989: Shouldn't be possible to push direct to master.
Tue, Apr 13, 8:14 AM · Gerrit
Samwilson moved T279118: Wikisource OCR: add support for tesseract on wikimedia ocr from Review/Feedback 💬 to QA 🐛 on the Community-Tech (Kanban-2020-21-Q4) board.

This is all merged now deployed to the test site: https://ocr-test.toolforge.org

Tue, Apr 13, 7:35 AM · Community-Tech (Kanban-2020-21-Q4), Wikimedia OCR, Wikisource
Samwilson added a comment to T277129: Replace our Google Cloud Vision package with the official one.

Draft PR: https://github.com/wikimedia/wikimedia-ocr/pull/14

Tue, Apr 13, 6:06 AM · Community-Tech (Kanban-2020-21-Q4), Wikimedia OCR
Samwilson merged T275967: Full book not download in Bengali Wikisource, ignores subpage into T274747: Subpages are not exported where AuxTOC template indexes chapters.
Tue, Apr 13, 2:18 AM · Parsoid, Community-Tech, WS Export
Samwilson merged task T275967: Full book not download in Bengali Wikisource, ignores subpage into T274747: Subpages are not exported where AuxTOC template indexes chapters.
Tue, Apr 13, 2:18 AM · WS Export, Community-Tech
Samwilson added a comment to T275967: Full book not download in Bengali Wikisource, ignores subpage.

It looks like this is the same Parsoid issue as T274747: see the difference between the wiki page and the Parsoid HTML. The issue is in the Auxiliary Table of Contents template.

Tue, Apr 13, 2:18 AM · WS Export, Community-Tech