Page MenuHomePhabricator
Feed Advanced Search

Apr 29 2020

kaldari added a comment to T249788: RFC: Remove Android 2 from basic support in compatibility matrix.

Has anyone noticed that most of OOUI's fallback PNGs are broken anyway?
Help icon:

help.png (20×20 px, 244 B)

Edit pencil:
edit.png (20×20 px, 326 B)

Eyeball (for VE):
eye.png (20×20 px, 312 B)

Apr 29 2020, 4:14 AM · TechCom-RFC (TechCom-RFC-Closed), MediaWiki-General, Browser-Support-Android-Google-Chrome

Apr 28 2020

kaldari created T251344: Declare mediawiki-api stable.
Apr 28 2020, 8:45 PM · User-Addshore, Addwiki
kaldari added a comment to T236341: Decouple UploadWizardConfig.maxUploads and maxUploads for Flickr imports.

@AlexisJazz - I've tested it on test.wikipedia.org under both an admin account and a regular user account and it works as advertised. The change will go out to Commons tomorrow and shouldn't have any noticeable effect. If nothing breaks, we can change the user rights on Commons after tomorrow.

Apr 28 2020, 3:06 PM · MW-1.35-notes (1.35.0-wmf.30; 2020-04-28), Commons, UploadWizard

Apr 27 2020

kaldari reassigned T247875: Assist with maintaining aggregate values in numerical tables from kaldari to Tgr.
Apr 27 2020, 5:40 PM · covid-19, MediaWiki-General
kaldari moved T251151: List of COVID-19 SVG maps from Incoming to Product Doing on the covid-19 board.
Apr 27 2020, 5:23 PM · COVID-19-Graphics, covid-19
kaldari renamed T251151: List of COVID-19 SVG maps from List of COVID-19 maps to List of COVID-19 SVG maps.
Apr 27 2020, 3:38 PM · COVID-19-Graphics, covid-19
kaldari updated the task description for T249127: Create regularly updated maps of COVID-19 outbreak.
Apr 27 2020, 3:37 PM · Design, covid-19, COVID-19-Graphics
kaldari updated the task description for T251151: List of COVID-19 SVG maps.
Apr 27 2020, 3:35 PM · COVID-19-Graphics, covid-19
kaldari updated the task description for T251151: List of COVID-19 SVG maps.
Apr 27 2020, 3:29 PM · COVID-19-Graphics, covid-19
kaldari updated the task description for T251151: List of COVID-19 SVG maps.
Apr 27 2020, 3:24 PM · COVID-19-Graphics, covid-19
kaldari updated the task description for T251151: List of COVID-19 SVG maps.
Apr 27 2020, 3:17 PM · COVID-19-Graphics, covid-19
kaldari added a parent task for T251151: List of COVID-19 SVG maps: T249127: Create regularly updated maps of COVID-19 outbreak.
Apr 27 2020, 3:08 PM · COVID-19-Graphics, covid-19
kaldari added a subtask for T249127: Create regularly updated maps of COVID-19 outbreak: T251151: List of COVID-19 SVG maps.
Apr 27 2020, 3:08 PM · Design, covid-19, COVID-19-Graphics
kaldari claimed T251151: List of COVID-19 SVG maps.
Apr 27 2020, 3:07 PM · COVID-19-Graphics, covid-19
kaldari updated the task description for T251151: List of COVID-19 SVG maps.
Apr 27 2020, 3:06 PM · COVID-19-Graphics, covid-19
kaldari created T251151: List of COVID-19 SVG maps.
Apr 27 2020, 3:05 PM · COVID-19-Graphics, covid-19

Apr 21 2020

kaldari renamed T250185: Make Wikisource-OCR handle paragraphs better from Can Wikisource-OCR handle paragraphs better? to Make Wikisource-OCR handle paragraphs better.
Apr 21 2020, 7:53 PM · Community-Tech, Wikimedia OCR
kaldari closed T247284: Improve OCR: Move ws-google-ocr repository to Gerrit, a subtask of T191182: Migrate active repositories in Phabricator Differential to GitLab, as Resolved.
Apr 21 2020, 7:48 PM · Release-Engineering-Team (Quid Pro Crow 🦃), collaboration-services, User-AKlapper, GitLab (Project Migration), Epic, Technical-Debt, Phabricator
kaldari closed T247284: Improve OCR: Move ws-google-ocr repository to Gerrit as Resolved.
Apr 21 2020, 7:48 PM · Release-Engineering-Team, Wikimedia OCR, Community-Tech
kaldari added a comment to T236341: Decouple UploadWizardConfig.maxUploads and maxUploads for Flickr imports.

We can test it right now on https://test.wikimedia.beta.wmflabs.org/ (I already tested as a regular user, correctly limited to 4) but on https://test.wikipedia.org/ the patch isn't active yet. I assume it'll take effect on April 29th there as well?

Apr 21 2020, 7:46 PM · MW-1.35-notes (1.35.0-wmf.30; 2020-04-28), Commons, UploadWizard
kaldari added a comment to T196448: Automatically redirect users to the correct category in the UploadWizard (Wikimedia Commons).

Note that this bug causes a lot of extra work for the Commons community and should be fairly easy to fix.

Apr 21 2020, 6:44 PM · MediaWiki-Categories, UploadWizard, Multimedia
kaldari added a project to T196448: Automatically redirect users to the correct category in the UploadWizard (Wikimedia Commons): good first task.
Apr 21 2020, 6:44 PM · MediaWiki-Categories, UploadWizard, Multimedia
kaldari added a comment to T247284: Improve OCR: Move ws-google-ocr repository to Gerrit.

@mmodell - To be clear, we need https://phabricator.wikimedia.org/source/tool-wikisource-ocr/ to mirror https://gerrit.wikimedia.org/r/#/admin/projects/labs/tools/wikisource-ocr. Hope you can help us with that.

Apr 21 2020, 6:14 PM · Release-Engineering-Team, Wikimedia OCR, Community-Tech
kaldari updated the task description for T247284: Improve OCR: Move ws-google-ocr repository to Gerrit.
Apr 21 2020, 6:12 PM · Release-Engineering-Team, Wikimedia OCR, Community-Tech
kaldari updated the task description for T247284: Improve OCR: Move ws-google-ocr repository to Gerrit.
Apr 21 2020, 6:11 PM · Release-Engineering-Team, Wikimedia OCR, Community-Tech
kaldari added a comment to T236341: Decouple UploadWizardConfig.maxUploads and maxUploads for Flickr imports.

@AlexisJazz - I'm talking about https://test.wikipedia.org/, which has about 500 administrators, including myself.

Apr 21 2020, 6:04 PM · MW-1.35-notes (1.35.0-wmf.30; 2020-04-28), Commons, UploadWizard
kaldari added a comment to T236341: Decouple UploadWizardConfig.maxUploads and maxUploads for Flickr imports.

@AlexisJazz - By coincidence, test.wikipedia is already set up with all users having upload_by_url. In addition admins there have mass-upload. So we should be able to test with an admin account and a regular user account to see the difference.

Apr 21 2020, 12:00 AM · MW-1.35-notes (1.35.0-wmf.30; 2020-04-28), Commons, UploadWizard

Apr 20 2020

kaldari added a comment to T249945: Druid access for view on event.editeventattempt.

@fdans - Yes, the 90 day limit works fine for me.

Apr 20 2020, 4:32 PM · Analytics-Kanban, Patch-For-Review, Analytics

Apr 17 2020

kaldari closed T236341: Decouple UploadWizardConfig.maxUploads and maxUploads for Flickr imports, a subtask of T90004: Enable Flickr import for all users on Commons, as Resolved.
Apr 17 2020, 4:13 PM · Wikimedia-Site-requests, Commons, Wikimedia-Hackathon-2015, Multimedia
kaldari closed T236341: Decouple UploadWizardConfig.maxUploads and maxUploads for Flickr imports as Resolved.
Apr 17 2020, 4:13 PM · MW-1.35-notes (1.35.0-wmf.30; 2020-04-28), Commons, UploadWizard
kaldari added a comment to T236341: Decouple UploadWizardConfig.maxUploads and maxUploads for Flickr imports.

@AlexisJazz - The change should take effect on Common on April 29th, if there are no problems on test.wikipedia.

Apr 17 2020, 4:13 PM · MW-1.35-notes (1.35.0-wmf.30; 2020-04-28), Commons, UploadWizard
kaldari added a comment to T247284: Improve OCR: Move ws-google-ocr repository to Gerrit.

@mmodell - Sure, if it's possible to change the name at the same time. It's no longer specific to Google, so it would be nice to rename it to something like tool-wikisource-ocr (similar to the new gerrit repo).

Apr 17 2020, 4:09 PM · Release-Engineering-Team, Wikimedia OCR, Community-Tech
kaldari added a comment to T247284: Improve OCR: Move ws-google-ocr repository to Gerrit.

hmm, who else is a Phabricator admin? @Aklapper?

Apr 17 2020, 2:58 PM · Release-Engineering-Team, Wikimedia OCR, Community-Tech

Apr 16 2020

kaldari added a comment to T219279: Some pages will become completely unreachable after PHP7 update due to Unicode changes.

Oh yes, now I understand what you mean. That's very interesting. I wonder if unsuccessful move attempts trigger watchlist updates but not log entries.

Apr 16 2020, 7:37 PM · User-notice-archive, MW-1.38-notes (1.38.0-wmf.6; 2021-10-26), Patch-For-Review, MW-1.35-notes (1.35.0-wmf.28; 2020-04-14), Platform Team Workboards (Clinic Duty Team), MW-1.34-notes (1.34.0-wmf.16; 2019-07-30), serviceops, SRE, PHP 7.2 support, MediaWiki-General
kaldari updated subscribers of T247284: Improve OCR: Move ws-google-ocr repository to Gerrit.

@mmodell - Looks like MarcoAurelio is MIA. Any chance you could delete the https://phabricator.wikimedia.org/source/tool-ws-google-ocr/ repo for us?

Apr 16 2020, 6:18 PM · Release-Engineering-Team, Wikimedia OCR, Community-Tech
kaldari added a comment to T219279: Some pages will become completely unreachable after PHP7 update due to Unicode changes.

@DannyS712 - I can't parse what you're saying. What do watchlist entries have to do with these page moves? And why are you saying that https://commons.wikimedia.org/w/index.php?title=File%3A%C9%A1obyounohasi.jpg&redirect=no never occurred? I'm probably misunderstanding you, but your first sentence is very hard to understand.

Apr 16 2020, 5:48 PM · User-notice-archive, MW-1.38-notes (1.38.0-wmf.6; 2021-10-26), Patch-For-Review, MW-1.35-notes (1.35.0-wmf.28; 2020-04-14), Platform Team Workboards (Clinic Duty Team), MW-1.34-notes (1.34.0-wmf.16; 2019-07-30), serviceops, SRE, PHP 7.2 support, MediaWiki-General
kaldari added a comment to T249788: RFC: Remove Android 2 from basic support in compatibility matrix.

@Volker_E - Could you explain how SVG background icons or position:fixed fallbacks are necessary for preserving core functionality in MediaWiki (i.e. reading, searching, editing) on Android 2? In other words, what specific feature(s) would be broken? I'm asking because I'm wondering if we could just remove these 2 fallbacks anyway (regardless of Grade C support).

Apr 16 2020, 5:16 PM · TechCom-RFC (TechCom-RFC-Closed), MediaWiki-General, Browser-Support-Android-Google-Chrome

Apr 14 2020

kaldari awarded T234695: Measure % of edits and pvs coming from users without JS a Love token.
Apr 14 2020, 10:38 PM · Product-Analytics, Epic, Web-Team-Backlog (Tracking)
kaldari placed T240910: Sometimes portlet links are unstyled in Minerva (e.g. PageTriage tool) up for grabs.
Apr 14 2020, 10:34 PM · MW-1.38-notes (1.38.0-wmf.2; 2021-09-28), Web-Team-Backlog (Tracking), Growth-Team-Filtering, MinervaNeue (Tracking), Growth-Team, PageTriage
kaldari added a comment to T100402: Core should have support for mobile detection and customization.

Regardless of the issue of removing the .m. subdomain, would anyone object to us doing the first step of this task: "DeviceDetection.php moved from MobileFrontend to core". It used to be in core and that's really where it belongs. All skins (especially non-WMF skins) should have easy access to this data regardless of whether they are running MobileFrontend or not. Also the WikiEditor extension needs this data for proper eventlogging (T249944).

Apr 14 2020, 8:57 PM · MobileFrontend (Tracking), MediaWiki-User-Interface
kaldari added a comment to T249944: WikiEditor records all edits as platform = desktop in EventLogging.

FYI, MobileFrontend has code to detect phones and tablets in UADeviceDetector.php. It seems like that code should really live in core rather than MobileFrontend, otherwise, it's going to be complicated to detect people using a phone or tablet to edit with the Wikitext editor on the desktop site.

Apr 14 2020, 8:13 PM · Data-Engineering, Data Pipelines, Editing-team, Analytics-Radar, Product-Analytics, Mobile, MediaWiki-extensions-EventLogging, WikiEditor
kaldari renamed T249944: WikiEditor records all edits as platform = desktop in EventLogging from WikiEditor records all edits as desktop edits in EventLogging to WikiEditor records all edits as platform = desktop in EventLogging.
Apr 14 2020, 8:01 PM · Data-Engineering, Data Pipelines, Editing-team, Analytics-Radar, Product-Analytics, Mobile, MediaWiki-extensions-EventLogging, WikiEditor
kaldari added a comment to T249944: WikiEditor records all edits as platform = desktop in EventLogging.

... we'd probably still want to differentiate between MobileFrontend and WikiEditor. As-is, that desktop / phone split for platform is the only way to tell, I think.

Yeah, this seems like an oversight in the current schema. I agree that adding something like mobile-page to the integration field would probably be the best solution. We can probably split that off as a separate bug though, as this bug is just about the platform field.

Apr 14 2020, 8:00 PM · Data-Engineering, Data Pipelines, Editing-team, Analytics-Radar, Product-Analytics, Mobile, MediaWiki-extensions-EventLogging, WikiEditor
kaldari added a comment to T249944: WikiEditor records all edits as platform = desktop in EventLogging.

@Mayakp.wiki - Nevermind, I figured it out. It looks like the edits_hourly dashboard relies on the revision_tags in the database.

Apr 14 2020, 7:55 PM · Data-Engineering, Data Pipelines, Editing-team, Analytics-Radar, Product-Analytics, Mobile, MediaWiki-extensions-EventLogging, WikiEditor
kaldari added a comment to T249945: Druid access for view on event.editeventattempt.

To answer the question above, looks like the data in the edits_hourly dashboard comes from the database and mostly relies on revision_tags.

Apr 14 2020, 7:53 PM · Analytics-Kanban, Patch-For-Review, Analytics
kaldari added a comment to T249945: Druid access for view on event.editeventattempt.

Looking great so far. Would it be possible to add a description for this dashboard in Turnillo (similar to the other dashboards), something like: "Sampled eventlogging of the non-API editing interfaces". That way people can tell the difference between it and the edits_hourly dashboard. Speaking of, does anyone know where the data for the edits_hourly Turnillo dashboard comes from?

Apr 14 2020, 7:38 PM · Analytics-Kanban, Patch-For-Review, Analytics
kaldari added a comment to T240910: Sometimes portlet links are unstyled in Minerva (e.g. PageTriage tool).

@Jdlrobson - Now that T231925 is fixed, what code should PageTriage be using to create the link? I was thinking that it would get fixed automatically by T231925, but it still looks broken as in the description screenshot.

Apr 14 2020, 6:44 PM · MW-1.38-notes (1.38.0-wmf.2; 2021-09-28), Web-Team-Backlog (Tracking), Growth-Team-Filtering, MinervaNeue (Tracking), Growth-Team, PageTriage
kaldari updated subscribers of T249944: WikiEditor records all edits as platform = desktop in EventLogging.

@Mayakp.wiki - How does Turnillo split between "Mobile web" editing and "Other" editing in the edits_hourly dashboard? Is it relying on the EditAttemptStep schema or doing something else?

Apr 14 2020, 5:41 PM · Data-Engineering, Data Pipelines, Editing-team, Analytics-Radar, Product-Analytics, Mobile, MediaWiki-extensions-EventLogging, WikiEditor
kaldari assigned T248242: IABot/2.0 uses deprecated MediaWiki API calls to Cyberpower678.
Apr 14 2020, 5:25 PM · InternetArchiveBot (v2.0)
kaldari created T250185: Make Wikisource-OCR handle paragraphs better.
Apr 14 2020, 3:34 PM · Community-Tech, Wikimedia OCR

Apr 13 2020

kaldari updated the task description for T247284: Improve OCR: Move ws-google-ocr repository to Gerrit.
Apr 13 2020, 10:10 PM · Release-Engineering-Team, Wikimedia OCR, Community-Tech
kaldari added a comment to T247284: Improve OCR: Move ws-google-ocr repository to Gerrit.

@Samwilson - I think the last steps needed are to delete the existing Phabricator repo and create a new mirror of the Gerrit repo (with the new name). I don't have adequate permissions to delete the existing repo, but I think you do.

Apr 13 2020, 10:04 PM · Release-Engineering-Team, Wikimedia OCR, Community-Tech
kaldari added a comment to T247875: Assist with maintaining aggregate values in numerical tables.

@Doc_James - If the assumption is that the data is in Wikitext (and not JSON or something else), it seems like the best solution to this problem would be to generate the entire table from a single template, and have a Lua module calculate the totals based on the parameters passed to the template for each country. The big downside to this solution is that editors would no longer be able to use the VisualEditor table editor to edit the country data. And like Ed and Gergo mention above, dealing with number formatting is going to be a problem for any potential solution.

Apr 13 2020, 9:20 PM · covid-19, MediaWiki-General
kaldari added a comment to T248936: Add sorttop option for jquery.tablesorter.

@Tchanders - If you end up implementing the suggested solution in the description (which is the least hacky solution), let me know and I can update all the tables on-wiki.

Apr 13 2020, 5:59 PM · MW-1.35-notes (1.35.0-wmf.30; 2020-04-28), JavaScript, covid-19, MediaWiki-jQuery-Tablesorter--archived
kaldari added a project to T248897: Create an easy-to-use editing interface for JSON tabular data pages: covid-19.
Apr 13 2020, 5:37 PM · covid-19, Commons-Datasets, JsonConfig, Epic

Apr 10 2020

kaldari closed T248058: Google OCR tool: use 'document text detection' rather than 'text detection' as Resolved.

I updated the remote repo in the Toolforge tool and pulled the update from the new repo. It works great!

Apr 10 2020, 8:37 PM · Community-Tech
kaldari added a comment to T249738: Phan complains that 'all' isn't a string.

@Daimona - $cfg['scalar_implicit_cast'] = false; didn't work either, FYI.

Apr 10 2020, 8:24 PM · MW-1.39-notes (1.39.0-wmf.5; 2022-03-28), Community-Tech, MediaWiki-extensions-PageAssessments, phan
kaldari added a comment to T240697: Measure % of edits coming from users without JS.

@DLynch - Interestingly, disabling JS on the mobile web relegates you to using the core/WikiEditor editor, which always reports as "desktop" (T249944), so breaking it out by platform would probably be pointless.

Apr 10 2020, 8:08 PM · Editing-team (Kanban Board), Product-Analytics (Kanban), User-Ryasmeen, Web-Team-Backlog (Tracking), VisualEditor
kaldari created T249944: WikiEditor records all edits as platform = desktop in EventLogging.
Apr 10 2020, 6:27 PM · Data-Engineering, Data Pipelines, Editing-team, Analytics-Radar, Product-Analytics, Mobile, MediaWiki-extensions-EventLogging, WikiEditor
kaldari added a comment to T240697: Measure % of edits coming from users without JS.

@ppelberg - Well, it's after the holidays, but probably an even worse time to bring this up. Regardless, we need this data to move forward with our no-JS guidelines for engineering. From David's analysis, it sounds like this would be a relatively small task (maybe a few days for one engineer). Is there any chance the Editing team could do this in Q4?

Apr 10 2020, 5:57 PM · Editing-team (Kanban Board), Product-Analytics (Kanban), User-Ryasmeen, Web-Team-Backlog (Tracking), VisualEditor
kaldari added a comment to T240697: Measure % of edits coming from users without JS.

To answer my own question, it looks like we could limit it to cases where editor_interface = wikitext and integration = page (to make sure we exclude app edits) for the no-JS number.

Apr 10 2020, 5:38 PM · Editing-team (Kanban Board), Product-Analytics (Kanban), User-Ryasmeen, Web-Team-Backlog (Tracking), VisualEditor
kaldari added a comment to T193550: [PageAssessments] db/addProjectsTable.sql : Error: 1071 Specified key was too long; max key length is 767 bytes.

So the thing is InnoDB DOES support longer indexes than 767. By default, on the latest versions of MariaDB (10.2 and up) and MySQL (8.0 and up), it supports up to 3072 bytes, and on supported versions before those, it supports that if using Barracuda file format, innodb_file_per_table and innodb_large_prefix = ON.

@jcrespo - That's good to know. One thing I don't really understand is what is the actual severity of this bug? For example, if we want to enable PageAssessments on Russian Wikipedia (T184967), which doesn't have these tables yet, would this bug block it? Or are we using the config options you mention above? In other words, does this bug only affect 3rd party users, or also WMF?

Apr 10 2020, 4:23 PM · MediaWiki-extensions-PageAssessments
kaldari added a comment to T249891: Suggested Tags should not suggest tags about image format or genre.

@Ramsey-WMF - I hope y'all are planning to move the blacklist on-wiki. It would make life easier for everyone.

Apr 10 2020, 1:48 AM · Structured-Data-Backlog, Structured Data Engineering, SDC-Statements
kaldari added a comment to T249891: Suggested Tags should not suggest tags about image format or genre.

@Ramsey-WMF - That's awesome. Thanks for the info!

Apr 10 2020, 1:37 AM · Structured-Data-Backlog, Structured Data Engineering, SDC-Statements
kaldari renamed T249891: Suggested Tags should not suggest tags about image format or genre from Suggested Tags should not suggest tags about image format to Suggested Tags should not suggest tags about image format or genre.
Apr 10 2020, 1:18 AM · Structured-Data-Backlog, Structured Data Engineering, SDC-Statements
kaldari updated the task description for T249891: Suggested Tags should not suggest tags about image format or genre.
Apr 10 2020, 1:17 AM · Structured-Data-Backlog, Structured Data Engineering, SDC-Statements
kaldari created T249891: Suggested Tags should not suggest tags about image format or genre.
Apr 10 2020, 1:15 AM · Structured-Data-Backlog, Structured Data Engineering, SDC-Statements
kaldari added a comment to T240697: Measure % of edits coming from users without JS.

This would be using saveSuccess as a way to limit it to sessions that resulted in successful edits. If either having-JS or not-having-JS makes saving substantially harder (lack of tools / bugs), our numbers would be misleading.

I don't actually think this would be misleading, as we want to find out how many actual edits are made with no-JS (i.e. how many edits would we lose by disabling no-JS editing support).

This would exclude VisualEditor users, depressing the overall JS numbers. This would be easy to compensate for by showing a "number of successful edits from VE" figure in the same time period.

Sounds like a good plan.

Bots would probably still be included. Depending on the bot's methodology, it could potentially be classed as JS or no-JS, or bypass this editor entirely and use the API to make its edits.

@DLynch - What we're specifically looking for is no-JS edits made through any editing interface besides the API or mobile apps (regardless of whether they are by bots or not). Is there a way to exclude API edits from the totals? Basically we just need to justify with actual data whether or not we should continue to maintain a no-JS editor (as part of a broader evaluation of all of our no-JS support). I imagine our no-JS editor is used a fair bit, but we need data rather than speculation. The rationale for having an editing API is separate and doesn't need further justification.

Apr 10 2020, 12:36 AM · Editing-team (Kanban Board), Product-Analytics (Kanban), User-Ryasmeen, Web-Team-Backlog (Tracking), VisualEditor

Apr 9 2020

kaldari added a comment to T234695: Measure % of edits and pvs coming from users without JS.

@kzimmerman - Since JK is out for a while, I'll chime in here. This data is basically needed for any future editing-related features, as we need to decide whether or not to continue building no-JS fallbacks for those features. For example, the Editing team is currently working on Discussion Tools, which is a group of Javascript editing features for talk pages. Since we don't have a good idea of how much editing is done on no-JS browsers, we don't really know what the impact will be of not providing a no-JS fallback and whether that may impact some communities more than others.

Apr 9 2020, 11:47 PM · Product-Analytics, Epic, Web-Team-Backlog (Tracking)
kaldari moved T248058: Google OCR tool: use 'document text detection' rather than 'text detection' from New & TBD Tickets to Needs Discussion on the Community-Tech board.
Apr 9 2020, 8:42 PM · Community-Tech
kaldari added a comment to T248058: Google OCR tool: use 'document text detection' rather than 'text detection'.

It looks like you just need to change line 90 in Ocr.php from

$this->gcv->addFeatureTextDetection();

to

$this->gcv->addFeatureDocumentTextDetection();

assuming that the structure of the response is the same (which should be checked).

Apr 9 2020, 8:41 PM · Community-Tech
kaldari added a comment to T247284: Improve OCR: Move ws-google-ocr repository to Gerrit.
Apr 9 2020, 8:32 PM · Release-Engineering-Team, Wikimedia OCR, Community-Tech
kaldari updated the task description for T247284: Improve OCR: Move ws-google-ocr repository to Gerrit.
Apr 9 2020, 8:30 PM · Release-Engineering-Team, Wikimedia OCR, Community-Tech
kaldari added a comment to T248061: RFC: Remove IE 8 from basic support.

It seems like PNG support should be completely unnecessary in any kind of JS application given we don't run JS here. If we do want to continue supporting background-size on IE8 as a compromise we could limit this fallback to modules added via addModuleStyles.

Apr 9 2020, 6:44 PM · User-notice-archive, MW-1.36-notes (1.36.0-wmf.2; 2020-07-28), MW-1.36-release, TechCom-RFC (TechCom-RFC-Closed), Tech Ambassadors & Translators, MediaWiki-General, Browser-Support-Internet-Explorer, Front-end-Standards-Group
kaldari added a comment to T225293: [BUG][8 hours] Investigate why the SVGTranslate webservice needs to be restarted so often.

@dom_walden - Is this still an issue or should we close it?

Apr 9 2020, 3:56 PM · Community-Tech, Spike, SVG Translate Tool

Apr 8 2020

kaldari added a comment to T118783: Graph maps sometimes render glitch lines.

@Tchanders - Bingo! I can actually reproduce the bug now. Yay! So I wonder if those boxes just have some sub-pixel-width stroke or border being applied for some reason. And is it worth the trouble of trying to track it down?

Apr 8 2020, 10:01 PM · MediaWiki-extensions-Graph
kaldari added a comment to T118783: Graph maps sometimes render glitch lines.

@Tchanders - Totally agree that the data for the map in T118783#6012706 was junk. Thanks for solving that. I thought it might be a useful clue for solving Spage's example in the description though. I can't reproduce the original problem at all, even when activating the map and zooming in. If it only happens to you when being zoomed in then maybe the bug isn't worth worrying about. Have you tried on Firefox in Ubuntu?

Apr 8 2020, 7:57 PM · MediaWiki-extensions-Graph
kaldari added a comment to T249738: Phan complains that 'all' isn't a string.

I fixed it by suppressing that particular error type:

$cfg['suppress_issue_types'] = array_merge( $cfg['suppress_issue_types'], [
	// This test seems to be buggy or overly strict (T249738)
	'PhanTypeMismatchArgumentNullableInternal',
] );

I decided to suppress it repo-wide since there are other parts of that code-base with very similar code that will likely trigger the same phan glitch if they are modified.

Apr 8 2020, 7:45 PM · MW-1.39-notes (1.39.0-wmf.5; 2022-03-28), Community-Tech, MediaWiki-extensions-PageAssessments, phan
kaldari added a comment to T249738: Phan complains that 'all' isn't a string.

@Daimona - FYI, setting null_casts_as_any_type to false in the Phan config didn't work (https://gerrit.wikimedia.org/r/#/c/mediawiki/extensions/PageAssessments/+/587399/). How do you find out which Phan rule triggers a particular error message? It seems like that would be nice to include in the output.

Apr 8 2020, 7:31 PM · MW-1.39-notes (1.39.0-wmf.5; 2022-03-28), Community-Tech, MediaWiki-extensions-PageAssessments, phan
kaldari added a comment to T249738: Phan complains that 'all' isn't a string.

@Daimona - Thanks, I'll suppress it inline, but I hope there's a way to prevent this in the future, as it seems like a needless waste of time.

Apr 8 2020, 7:00 PM · MW-1.39-notes (1.39.0-wmf.5; 2022-03-28), Community-Tech, MediaWiki-extensions-PageAssessments, phan
kaldari added a comment to T249738: Phan complains that 'all' isn't a string.

@Daimona - Why would $ns !== 'all' make it think that $ns can be null? (Removing that part of the code fixes the error.) That doesn't make any sense. And even if it did, it's perfectly fine to pass a null variable to strlen().

Apr 8 2020, 6:22 PM · MW-1.39-notes (1.39.0-wmf.5; 2022-03-28), Community-Tech, MediaWiki-extensions-PageAssessments, phan
kaldari updated the task description for T249738: Phan complains that 'all' isn't a string.
Apr 8 2020, 5:29 PM · MW-1.39-notes (1.39.0-wmf.5; 2022-03-28), Community-Tech, MediaWiki-extensions-PageAssessments, phan
kaldari created T249738: Phan complains that 'all' isn't a string.
Apr 8 2020, 5:29 PM · MW-1.39-notes (1.39.0-wmf.5; 2022-03-28), Community-Tech, MediaWiki-extensions-PageAssessments, phan

Apr 7 2020

kaldari updated the task description for T247284: Improve OCR: Move ws-google-ocr repository to Gerrit.
Apr 7 2020, 10:19 PM · Release-Engineering-Team, Wikimedia OCR, Community-Tech
kaldari added a comment to T211076: improve sort options in Special:PageAssessments.

Sorting by date would be difficult since the table doesn't currently store the timestamp, only the revision ID of the page at the point of assessment. Even if you forced the user to limit it to a single WikiProject, WikiProject Biography has over 1.7 million pages assigned to it, so it would likely timeout without denormalizing the timestamp data into the page_assessments table. Sorting by class or importance would probably be more doable, but only when limited to a single WikiProject.

Apr 7 2020, 8:53 PM · MediaWiki-extensions-PageAssessments
kaldari added a comment to T193550: [PageAssessments] db/addProjectsTable.sql : Error: 1071 Specified key was too long; max key length is 767 bytes.

FYI, it looks like this bug affects at least 33 other extensions: https://phabricator.wikimedia.org/search/query/.toXYVsB2ZdB/. Some have already added schema changes to work around the problem.

Apr 7 2020, 8:33 PM · MediaWiki-extensions-PageAssessments
kaldari added a comment to T193550: [PageAssessments] db/addProjectsTable.sql : Error: 1071 Specified key was too long; max key length is 767 bytes.

@jcrespo - Any update on this? Should we go ahead and fix it with a schema change?

Apr 7 2020, 8:27 PM · MediaWiki-extensions-PageAssessments
kaldari added a comment to T247284: Improve OCR: Move ws-google-ocr repository to Gerrit.

@Reedy - How do we temporarily add "forge committer identity" and "push" rights to the labs-tools-wikisource-ocr group? Does that require a Gerrit administrator?

Apr 7 2020, 8:23 PM · Release-Engineering-Team, Wikimedia OCR, Community-Tech
kaldari updated the task description for T247284: Improve OCR: Move ws-google-ocr repository to Gerrit.
Apr 7 2020, 8:20 PM · Release-Engineering-Team, Wikimedia OCR, Community-Tech
kaldari added a comment to T247284: Improve OCR: Move ws-google-ocr repository to Gerrit.

I went ahead and created the gerrit repo and the owner group.

Apr 7 2020, 8:20 PM · Release-Engineering-Team, Wikimedia OCR, Community-Tech
kaldari moved T221577: Wikimedia\Rdbms\LBFactory::getEmptyTransactionTicket: LinksUpdate does not have outer scope from Ready to Done on the MediaWiki-extensions-PageAssessments board.
Apr 7 2020, 8:06 PM · MW-1.34-notes (1.34.0-wmf.7; 2019-05-28), Wikidata, Wikidata-Campsite, Patch-For-Review, Performance-Team, Multimedia, MediaWiki-libs-Rdbms, GlobalUsage, MediaWiki-extensions-PageAssessments, Discovery-Search, GeoData, Wikimedia-production-error
kaldari moved T219935: extensions/PageAssessments/maintenance/purgeUnusedProjects.php is causing fatals on Beta from Ready to Done on the MediaWiki-extensions-PageAssessments board.
Apr 7 2020, 8:06 PM · MW-1.33-notes (1.33.0-wmf.25; 2019-04-09), Community-Tech, Puppet, MediaWiki-extensions-PageAssessments, Beta-Cluster-reproducible, Wikimedia-production-error
kaldari moved T248366: Error creating page_assessments_projects table under SQLite from Ready to Done on the MediaWiki-extensions-PageAssessments board.
Apr 7 2020, 8:06 PM · MW-1.35-notes (1.35.0-wmf.25; 2020-03-24), SQLite, MediaWiki-extensions-PageAssessments, Community-Tech
kaldari moved T231269: All API help links should use `Special:MyLanguage` from Needs Review/Feedback to Done on the MediaWiki-extensions-PageAssessments board.
Apr 7 2020, 8:06 PM · Structured-Data-Backlog, MW-1.35-notes (1.35.0-wmf.8; 2019-11-26), MW-1.34-notes (1.34.0-wmf.21; 2019-09-03), Community-Tech, Machine-Learning-Team, Discovery-Search, Wikidata, Growth-Team, VisualEditor, UploadWizard, TextExtracts, TemplateData, ORES, MobileFrontend, Jade, StructuredDiscussions, BetaFeatures, ApiFeatureUsage, MediaWiki-extensions-WikibaseClient, TimedMediaHandler, RegexBlock, MachineVision, GeoData, PageViewInfo, MediaWiki-extensions-PageAssessments, Notifications, User-DannyS712
kaldari added a comment to T247284: Improve OCR: Move ws-google-ocr repository to Gerrit.

@thcipriani - Does this look right for creating the new repo?

ssh -p 29418 gerrit.wikimedia.org gerrit create-project --require-change-id --owner=labs-tools-wikisource-ocr --parent=labs/tools --description="'Toolforge tool for handling Wikisource OCR requests'" labs/tools/wikisource-ocr
Apr 7 2020, 5:44 PM · Release-Engineering-Team, Wikimedia OCR, Community-Tech
kaldari updated the task description for T247284: Improve OCR: Move ws-google-ocr repository to Gerrit.
Apr 7 2020, 5:31 PM · Release-Engineering-Team, Wikimedia OCR, Community-Tech
kaldari updated the task description for T247284: Improve OCR: Move ws-google-ocr repository to Gerrit.
Apr 7 2020, 5:08 PM · Release-Engineering-Team, Wikimedia OCR, Community-Tech
kaldari added a comment to T245212: Suggested Tags should not consider reverts as "personal uploads".

@Ramsey-WMF - I still imagine there will be a use case for removing images without adding a tag. What happens currently if someone else tags one of the images in my personal queue before I do? Does it remain in my queue or get removed? If it remains, that would be a good example of a case where there may be no need to add more tags.

Apr 7 2020, 12:11 AM · MW-1.35-notes (1.35.0-wmf.25; 2020-03-24), MachineVision, Structured-Data-Backlog (Current Work), Structured Data Engineering, SDC-Statements (Machine-vision-depicts)

Apr 6 2020

kaldari added a comment to T245212: Suggested Tags should not consider reverts as "personal uploads".

Hmm. Why not just make the "Skip" button remove it from your queue? It's not like you can't add more claims later manually. What's the use case for people repeatedly skipping the same images? For example, I've skipped this image's tag suggestions at least 20 times now:

Screen Shot 2020-04-06 at 4.08.00 PM.png (766×858 px, 1 MB)

Is there any logical reason to show it to me again? It just seems like a waste of time. And the more my queue grows (it's at 193 images currently), the longer it takes me to find recent ones that might actually be actionable. If the queue were presented in reverse chronological order it wouldn't be such a big deal. But when you have to click "Skip" 100 times to get to your newly uploaded image that you were just notified about, it starts to get a bit aggravating :(

Apr 6 2020, 8:35 PM · MW-1.35-notes (1.35.0-wmf.25; 2020-03-24), MachineVision, Structured-Data-Backlog (Current Work), Structured Data Engineering, SDC-Statements (Machine-vision-depicts)
kaldari added a comment to T245212: Suggested Tags should not consider reverts as "personal uploads".

@Ramsey-WMF - Maybe, but there seems to be a related bug... If I click "Skip" for an image it doesn't seem to get removed from my personal queue. So my personal queue is now made up mostly of images that I either don't have anything to do with or images for which Google doesn't have any helpful suggestions (which is common). In other words, my personal queue is slowly turning into garbage. If the "Skip" button is fixed, I think this bug will cease to be a problem though.

Apr 6 2020, 7:55 PM · MW-1.35-notes (1.35.0-wmf.25; 2020-03-24), MachineVision, Structured-Data-Backlog (Current Work), Structured Data Engineering, SDC-Statements (Machine-vision-depicts)