Page MenuHomePhabricator

Uzume (Uzume)
User

Today

  • Clear sailing ahead.

Tomorrow

  • Clear sailing ahead.

Sunday

  • Clear sailing ahead.

User Details

User Since
Dec 26 2014, 1:37 PM (524 w, 6 d)
Availability
Available
LDAP User
Uzume
MediaWiki User
Unknown

Recent Activity

Tue, Dec 31

Uzume added a comment to T382824: Cache problems with new Index pages in Wikisource.

It actually has been happening for a rather long time but the frequency has certainly increased as Commons has grown. To actually fix it, the issue is a Commons issue and not a PRP one and thus should get retagged/reassigned there. There are numerous tickets about mentioning things like PDF/DjVu having 0x0 size that are essentially the same thing. This seems to just be the latest duplicate of such.

Tue, Dec 31, 12:08 AM · MediaWiki-extensions-PdfHandler, MediaWiki-File-management, Commons, ProofreadPage

Mon, Dec 30

Uzume added a comment to T382824: Cache problems with new Index pages in Wikisource.

@Jan.Kamenicek Yes, but what exactly do you want PRP to do about that? About the only thing it might be able to do would be to improve the error messaging under those circumstances. PRP has no control over how media are hosted, remotely on Commons or otherwise. The problem description does define such things but it does not really specify what you want to happen. Also this issue is current tagged/attached to PRP so presumably you are looking for a solution in that space but short of potentially improving some error messaging there is really nothing PRP can do about the situation.

Mon, Dec 30, 11:06 PM · MediaWiki-extensions-PdfHandler, MediaWiki-File-management, Commons, ProofreadPage
Uzume added a comment to T382824: Cache problems with new Index pages in Wikisource.

This is a fairly common issue with media from Commons and is not really a PRP issue itself. PRP requires metadata about the media to be available and correctly yields errors when it cannot get such. The typical workaround it to issue a purge on the media on Commons forcing an update there and all sites using Commons to remote host media.

Mon, Dec 30, 6:31 PM · MediaWiki-extensions-PdfHandler, MediaWiki-File-management, Commons, ProofreadPage

Dec 8 2024

Uzume added a comment to T214729: Error during parsing of djvu text layer to produce metadata leads to page offset in ProofreadPage extension.

I'm sorry if I sounded dismissive, that was not what I meant. I was asking if someone planned to do this. I've tested at the other end, and replacing (in the file) the invalid page text by an empty page, in the way outlined above by Mpaa, does prevent the OCR offset. So, if we replace it on the Proofreadpage side, it should also work (see also T219376). I myself know very little PHP, so I'm afraid that if I try to write it up myself I will break something.

Dec 8 2024, 2:41 PM · ProofreadPage
Uzume added a comment to T214729: Error during parsing of djvu text layer to produce metadata leads to page offset in ProofreadPage extension.

Any chance for this to be solved eventually? It's a very common problem, and it looks like it'd be rather simple to solve.

Dec 8 2024, 10:41 AM · ProofreadPage

Dec 4 2024

Uzume merged T381318: PDF pages don't have associated images on the Polish Wikisource into T299754: ProofreadPage does not provide scans for files wich 0x0 size in metadata, despite of the size being set manually in the index.
Dec 4 2024, 6:12 PM · ProofreadPage
Uzume merged task T381318: PDF pages don't have associated images on the Polish Wikisource into T299754: ProofreadPage does not provide scans for files wich 0x0 size in metadata, despite of the size being set manually in the index.
Dec 4 2024, 6:12 PM · MediaWiki-extensions-PdfHandler, ProofreadPage
Uzume added a comment to T381318: PDF pages don't have associated images on the Polish Wikisource.

And the file page magically fixed itself too. Cool. I promise it was broken earlier.

Dec 4 2024, 6:01 PM · MediaWiki-extensions-PdfHandler, ProofreadPage
Uzume added a comment to T381318: PDF pages don't have associated images on the Polish Wikisource.

I do not see an issue when I browse to the URL listed in the description of this issue. It should be noted that the page does not currently exist but the editor and image on the right appear just fine if one clicks the "Utwórz" ("Create") link in the toolbar (for me in the upper right). One can additionally also see the appropriate image by selecting the "Grafika" ("Image") link from the toolbar (for me upper left). I personally have my UI set to English in global preferences but I provided link labels for Polish (uselang=pl) since that is likely the default UI language for most users of that site.

Dec 4 2024, 1:00 AM · MediaWiki-extensions-PdfHandler, ProofreadPage

Nov 14 2024

Uzume updated the task description for T336651: If Archive URL is collection of files it gets only 1 file.
Nov 14 2024, 3:44 PM · Internet-Archive, IA Upload
Uzume added a comment to T336651: If Archive URL is collection of files it gets only 1 file.

I do not think we should be supporting Internet Archive collections, as per se, however, based upon the IA identifier from the provided description, this really about multiple scanned objects available at a single IA identifier.

Nov 14 2024, 3:22 PM · Internet-Archive, IA Upload

Nov 13 2024

Uzume added a comment to T379402: Unable to retrieve IA metadata for any item.

FWIW, one of my tools which relies on the Internet Archive also always times out. Perhaps the Internet Archive has temporarily(?) blocked some WMCS IPs following the outage?

Nov 13 2024, 6:31 PM · IA Upload
Uzume added a comment to T379402: Unable to retrieve IA metadata for any item.

Sorry, ignore me! ia-upload is not running on Toolforge any more, it's got its own VPS. :-/

However, the issue is the same:

$ curl -I https://archive.org/details/20231002_20231002_0537?output=json
curl: (28) Failed to connect to archive.org port 443 after 129880 ms: Couldn't connect to server
$ curl -I https://archive.org/details/history-of-telegraphy-wa-3?output=json
curl: (28) Failed to connect to archive.org port 443 after 130678 ms: Couldn't connect to server
Nov 13 2024, 5:45 PM · IA Upload
Uzume added a comment to T194861: DjVu construction from original scans (JP2) selects which pages to build incorrectly resulting in misintegration of djvu.xml based text layers.

If you look in the scandata XML file at the IA, some pages are marked <addToAccessFormats>false</addToAccessFormats>. For example the registration images (generally first and last, sometimes more). Skipping these images then brings everything back into alignment.

Nov 13 2024, 4:58 PM · Internet-Archive, IA Upload
Uzume renamed T194861: DjVu construction from original scans (JP2) selects which pages to build incorrectly resulting in misintegration of djvu.xml based text layers from Text is offset by one page to DjVu construction from original scans (JP2) selects which pages to build incorrectly resulting in misintegration of djvu.xml based text layers.
Nov 13 2024, 4:48 PM · Internet-Archive, IA Upload
Uzume added a comment to T178197: IA Uploader: random corrupted text structure into bult djvu files.
Nov 13 2024, 4:08 PM · IA Upload
Uzume added a comment to T178197: IA Uploader: random corrupted text structure into bult djvu files.

Since this is related to the integration of the text layer using _djvu.xml, and seems to happen when there is a mismatch in the number of pages, this is likely related to T194861 (and the numerous other tickets merged/closed as duplicates of that).

Nov 13 2024, 4:08 PM · IA Upload
Uzume added a comment to T379402: Unable to retrieve IA metadata for any item.

There haven't been any new uploads in over 30 days so you won't find anything in commons:Special:RecentChanges (e.g., the recent-uploads link at the top of the tool page).

Nov 13 2024, 2:31 PM · IA Upload

Nov 6 2024

Uzume added a comment to T16711: Implement a namespace filter for the logging table.

T16711#193346 states the ability to filter logs by namespace via the API was removed but T16712#193524 states it was added back in https://gerrit.wikimedia.org/r/135283. Apparently it was merged smoothly and a brief perusal of the code seems to imply the code is still there.

Nov 6 2024, 9:36 PM · Patch-Needs-Improvement, MediaWiki-Logevents

Oct 29 2024

Uzume added a comment to T376539: Support remote retrieval of multi-page resources via IIIF.

As long as we primarily depend on WMF local copies and remote IIIF images are only used as an option, it sounds like a good idea. Having situations where we depend on WMF remote data (i.e., images, etc.) without any local fallback seems problematic.

Oct 29 2024, 10:45 PM · ProofreadPage

Oct 28 2024

Uzume placed T370133: Upgrade PHP dependencies up for grabs.
Oct 28 2024, 7:51 PM · IA Upload
Uzume closed T370133: Upgrade PHP dependencies as Resolved.

@Samwilson: I am closing this as resolved based upon your aforementioned PR being merged on 2024-07-16:

Oct 28 2024, 7:51 PM · IA Upload

Oct 15 2024

Uzume added a member for Toolforge-standards-committee: Uzume.
Oct 15 2024, 7:56 AM

Aug 12 2024

Uzume added a comment to T331411: Allow uploading newer version of an existing file through IA upload.

But those are different files scanned from different sources. পথের পাঁচালী.djvu is not even originally from Internet Archive.

Aug 12 2024, 11:23 AM · IA Upload, Internet-Archive

Aug 11 2024

Uzume added a comment to T363619: Remove option for PDF → DjVu conversion (phetools).

I agree that in general there is little advantage to creating DjVus from PDFs but sometimes people prefer such formats. PDF technology has now subsumed most of the advantages DjVu previously had. Unfortunately this now means PDF is a very large and complex set of specifications and it is hard to know how any single PDF is constructed without analysis by digital tools.

Aug 11 2024, 12:29 PM · IA Upload
Uzume added a comment to T364445: DJVU file generated is apparently 0x0 pixels.

I do not believe this is an IA Upload issue as it is not specific to IA Upload nor to DjVu as it happens with PDFs. This is a common issues with Commons in general. The workaround it is to purge the file on Commons (and sometimes a null edit too) to reset its media metadata. Sometimes such things also have to be done on a local wiki that uses Commons too (e.g., on a Wikisource site, etc.)

Aug 11 2024, 12:05 PM · Internet-Archive, IA Upload
Uzume added a watcher for IA Upload: Uzume.
Aug 11 2024, 11:54 AM
Uzume added a member for IA Upload: Uzume.
Aug 11 2024, 11:54 AM
Uzume added a comment to T268246: IA Uploader fails to recognize the first page of a book.

I too am looking forward to scandata.xml addToAccessFormats page filtering. That would get rid of the irritating color card and white card pages often included at the end of many scans (but I have seen them in the middle of book scans too).

Aug 11 2024, 11:53 AM · IA Upload

Jul 25 2024

Uzume added a comment to T5663: Add Special page similar to Special:BookSources for ISSNs (International Standard Serial Numbers).

I am also interested in this behavior and I also do not want to see the return of magic links (in fact I think it would be better if magic links were removed from MediaWiki and stuffed into an extension instead of just disabled by default). That said, I would rather see ISSN links as well as Special:BookSources ISBN links functionality moved into an extension and not part of the MW core. Perhaps an extension that allows generic identifier links via "template" pages would be best. Then it could also subsume things like https://isin.toolforge.org/ for ISIN which currently has/uses German and English "templates" at Benutzer:Magnus Manske/ISIN and User:Magnus Manske/ISIN respectively, etc. The list of allowed "template" pages could be listed in a MediaWiki: namespace page so random people could not introduce link bait and I also suggest using a prefix for Special:XYZZY/idname/id and Project:XYZZY/idname where XYZZY is rhe extension name and idname is ISBN, ISSN, etc. MediaWiki:XYZZY-allowed-ids or such would determine which idnames would be mapped. Anyway--just some thoughts on the subject.

Jul 25 2024, 6:11 PM · MediaWiki-Special-pages

Jul 20 2024

Uzume added a comment to T330458: Reverse order transclusion support for Proofread Page..

FYI: Here is another example: One of a Thousand/Preface. I suppose I could try to alter the order of the pages in the media (DjVu) except based upon research, I believe this was a printing error in the actual book. That is why I hesitate to change it. As a workaround, I am transcluding individual pages out of order without using the <pages/> tag (i.e., via Template:Page). I tried using individual pages out of order with multiple <pages/> tags but then the text across pages was broken (some sort of vertical spacing separation).

Jul 20 2024, 11:40 AM · ProofreadPage

May 15 2024

Uzume added a comment to T364690: Transclusion doesn't work after deletion of source file.

FYI: In terms of the error reporting bug from this issue, the following seems to be applicable:

May 15 2024, 3:53 AM · MW-1.43-notes (1.43.0-wmf.11; 2024-06-25), ProofreadPage
Uzume added a comment to T364690: Transclusion doesn't work after deletion of source file.

That's incorrect, ProofreadPage depends on the File namespace to build the pagelist, figure out valid Page: namespace pages and determine the transclusion order of the pages.

May 15 2024, 3:38 AM · MW-1.43-notes (1.43.0-wmf.11; 2024-06-25), ProofreadPage
Uzume added a comment to T364690: Transclusion doesn't work after deletion of source file.

I agree the lack of useful error reporting to know about when this happens is certainly a bug but Proofread Page (PRP) is specifically for media-backed transcriptions. As far as I know it has never supported transcription without being backed by some File media. So in that way I would argue this is not a bug. Most Wikisource sites do support other forms of transcription (and even translation, etc.) not involving PRP (in addition to supporting PRP-based).

May 15 2024, 1:44 AM · MW-1.43-notes (1.43.0-wmf.11; 2024-06-25), ProofreadPage

May 14 2024

Uzume added a comment to T364690: Transclusion doesn't work after deletion of source file.

This shouldn't happen because the <pages/> mechanism works with text in Page and Index NS. Whereas the mediafile is in File NS. These are independent spaces.

May 14 2024, 8:57 AM · MW-1.43-notes (1.43.0-wmf.11; 2024-06-25), ProofreadPage

May 13 2024

Uzume added a comment to T364690: Transclusion doesn't work after deletion of source file.

Case above is just an example. This can be any abstract file with any abstract reason of deletion. After deletion of file, the text is no longer displayed even though text exists.

May 13 2024, 3:56 PM · MW-1.43-notes (1.43.0-wmf.11; 2024-06-25), ProofreadPage
Uzume added a comment to T364690: Transclusion doesn't work after deletion of source file.

I agree—this is expected behavior. Moving forward there are a few options:

May 13 2024, 2:51 PM · MW-1.43-notes (1.43.0-wmf.11; 2024-06-25), ProofreadPage

Apr 27 2024

Uzume added a comment to T161976: Feature request: add detection for page language to Scribunto.

@PerfektesChaos: I am not sure I am really looking to violate any explicit limitations but avoiding tripping them is good. It is not a bad idea to use mw.loadData() to wrap the pageLang as a workaround to prevent increased "expensiveness" but mw.loadData() validates the return value and it does not allow functions and metatables as mw.title objects like what is returned by mw.title.getCurrentTitle() provides.

Apr 27 2024, 5:52 AM · MW-1.42-notes (1.42.0-wmf.15; 2024-01-23), MediaWiki CodeJam Dec 2023, MW-1.38-notes (1.38.0-wmf.20; 2022-01-31), Patch-For-Review, Scribunto

Apr 26 2024

Uzume added a comment to T299369: Consider removing global $userLang from onPageContentLanguage hook.

Commons and Meta-Wiki don't have an extension doing what Incubator does. Yet, has localised File pages and other namespaces for years without insurmountable technical limitations. This is mainly possible due to {{int:lang}}, whcih is like Incubator's {{int:language}} but defined as pages under MediaWIki:Lang instead of MediaWIki:Language. The Parser in MediaWiki is optimised to share the ParserCache between users whenever possible, and the {{int:}} function safely modifies the cache logic to keep each variant separately in the cache. This works fine from a functional perspective and is supported. Although for performance, it is important that article and article templates on big wikis do not do this!

What's interesting is that Commons also uses this mechanism to set the page direction. It uses using {{dir}} (Template:Dir) or sometimes via Lua (Module:Dir/RTL_overrides). The could be imported to Incubator and used as such. You can use it to set dir="" or style="direction: ;" or class="mw-content-ltr". Whichever you prefer!

Apr 26 2024, 9:07 AM · Patch-For-Review, MW-1.42-notes (1.42.0-wmf.14; 2024-01-16), MediaWiki CodeJam Dec 2023, MediaWiki-Platform-Team, Essential-Work, Content-Transform-Team-WIP, Sustainability (Incident Followup), MediaWiki-extensions-WikimediaIncubator, Technical-Debt (Deprecation process), MediaWiki-ContentHandler
Uzume added a comment to T114432: [RFC] Heredoc arguments for templates (aka "hygienic" or "long" arguments).

This is an interesting problem however, this thread is getting into TL;DR territory (I need to come back to this and read everything). However, referring to the problem statements in the issue's description:

  • issue no. 2 can be mitigated by using a single template in a fashion like {{table|start}} ... {{table|end}}
Apr 26 2024, 8:03 AM · MediaWiki-Parser-Templates, Patch-Needs-Improvement, TechCom-RFC (TechCom-RFC-Closed), Parsing-Team--ARCHIVED, Wikimedia-Developer-Summit-2016
Uzume added a comment to T161976: Feature request: add detection for page language to Scribunto.

I am glad to see this long overdue implementation finally arrive, however, it was made "expensive" regardless of whether other related title information was already fetched about the same target page during the parser rendering of the current page. This is substantially different from the PAGELANGUAGE variable which does not appear to be "expensive" (I only looked at the documentation and not the code so far).

Apr 26 2024, 6:25 AM · MW-1.42-notes (1.42.0-wmf.15; 2024-01-23), MediaWiki CodeJam Dec 2023, MW-1.38-notes (1.38.0-wmf.20; 2022-01-31), Patch-For-Review, Scribunto

Apr 21 2024

Uzume added a comment to T362502: I can't capture <pagelist/> error with {{#iferror}}.

@Xover: Thanks for pointing that out. For some reason I missed that reference.

Apr 21 2024, 8:37 PM · ParserFunctions, ProofreadPage

Apr 17 2024

Uzume added a comment to T362502: I can't capture <pagelist/> error with {{#iferror}}.

<pagelist/> is just a fancy way of generating links to the pages (which you can actually do by hand if you want; this is somewhat still necessary when one has a group of media like a collection of JPEG images, etc.).

Apr 17 2024, 2:02 AM · ParserFunctions, ProofreadPage

Apr 16 2024

Uzume added a comment to T362502: I can't capture <pagelist/> error with {{#iferror}}.

If the issue goes away with a purge there isn't really an issue with the page anyway. The issue is with a stale page cache. If you manage to update a page with such an error in order add processing to catch and categorize such an error, then the error would already have gone away because you necessarily had to update the cached page by editing the page either directly or through one of its transclusions.

Apr 16 2024, 8:47 PM · ParserFunctions, ProofreadPage

Apr 6 2024

Uzume added a comment to T284504: ProofreadPages: Allow <pages/> tag on index pages.

I assume this also breaks (via the quoted code) when <pages/> tag is included to an Index page indirectly (e.g., Xyzzy/ToC has <pages/> and an Index transcludes such via something like: {{Xyzzy/ToC}}. Incidentally, is this an across the board restriction or just one for prevent circular transclusion? Meaning can I use a <pages/> tag in an Index so long as the <pages/> tag refers to a different Index (e.g., a ToC in one volume of a multivolume work where volumes not containing the ToC could still include the ToC in their Index pages using a <pages/> tag` because it is not a circular reference)?

Apr 6 2024, 11:50 AM · ProofreadPage

Mar 26 2024

Uzume added a comment to T272220: Remove "master" terminology from wmfdata-python.

I am not really against removal of "slave" terms as there and usually plenty of other more precise words that can be used that are unrelated to human slavery, however, I am against unnecessary remove of "master" terminology as it was only applied to slave owners considerably after it already had many other usages and meanings (e.g., mastering a skill and the origin of the "Mr.", etc.). I have no issues with "master" branches and think it is silly and not useful to seriously consider trying to remove such references.

Mar 26 2024, 9:45 PM · Product-Analytics, Data-Engineering, Wmfdata-Python

Feb 22 2024

Uzume added a comment to T120794: Create redirect when moving modules.

It seems to me a superior solution would just be to use the existing wikitext redirects (necessitating a change in the content model upon rename/moves) and have Scribunto fetch the targets of such things before it #invokes, requires, etc.

Feb 22 2024, 6:31 PM · User-notice-archive, MediaWiki CodeJam Dec 2023, MW-1.42-notes (1.42.0-wmf.10; 2023-12-19), Platform Engineering (Icebox), User-DannyS712, Scribunto

Mar 17 2023

Uzume added a comment to T282499: Consider whether Parsoid will support forced linear parsing..

Splitting the content model into sequential and non-sequential content models is an interesting concept but I am not sure that is really that useful or necessary.

Mar 17 2023, 7:01 PM · MediaWiki-extensions-ExternalData, Parsoid-Read-Views (Phase 3 - Main namespace of officewiki / mediawiki.org renders with Parsoid), MediaWiki-extensions-Variables, Parsoid, MediaWiki-Parser
Uzume added a comment to T250963: Replace use of removed hook InternalParseBeforeSanitize.

It seems to me the issue is Parsoid introducing parallel processing across all the individual parts of parsing during a page render. This way it can memoize the results of these individual parts and potentially run them out-of-order. In order for it to accomplish such, Parsoid has to know all of the inputs to any part of the parse and it assumes that anytime the inputs are the same the output will be the same.

Mar 17 2023, 6:18 PM · ci-test-error, MediaWiki-extensions-Variables

Mar 1 2023

Uzume added a comment to T329891: Remove mobile-only modules in Wikistories .

I am not sure how this related to ResourceLoader but this seems pertinent: T313514: Enable Wikistories for Desktop users.

Mar 1 2023, 12:33 PM · MW-1.41-notes (1.41.0-wmf.10; 2023-05-23), Wikistories, Performance-Team (Radar), User-Jdlrobson, Technical-Debt (RW-Tech-Debt), Front-end-Standards-Group
Uzume added a comment to T313514: Enable Wikistories for Desktop users.

Is this related to ResourceLoader and T329891: Remove mobile-only modules in Wikistories ?

Mar 1 2023, 12:32 PM · Wikistories (Commons), Inuka-Team (Kanban)

Feb 28 2023

Uzume added a comment to T252054: Implement magic link functionality as an extension.

I would also like to see Special:BookSources get moved out of core to an Extension:ISBN that provides a Special:ISBN (with a Special:BookSources alias) for things like T148274: Implement a convenient way to link to ISBNs without magic links.

Feb 28 2023, 12:01 PM · Parsoid, MediaWiki-Parser
Uzume added a comment to T148274: Implement a convenient way to link to ISBNs without magic links.

I do not really see the value here. First, I would like to see Special:BookSources moved out of core (e.g., into an extension) not unlike how magic links are likely to be handled anyway (see T252054). And what is wrong with links like [[Special:BookSources/{{{ISBN}}}]]? If you really want something shorter why not just make Special:ISBN be an alias for Special:BookSources (I believe several Special pages have aliases as well as language localization names)? Then ISBN {{{ISBN}}} magic links can just be changed to [[Special:ISBN/{{{ISBN}}}]] style links (e.g., via templates), etc.

Feb 28 2023, 11:55 AM · MediaWiki-Parser

Feb 24 2023

Uzume added a comment to T172408: Implement way of querying Proofread Page Status of a Page: (or revision) from the databases directly.

While I am for adding such revision tags I am against migrating to and depending the value there (which is good for watching, etc.).

Feb 24 2023, 1:49 AM · MW-1.37-notes (1.37.0-wmf.15; 2021-07-19), User-Inductiveload, PHP-API-for-Wikisource, All-and-every-Wikisource, ProofreadPage
Uzume added a comment to T291293: ProofreadPage: Move page proofreading status to an MCR slot.

One possible method towards this could be to use Wikibase in much the same way as Structured Data on Commons was deployed. We could develop something akin to Extension:WikibaseMediaInfo (or perhaps more like Extension:WikibaseLexeme since I am not sure we would need or want names, descriptions and aliases for these new objects) and for querying (per T172408) we could leverage things like WikibaseCirrusSearch, e.g., haswbstatement, wbstatementquantity, etc. or whatever else they are using.

Feb 24 2023, 1:40 AM · Multi-Content-Revisions, ProofreadPage
Uzume added a comment to T290578: Use the change_tag table to store the proofreading quality level.

@Tpt I also prefer the MCR route. I think that might allow better handling of the migration cost too.

Feb 24 2023, 1:07 AM · ProofreadPage
Uzume added a watcher for ProofreadPage: Uzume.
Feb 24 2023, 12:17 AM
Uzume added a member for ProofreadPage: Uzume.
Feb 24 2023, 12:16 AM

Feb 22 2023

Uzume added a comment to T36355: add a variable and parser function for the namespace number.

I find it strange that NAMESPACENUMBER was added that works on full pagenames but nothing was added to do the same with just namespace names—the corollary to ns: and nse:. It is easy enough to workaround as I can always just smash on a random pagename to a namespace and then pass to NAMESPACENUMBER: but that seems crudely unnecessary and a strange oversight.

Feb 22 2023, 11:59 PM · MediaWiki-Parser

Feb 15 2023

Uzume updated subscribers of T326480: PHP 8.2 failure for ApiResultTest::testTransformations for different order of the result.

Changelog contains

Fixed bug GH-9296 (ksort behaves incorrectly on arrays with mixed keys).

https://github.com/php/php-src/issues/9296 wants to enforce the "saner" comparision linked by you also be done for SORT_REGULAR places.

https://php.watch/versions/8.2/ksort-SORT_REGULAR-order-changes

Feb 15 2023, 6:50 AM · MW-1.39-notes, MW-1.41-notes, MW-1.40-notes, MW-1.42-notes (1.42.0-wmf.26; 2024-04-09), Patch-For-Review, MediaWiki-Core-Tests, MediaWiki-Action-API, PHP 8.2 support

Feb 13 2023

Uzume added a comment to T299279: Pages with Scribunto content model get English content language and short description.

In addition to the "Central description" ("Zentrale Beschreibung") the "Page information" ("Seiten­informationen") also directly specifies "Page content language" ("Seiteninhaltssprache"):

Which is clearly "en" and not "de".

Feb 13 2023, 4:57 PM · Scribunto
Uzume added a comment to T120794: Create redirect when moving modules.

Using something simplistic like the proposed return require( "Module:Target" ) can easily be detected by the target module, perhaps even affecting its functionality. Specifically, ... will have a value during the module initialization because the module is required and not directly #invoked and mw.getCurrentFrame():getTitle() will refer to the #invoked title and not the redirection target. Also reported script errors will list the #invoked module in the call stack because the target module is not truly treated as the Lua "main" module.

Feb 13 2023, 9:33 AM · User-notice-archive, MediaWiki CodeJam Dec 2023, MW-1.42-notes (1.42.0-wmf.10; 2023-12-19), Platform Engineering (Icebox), User-DannyS712, Scribunto

Feb 12 2023

Uzume added a comment to T326480: PHP 8.2 failure for ApiResultTest::testTransformations for different order of the result.

Please make sure any solution works for both int-like and float-like numeric key values. It would be bad if 9.1 and "9.1" no longer sorted as larger than 9 and "9".

Feb 12 2023, 8:55 PM · MW-1.39-notes, MW-1.41-notes, MW-1.40-notes, MW-1.42-notes (1.42.0-wmf.26; 2024-04-09), Patch-For-Review, MediaWiki-Core-Tests, MediaWiki-Action-API, PHP 8.2 support

Jun 26 2022

Uzume added a comment to T181714: Two cases of failing upload, followed by a successful upload with a different file name.

That's a good idea.

I don't think we can email them though (can bots do that?). Perhaps we can add a note to their Commons talk page?

Jun 26 2022, 10:17 AM · IA Upload
Uzume added a comment to T179736: Fixable vs. unfixable IA Upload failures: overview.

Is it the case that the zip files we want are identified by format = 'Abbyy GZ' in the files' list? That seems to identify the jp2 and tif zip files in the items I've looked at.

Jun 26 2022, 10:01 AM · IA Upload
Uzume added a comment to T166303: Add indicator to see if a queued task is dead on not on IA-upload.

It would be nice if the jobs metadata and the logs were kept longer. That said all jobs should have a master timeout and die that makes them all end up in the completed/aborted bucket. That bucket can then be cleaned after a longer period. This allows one to manually retry jobs by clicking a button on the aborted ones if one can ascertain from the logs that the error was somehow temporal.

Jun 26 2022, 9:22 AM · IA Upload
Uzume added a comment to T161456: Use Commons (individual files?) as a source for building DjVu files.

I think the issues with that could be:

  • the OCR from IA is of better quality than we can do on Labs (I think I'm right in saying they use Abby FineReader, we use Tesseract or Google Cloud Vision API);
Jun 26 2022, 9:14 AM · All-and-every-Wikisource, IA Upload, Commons

Jun 24 2022

Uzume added a comment to T285124: Trying to get property 'key' of non-object.

@Samwilson After pruning old job items via the fix for T183338, the line moved from 150 to 111 but it is possible it was fixed. Is this still happening?

Jun 24 2022, 7:39 AM · IA Upload
Uzume added a comment to T286701: IA-Upload: retry failed commons uploads (504 errors).

Why not just use direct URL upload to begin with? Let Commons pull it from IA Upload then we do not have to worry about teaching addwiki async chunked uploading as IA Upload's part would be downloading instead (and from the perspective of IA Upload, another request is inherently asynchronous). This has the added benefit of transparency as we would have to provide the media URL and file description metadata anyway.

Jun 24 2022, 7:21 AM · IA Upload, Community-Tech

Jun 23 2022

Uzume added a comment to T296834: IA-Upload: DLI conversion failure for in.ernet.dli.2015.226478.

This depends how how you are trying to process that. That IA item does not have an existing DjVu file (it was created well after March 2016 when they stopped making those).

Jun 23 2022, 9:05 PM · Internet-Archive, IA Upload
Uzume added a comment to T300705: pdf2djvu solves difficult cases.

Currently IA Upload uploads DjVu obtained via three possible sources:

  1. Use existing DjVu
  2. From original scans (JP2)
  3. From PDF (maybe of lower quality)
Jun 23 2022, 7:27 PM · IA Upload
Uzume renamed T300761: Text layer mismatch with page images on DjVu created from original scans (JP2) from Text layer mismatch with page images on djvu created from original scan (JP2) to Text layer mismatch with page images on DjVu created from original scans (JP2).
Jun 23 2022, 5:38 PM · IA Upload, Commons
Uzume renamed T300761: Text layer mismatch with page images on DjVu created from original scans (JP2) from Text layer mismatch with page images on djvu created with Abbyy finereader 15 to Text layer mismatch with page images on djvu created from original scan (JP2).
Jun 23 2022, 5:24 PM · IA Upload, Commons
Uzume added a comment to T300761: Text layer mismatch with page images on DjVu created from original scans (JP2).

I too have run into this issue and I do not think it is is so much of an issue with the OCR layer being on the wrong pages per se as the
Jp2DjvuMaker including extraneous pages from the jp2 set and thus the OCR layers effectively no longer match up with resultant pages.

Jun 23 2022, 5:17 PM · IA Upload, Commons

Jan 18 2022

Uzume added a comment to T161976: Feature request: add detection for page language to Scribunto.

For reference, {{PAGELANGUAGE}} mentioned in the description was added in T59603, however, it only allows one to obtain the language of the page being rendered (since it does not take any arguments unlike {{PAGENAME}} and friends) not the content language of arbitrary pages despite arbitrary page content being available via getContent on mw.title objects.

Jan 18 2022, 12:46 PM · MW-1.42-notes (1.42.0-wmf.15; 2024-01-23), MediaWiki CodeJam Dec 2023, MW-1.38-notes (1.38.0-wmf.20; 2022-01-31), Patch-For-Review, Scribunto
Uzume added a member for MediaWiki-ContentHandler: Uzume.
Jan 18 2022, 12:15 PM
Uzume updated the task description for T299369: Consider removing global $userLang from onPageContentLanguage hook.
Jan 18 2022, 12:11 PM · Patch-For-Review, MW-1.42-notes (1.42.0-wmf.14; 2024-01-16), MediaWiki CodeJam Dec 2023, MediaWiki-Platform-Team, Essential-Work, Content-Transform-Team-WIP, Sustainability (Incident Followup), MediaWiki-extensions-WikimediaIncubator, Technical-Debt (Deprecation process), MediaWiki-ContentHandler
Uzume added a comment to T299369: Consider removing global $userLang from onPageContentLanguage hook.

I am hoping that a resolution here can lead to a resolution of T161976 for which the fix was reverted due to T298659 and ultimately resulted in this issue. I believe if page content is available during the rendering of another page that the purported content language of the available page content should also be available during the page render (much like the content model already is).

Jan 18 2022, 12:08 PM · Patch-For-Review, MW-1.42-notes (1.42.0-wmf.14; 2024-01-16), MediaWiki CodeJam Dec 2023, MediaWiki-Platform-Team, Essential-Work, Content-Transform-Team-WIP, Sustainability (Incident Followup), MediaWiki-extensions-WikimediaIncubator, Technical-Debt (Deprecation process), MediaWiki-ContentHandler

Jan 11 2022

Uzume added a comment to T298659: BadMethodCallException: Sessions are disabled for load entry point.

The entry point for this problem seems to be ContentHandler::getPageLanguage(), as called by the recently added Scribunto code for mw.title.pageLanguage. This method does not logically depend on the context/user language, except for where it passes $wgLang as third parameter to HookRunner::onPageContentLanguage.

This seems like a bad idea and not something that is imho reasonable to support in MediaWiki, and incompatible with the general model of how this feature works. For one, it would break any assumption that this is persistable to the database.

I feared the Translate extension might depend on thit, but from a Codesearch query we find it is actually not recongising this parameter at all. Instead, it is LiquidThreads and WikimediaIncubator using these to make some of their wiki pages act like a special page, in that they automagically render in the UI language. This is strange since we already expose the use language to the parser and allow it to be used. Rather, the hook is additionally forcing the internal page language to be perceived as if it were the current user language. Perhaps in the short-mid term we can find a different way to support the underlying need there.

Jan 11 2022, 3:49 AM · MW-1.38-notes (1.38.0-wmf.17; 2022-01-10), MediaWiki-Recent-changes, Growth-Team, Scribunto, Wikimedia-production-error

Jan 10 2022

Uzume added a comment to T197516: pairs() doesn't preserve the order of parameters passed to the module.

The problem with this is that this affects the MediaWIki core code since the Scribunto extension just uses the core template frame code to parse the parameters and arguments. That code makes no attempt to retain original order and is thus this information is lost (despite such order being available to parser functions like #invoke in general). To make matters more complex, parameters and arguments are not only available (likely out of order) for the current #invoke frame but also the parent wikitext template frame. Wikitext template's also need both numbered and named parameters but so far have had no need to retain original ordering. Since Scribunto allows access to the parent frame args which also does not retain the original ordering they were given in this necessitates a core change to fix such.

Jan 10 2022, 2:48 AM · Scribunto

Nov 24 2021

Uzume added a comment to T296337: SDoC {{#statements}} parser function gives bad data in some situations.

It should probably be noted that there are Wikidata items that state they represent (P31) Wikimedia categories (Q4167836). Some of those have category sitelinks at Commons, i.e., Q9013822 sitelinks to Category:Text logos). These should probably not be considered in error despite also having P373 "Commons category" statements claiming the same value. Having a MediaInfo entity's statements linking to such Wikidata items might be considered erroneous (depending on the claims).

Nov 24 2021, 4:55 PM · StructuredDataOnCommons

Nov 23 2021

Uzume added a comment to T296337: SDoC {{#statements}} parser function gives bad data in some situations.

I am not sure if this is actually unexpected. {{#statements:P195|from=M89709639}} yields <span><span>[[Category:Media contributed by Toledo-Lucas County Public Library|Toledo-Lucas County Public Library]]</span></span> because of the P195 claim on M89709639 that points to Q7814140 which in turn has the commons sitelink that points to Category:Media contributed by Toledo-Lucas County Public Library (I doubt that sitelink is really correct and could use to be fixed, e.g., {{#property:P373|from=Q7814140}} also yields Media contributed by Toledo-Lucas County Public Library).

Nov 23 2021, 11:59 PM · StructuredDataOnCommons

Nov 14 2021

Uzume added a comment to T18691: RFC: Section header "share" link.

That doesn't seem feasible, and it's outside of the scope of this task

I never suggested such was feasible or in scope, however, I do think it deserves a discussion point as the reason for wanting such external linking is actually even larger for editor created anchors than for sections. As such, they help define the problem here and possible arguments for or against the proposals here.

Nov 14 2021, 11:20 PM · Tech Ambassadors & Translators, User-Jdlrobson, Platform Team Workboards (Clinic Duty Team), TechCom-RFC, Design, MediaWiki-User-Interface
Uzume added a watcher for TechCom-RFC: Uzume.
Nov 14 2021, 8:40 PM
Uzume added a comment to T18691: RFC: Section header "share" link.

It should be noted that while these work:

the following does not work nor refer to the same thing:

but rather one has to use something like:

Nov 14 2021, 4:13 PM · Tech Ambassadors & Translators, User-Jdlrobson, Platform Team Workboards (Clinic Duty Team), TechCom-RFC, Design, MediaWiki-User-Interface

Jun 26 2020

Uzume added a comment to T161976: Feature request: add detection for page language to Scribunto.

I actually did not do that. I think somehow I must have edited/submitted and older version (though I am not sure how as that was not my intention).

Jun 26 2020, 4:58 AM · MW-1.42-notes (1.42.0-wmf.15; 2024-01-23), MediaWiki CodeJam Dec 2023, MW-1.38-notes (1.38.0-wmf.20; 2022-01-31), Patch-For-Review, Scribunto

Jun 23 2020

Uzume edited projects for T161976: Feature request: add detection for page language to Scribunto, added: Patch-For-Review; removed Patch-Needs-Improvement.

@Aklapper Does it really need triage? There was already a patch for it (thought it seems to need to be updated). I can see how the patch itself needs triage but the issue seems well understood. Anomie already clarified that mw.language.getPageLanguage was not the right thing and demonstrated that a pageLanguage field of mw.title objects was the way to go. What further triage does this issue really need? I only assigned it to Anomie so that he would respond based on the patch he created. I understand if he wanted to remove himself at this point in time but the point was to get him to make such a statement.

Jun 23 2020, 7:49 AM · MW-1.42-notes (1.42.0-wmf.15; 2024-01-23), MediaWiki CodeJam Dec 2023, MW-1.38-notes (1.38.0-wmf.20; 2022-01-31), Patch-For-Review, Scribunto

Jun 22 2020

Uzume updated the task description for T30299: An image redirect from a foreign shared File Repository overrides local wiki page..
Jun 22 2020, 7:55 AM · MediaWiki-Redirects

Jun 21 2020

Uzume added a comment to T254102: Make getContent() work for interwiki pages.

I highly doubt this sort of functionality will arrive anytime soon. The main issue is that if a Scribunto module supplies different output based on different input from remote wikis, how does Mediawiki track the links and maintain the page rendering caches (so cached output gets properly updated when a dependency changes)? To accomplish this sort of dependency tracking the link tables would have to somehow be expanded to support cross wiki linking so that things like [[Special:Whatlinkshere]] can list remote page transclusions, etc. (perhaps you read that getContent causes the page to be recorded as a transclusion and this is why).

Jun 21 2020, 3:24 PM · Crosswiki, Scribunto
Uzume moved T254102: Make getContent() work for interwiki pages from Backlog to Transclusions on the Crosswiki board.
Jun 21 2020, 3:23 PM · Crosswiki, Scribunto
Uzume moved T207932: Special:Notifications should display all cross-wiki notifications anytime whatever their status from Backlog to Notifications on the Crosswiki board.
Jun 21 2020, 3:23 PM · Growth-Team-Filtering, Crosswiki, Growth-Team, Notifications
Uzume moved T235856: Enable cross-wiki searching for 3+ new languages/projects (stretch) from Backlog to Search on the Crosswiki board.
Jun 21 2020, 3:22 PM · Crosswiki, Discovery-Search
Uzume moved T243412: Allow cross-wiki notifications to work without CentralAuth from Backlog to Notifications on the Crosswiki board.
Jun 21 2020, 3:21 PM · Growth-Team-Filtering, Crosswiki, Growth-Team, Notifications
Uzume assigned T161976: Feature request: add detection for page language to Scribunto to Anomie.

@Anomie Can we get your change from over three years ago merged? This is an easy and straightforward fix but Gerrit is reporting some sort of merge conflict even though Jenkins had no issues with it.

Jun 21 2020, 1:42 PM · MW-1.42-notes (1.42.0-wmf.15; 2024-01-23), MediaWiki CodeJam Dec 2023, MW-1.38-notes (1.38.0-wmf.20; 2022-01-31), Patch-For-Review, Scribunto

Jun 20 2020

Uzume edited Description on Wikimedia-Hackathon-2020.
Jun 20 2020, 8:38 AM

Jun 16 2020

Uzume added a comment to T192462: mw.wikibase.entityExists returns false for redirected entities.

So quick question from people who need this (ping @Uzume and @eranroz): Should mw.wikibase.entityExists('Q123') return true in case Q123 is redirect all the time or when it's redirecting to an existing entity?

Jun 16 2020, 12:48 AM · MW-1.34-notes (1.34.0-wmf.22; 2019-09-10), User-Ladsgroup, Wikidata-Campsite (Wikidata-Campsite-Iteration-∞ (On Hold)), Wikidata, MediaWiki-extensions-WikibaseClient, Wikibase-Lua

May 14 2020

Uzume added a comment to T250763: Cannot use SyntaxHighlight: "Unable to run external programs, proc_open() is disabled".

This is actually a regression when the extension moved from GeSHi (PHP) to Pygments (Python): T85794: Convert SyntaxHighlight_Geshi from Geshi to Pygments (was: Don't register 250+ modules) Since MW is PHP the extension can just use GeSHi as a library without having to fork a separate process via one of the unsafe functions removed in the hardened PHP.

May 14 2020, 12:35 AM · SyntaxHighlight

Jan 24 2020

Uzume added a comment to T128173: Represent editions as interwiki links on Wikisource.

The issue becomes how to represent multiple edition links in Mediawiki toolbars across multiple WMF wikis across their projects. Currently, as implemented via WD sitelinks, we only allow one link per wiki per project per WD item. This is in part owning to the limited space in the Mediawiki toolbars where such links are displayed. Even across wikis within a single project when only a single link is allowed per wiki, there can sometimes be a *very* large number of links (there are many languages in Wikipedia alone and already there are mechanisms that limit the number of sitelinks displayed in the toolbar by default).

Jan 24 2020, 9:00 AM · Patch-For-Review, MW-1.35-notes (1.35.0-wmf.36; 2020-06-09), All-and-every-Wikisource, Wikidata

Jan 23 2020

Uzume added a member for All-and-every-Wikisource: Uzume.
Jan 23 2020, 3:46 AM
Uzume added a member for SDC General: Uzume.
Jan 23 2020, 3:18 AM