Page MenuHomePhabricator

wmr (維基小霸王)
User

Projects

User does not belong to any projects.

Today

  • Clear sailing ahead.

Tomorrow

  • Clear sailing ahead.

Saturday

  • Clear sailing ahead.

User Details

User Since
Nov 26 2014, 3:27 AM (501 w, 1 d)
Availability
Available
LDAP User
Unknown
MediaWiki User
維基小霸王 [ Global Accounts ]

Recent Activity

Mar 25 2024

wmr created T360890: Option to seamlessly transclude consecutive files in Wikisource.
Mar 25 2024, 1:04 PM · ProofreadPage

Mar 19 2024

wmr added a comment to T352503: Google OCR problem for punctuation marks out of line in Chinese old book .

There has been a month and still no updates to the vision release notes.

Mar 19 2024, 1:39 AM · Upstream, Wikimedia OCR, Chinese-Sites

Feb 26 2024

wmr added a comment to T355763: mw.text.jsonDecode() incorrectly interprets number strings as integers.

A key in a JSON object is always a string. Therefore a number is impossible. However a Lua table needs to be a JSON object if any not one-based array shall be mapped.

Only 1-based arrays can be mapped directly as JSON array ↔ Lua sequence table.

A Lua table (mapping object) is permitted to use any data type as key, even boolean and floating point numbers. Even worse, also a table. And all data types may be mixed as keys within one table.

If you have a Lua object (table) with int keys 0, 1, 2 you need to convert these keys in JSON as "0", "1", "2". On backward conversion you have the choice in Lua whether you want string keys or number keys.

Feb 26 2024, 12:19 PM · Scribunto
wmr added a comment to T355763: mw.text.jsonDecode() incorrectly interprets number strings as integers.

The documentation states:

Feb 26 2024, 1:14 AM · Scribunto

Feb 23 2024

wmr updated subscribers of T355763: mw.text.jsonDecode() incorrectly interprets number strings as integers.
Feb 23 2024, 8:57 AM · Scribunto
wmr added a comment to T355763: mw.text.jsonDecode() incorrectly interprets number strings as integers.

Please replicate the bug with the following code in debug console in Commons:

Feb 23 2024, 6:21 AM · Scribunto

Jan 29 2024

wmr added a comment to T355763: mw.text.jsonDecode() incorrectly interprets number strings as integers.

Any response?

Jan 29 2024, 1:13 AM · Scribunto

Jan 24 2024

wmr updated the task description for T355763: mw.text.jsonDecode() incorrectly interprets number strings as integers.
Jan 24 2024, 1:43 PM · Scribunto
wmr added a comment to T355763: mw.text.jsonDecode() incorrectly interprets number strings as integers.
} elseif ( $isEncoding && ctype_digit( $k ) ) {
  // json_decode currently doesn't return integer keys for {}
  $isSequence = $next++ === (int)$k;
} else {
Jan 24 2024, 1:32 PM · Scribunto
wmr updated subscribers of T355763: mw.text.jsonDecode() incorrectly interprets number strings as integers.
Jan 24 2024, 1:17 PM · Scribunto
wmr created T355763: mw.text.jsonDecode() incorrectly interprets number strings as integers.
Jan 24 2024, 11:46 AM · Scribunto

Jan 19 2024

wmr added a comment to T352503: Google OCR problem for punctuation marks out of line in Chinese old book .

Great! Hope it will solve the problem.

Jan 19 2024, 1:51 AM · Upstream, Wikimedia OCR, Chinese-Sites
wmr added a comment to T352503: Google OCR problem for punctuation marks out of line in Chinese old book .

Is there anyone know how to let Google change this?

Jan 19 2024, 1:15 AM · Upstream, Wikimedia OCR, Chinese-Sites

Jan 8 2024

wmr added a comment to T354500: Allow Optional Starting File Navigation from Current Page in Categories.

What this means is that where the generated HTML for the category link is currently <a href="/wiki/Category:BAR" title="FOO">BAR</a>, @wmr would like to be able to have <a href="/w/index.php?title=Category:BAR&filefrom=FOO" title="FOO">BAR</a>. It would be pretty easy to have a client-side gadget to do this for those who want it.

Jan 8 2024, 7:11 AM · MediaWiki-Categories, Commons
wmr updated the task description for T354500: Allow Optional Starting File Navigation from Current Page in Categories.
Jan 8 2024, 1:39 AM · MediaWiki-Categories, Commons
wmr updated the task description for T354500: Allow Optional Starting File Navigation from Current Page in Categories.
Jan 8 2024, 1:38 AM · MediaWiki-Categories, Commons
wmr removed a project from T354500: Allow Optional Starting File Navigation from Current Page in Categories: Internet-Archive.
Jan 8 2024, 1:37 AM · MediaWiki-Categories, Commons
wmr created T354500: Allow Optional Starting File Navigation from Current Page in Categories.
Jan 8 2024, 1:36 AM · MediaWiki-Categories, Commons

Dec 1 2023

wmr renamed T352503: Google OCR problem for punctuation marks out of line in Chinese old book from Google OCR problem for Chinese old book to Google OCR problem for punctuation marks out of line in Chinese old book .
Dec 1 2023, 6:21 AM · Upstream, Wikimedia OCR, Chinese-Sites
wmr added a comment to T332125: Google OCR error: "We can not access the URL currently".

PR for re-sending the full image data: https://github.com/wikimedia/wikimedia-ocr/pull/120

@wmr the other issues you raise here are not related to the current task, could you please create new tasks for these if you think they need addressing? Thanks!

Dec 1 2023, 6:20 AM · All-and-every-Wikisource, Community-Tech, Wikimedia OCR
wmr created T352503: Google OCR problem for punctuation marks out of line in Chinese old book .
Dec 1 2023, 6:19 AM · Upstream, Wikimedia OCR, Chinese-Sites

Aug 30 2023

wmr added a comment to T332125: Google OCR error: "We can not access the URL currently".

The solution would be easy. Just write a bot, download a PDF from commons, and convert the file to jpg locally. Upload every jpg to Google, get the OCRed text, and use the bot put text to Wikisource.

It would only require two parameters for users to input: filename of pdf (or djvu) and the target Wikisource domain name (like zh.wikisource.org). The user should be autoconfirmed user in the target Wikisource and should confirm that they think the quality would OK (avoid handwritten manuscript that would have bad OCR quality).

Mass OCR is explicitly forbidden quite a lot of language wikisources

Aug 30 2023, 10:41 AM · All-and-every-Wikisource, Community-Tech, Wikimedia OCR
wmr updated subscribers of T332125: Google OCR error: "We can not access the URL currently".
Aug 30 2023, 3:32 AM · All-and-every-Wikisource, Community-Tech, Wikimedia OCR
wmr added a comment to T332125: Google OCR error: "We can not access the URL currently".

The solution would be easy. Just write a bot, download a PDF from commons, and convert the file to jpg locally. Upload every jpg to Google, get the OCRed text, and use the bot put text to Wikisource.

Aug 30 2023, 3:16 AM · All-and-every-Wikisource, Community-Tech, Wikimedia OCR

Jul 3 2023

wmr added a comment to T332125: Google OCR error: "We can not access the URL currently".

A limitation of Google OCR has been found: it cannot recognize punctuation marks outside vertical lines. This is a common typesetting practice during the Chinese Republican era. For example, for this image, no punctuation marks were recognized. Are there any options available on Google to recognize them?

This is out of topic. But where should I report this? Is this count as bug?

It is (sort of) a bug, but there's nothing we can do about it, as it exists wholly within Google's service. The API docs are here: https://cloud.google.com/vision/docs/reference/rest/v1/Feature — there's not much in the way of configurability for text detection, beyond languageHints[].

Jul 3 2023, 9:01 AM · All-and-every-Wikisource, Community-Tech, Wikimedia OCR
wmr added a comment to T332125: Google OCR error: "We can not access the URL currently".

A limitation of Google OCR has been found: it cannot recognize punctuation marks outside vertical lines. This is a common typesetting practice during the Chinese Republican era. For example, for this image, no punctuation marks were recognized. Are there any options available on Google to recognize them?

This is out of topic. But where should I report this? Is this count as bug?

It is (sort of) a bug, but there's nothing we can do about it, as it exists wholly within Google's service. The API docs are here: https://cloud.google.com/vision/docs/reference/rest/v1/Feature — there's not much in the way of configurability for text detection, beyond languageHints[].

Jul 3 2023, 9:01 AM · All-and-every-Wikisource, Community-Tech, Wikimedia OCR

Jun 26 2023

wmr added a comment to T332125: Google OCR error: "We can not access the URL currently".

A limitation of Google OCR has been found: it cannot recognize punctuation marks outside vertical lines. This is a common typesetting practice during the Chinese Republican era. For example, for this image, no punctuation marks were recognized. Are there any options available on Google to recognize them?

Jun 26 2023, 2:09 PM · All-and-every-Wikisource, Community-Tech, Wikimedia OCR
wmr added a comment to T332125: Google OCR error: "We can not access the URL currently".

I think the problem lies with Wikimedia Commons being slow to respond. If someone manually opens a rarely accessed book on browser and randomly selects a page to view, the server might not display it immediately; it may take some time. The server should extract pages from PDF files, cache them as image files, and then display them. For OCR, there should be dedicated tools to download the entire PDF file, convert it to images using those tools, and then send them to Google for OCR processing.

Jun 26 2023, 2:02 PM · All-and-every-Wikisource, Community-Tech, Wikimedia OCR

Jun 5 2023

wmr added a comment to T337718: Request increased quota for add-text Toolforge tool.

I want to run many things at the same time. Thanks.

Jun 5 2023, 5:45 AM · Toolforge (Quota-requests)

May 31 2023

wmr added a comment to T337707: Page numbers not displaying in Chinese Wikisource .

Yes. We need a lot of tech workforce in zhws.

May 31 2023, 9:10 AM · Chinese-Sites, ProofreadPage
wmr added a comment to T337707: Page numbers not displaying in Chinese Wikisource .

Ah. I found it. zhWS used to import an old old version of the pagenumbers script from enWS, but it was removed last November by @WikiBayer apparently in response to a steward request. Probably because that script on enWS is very very old and has not worked for a very long time. enWS has migrated to an actual Gadget (MediaWiki:Gadget-PageNumbers.js, MediaWiki:Gadget-PageNumbers-core.js, and MediaWiki:Gadget-PageNumbers-core.css).

May 31 2023, 1:33 AM · Chinese-Sites, ProofreadPage
wmr removed a project from T337707: Page numbers not displaying in Chinese Wikisource : All-and-every-Wikisource.
May 31 2023, 1:32 AM · Chinese-Sites, ProofreadPage
wmr added a comment to T337707: Page numbers not displaying in Chinese Wikisource .

@wmr: Are all other Wikisource sites also affected by this?

May 31 2023, 1:32 AM · Chinese-Sites, ProofreadPage

May 30 2023

wmr created T337718: Request increased quota for add-text Toolforge tool.
May 30 2023, 9:18 AM · Toolforge (Quota-requests)
wmr added a comment to T332125: Google OCR error: "We can not access the URL currently".

Google actually OCR every image pdf it indexes. See the cache pages for

May 30 2023, 3:47 AM · All-and-every-Wikisource, Community-Tech, Wikimedia OCR
wmr created T337707: Page numbers not displaying in Chinese Wikisource .
May 30 2023, 3:42 AM · Chinese-Sites, ProofreadPage
wmr added a comment to T332125: Google OCR error: "We can not access the URL currently".

Google OCR cannot recognize punctuations out of line in Chinese verticle text.

May 30 2023, 3:34 AM · All-and-every-Wikisource, Community-Tech, Wikimedia OCR

Dec 6 2022

wmr closed T272436: wmr-bot home directory using high NFS storage as Resolved.

@fnegri Done.

Dec 6 2022, 11:05 AM · Tools
wmr closed T272436: wmr-bot home directory using high NFS storage, a subtask of T272247: 2021-01-17: tools NFS share cleanup, as Resolved.
Dec 6 2022, 11:04 AM · Toolforge, cloud-services-team (Kanban)
wmr closed T272436: wmr-bot home directory using high NFS storage, a subtask of T323798: 2022-11-25: tools nfs share cleanup, as Resolved.
Dec 6 2022, 11:04 AM · Toolforge, cloud-services-team (Kanban)
wmr closed T272436: wmr-bot home directory using high NFS storage as Resolved.

Now tw/pdf is under 100G. I shall close this issue.

Dec 6 2022, 10:02 AM · Tools
wmr closed T272436: wmr-bot home directory using high NFS storage, a subtask of T272247: 2021-01-17: tools NFS share cleanup, as Resolved.
Dec 6 2022, 10:02 AM · Toolforge, cloud-services-team (Kanban)
wmr closed T272436: wmr-bot home directory using high NFS storage, a subtask of T323798: 2022-11-25: tools nfs share cleanup, as Resolved.
Dec 6 2022, 10:02 AM · Toolforge, cloud-services-team (Kanban)

May 7 2022

wmr created T307836: Add www.ncpssd.org to the wgCopyUploadsDomains allowlist of Wikimedia Commons.
May 7 2022, 3:04 PM · Commons, Wikimedia-Site-requests

Sep 20 2020

wmr added a comment to T261037: Add archive.wul.waseda.ac.jp to the wgCopyUploadsDomains whitelist of Wikimedia Commons.

Can anyone add it please?

Sep 20 2020, 10:49 AM · User-Kizule, Wikimedia-Site-requests, Commons
wmr updated subscribers of T261037: Add archive.wul.waseda.ac.jp to the wgCopyUploadsDomains whitelist of Wikimedia Commons.
Sep 20 2020, 1:45 AM · User-Kizule, Wikimedia-Site-requests, Commons

Sep 3 2020

wmr updated subscribers of T261037: Add archive.wul.waseda.ac.jp to the wgCopyUploadsDomains whitelist of Wikimedia Commons.
Sep 3 2020, 9:32 AM · User-Kizule, Wikimedia-Site-requests, Commons

Aug 31 2020

wmr added a subtask for T60224: Add domains to $wgCopyUploadsDomains (tracking): T261037: Add archive.wul.waseda.ac.jp to the wgCopyUploadsDomains whitelist of Wikimedia Commons.
Aug 31 2020, 4:19 PM · Commons, Tracking-Neverending, Wikimedia-Site-requests
wmr added a parent task for T261037: Add archive.wul.waseda.ac.jp to the wgCopyUploadsDomains whitelist of Wikimedia Commons: T60224: Add domains to $wgCopyUploadsDomains (tracking).
Aug 31 2020, 4:19 PM · User-Kizule, Wikimedia-Site-requests, Commons

Aug 22 2020

wmr created T261037: Add archive.wul.waseda.ac.jp to the wgCopyUploadsDomains whitelist of Wikimedia Commons.
Aug 22 2020, 5:22 AM · User-Kizule, Wikimedia-Site-requests, Commons

Jun 5 2020

wmr updated the task description for T254585: Customizable replacement of HTML line breaks for transcluded pages in Wikisource.
Jun 5 2020, 2:53 PM · All-and-every-Wikisource, Chinese-Sites, I18n, ProofreadPage
wmr updated the task description for T254585: Customizable replacement of HTML line breaks for transcluded pages in Wikisource.
Jun 5 2020, 2:51 PM · All-and-every-Wikisource, Chinese-Sites, I18n, ProofreadPage
wmr updated the task description for T254585: Customizable replacement of HTML line breaks for transcluded pages in Wikisource.
Jun 5 2020, 2:47 PM · All-and-every-Wikisource, Chinese-Sites, I18n, ProofreadPage
wmr created T254585: Customizable replacement of HTML line breaks for transcluded pages in Wikisource.
Jun 5 2020, 2:43 PM · All-and-every-Wikisource, Chinese-Sites, I18n, ProofreadPage
wmr renamed T60729: Don't add space characters between transcluded pages in Chinese Wikisource from Don't add space characters between transcluded pages in Chinese Wikisource to Don't add space characters between transcluded pages and customized HTML line break replacement in Chinese Wikisource.
Jun 5 2020, 2:12 PM · MW-1.32-notes (WMF-deploy-2018-05-08 (1.32.0-wmf.3)), Patch-For-Review, Chinese-Sites, All-and-every-Wikisource, I18n, ProofreadPage
wmr closed T60729: Don't add space characters between transcluded pages in Chinese Wikisource as Resolved.
Jun 5 2020, 2:11 PM · MW-1.32-notes (WMF-deploy-2018-05-08 (1.32.0-wmf.3)), Patch-For-Review, Chinese-Sites, All-and-every-Wikisource, I18n, ProofreadPage
wmr reopened T60729: Don't add space characters between transcluded pages in Chinese Wikisource as "Open".

The function of removing space from line breaks is still badly needed in Chinese Wikisource. Line breaks are kept to help proofreading.

Jun 5 2020, 2:08 PM · MW-1.32-notes (WMF-deploy-2018-05-08 (1.32.0-wmf.3)), Patch-For-Review, Chinese-Sites, All-and-every-Wikisource, I18n, ProofreadPage

Dec 28 2019

wmr reopened T238476: Add *.archives.go.jp to the wgCopyUploadsDomains whitelist of Wikimedia Commons, a subtask of T60224: Add domains to $wgCopyUploadsDomains (tracking), as Open.
Dec 28 2019, 1:17 AM · Commons, Tracking-Neverending, Wikimedia-Site-requests
wmr reopened T238476: Add *.archives.go.jp to the wgCopyUploadsDomains whitelist of Wikimedia Commons as "Open".
Dec 28 2019, 1:17 AM · User-Kizule, Commons, Wikimedia-Site-requests
wmr added a comment to T238476: Add *.archives.go.jp to the wgCopyUploadsDomains whitelist of Wikimedia Commons.
In T238476#5717151, @Zoranzoki21 wrote:

Deployed!

Dec 28 2019, 1:16 AM · User-Kizule, Commons, Wikimedia-Site-requests

Dec 2 2019

wmr triaged T238476: Add *.archives.go.jp to the wgCopyUploadsDomains whitelist of Wikimedia Commons as High priority.
Dec 2 2019, 12:58 AM · User-Kizule, Commons, Wikimedia-Site-requests

Nov 17 2019

wmr updated subscribers of T238476: Add *.archives.go.jp to the wgCopyUploadsDomains whitelist of Wikimedia Commons.
Nov 17 2019, 12:10 PM · User-Kizule, Commons, Wikimedia-Site-requests

Nov 16 2019

wmr created T238476: Add *.archives.go.jp to the wgCopyUploadsDomains whitelist of Wikimedia Commons.
Nov 16 2019, 3:14 PM · User-Kizule, Commons, Wikimedia-Site-requests

Sep 24 2019

wmr updated subscribers of T233682: "Paragraph editing not supported" after clicking "Edit" section link with transclusion of proofreading pages.
Sep 24 2019, 5:33 AM · ProofreadPage, All-and-every-Wikisource
wmr created T233682: "Paragraph editing not supported" after clicking "Edit" section link with transclusion of proofreading pages.
Sep 24 2019, 5:25 AM · ProofreadPage, All-and-every-Wikisource

Aug 9 2019

wmr reopened T229715: Set up 2x logo for Chinese Wikisouce, a subtask of T150618: Provide HD logos for all projects, as Open.
Aug 9 2019, 12:37 PM · Patch-Needs-Improvement, good first task, Google-Code-in-2019, User-Urbanecm, Goal, Wikimedia-Site-requests
wmr reopened T229715: Set up 2x logo for Chinese Wikisouce as "Open".
Aug 9 2019, 12:37 PM · Chinese-Sites, Wikimedia-Site-requests
wmr added a comment to T229715: Set up 2x logo for Chinese Wikisouce.

Deployed.

Aug 9 2019, 11:34 AM · Chinese-Sites, Wikimedia-Site-requests

Jul 24 2019

wmr updated the task description for T228917: Please upload large files to Wikimedia Commons.
Jul 24 2019, 5:54 PM · Commons, Wikimedia-Site-requests
wmr updated the task description for T228917: Please upload large files to Wikimedia Commons.
Jul 24 2019, 5:53 PM · Commons, Wikimedia-Site-requests
wmr updated the task description for T228917: Please upload large files to Wikimedia Commons.
Jul 24 2019, 5:48 PM · Commons, Wikimedia-Site-requests
wmr created T228917: Please upload large files to Wikimedia Commons.
Jul 24 2019, 5:47 PM · Commons, Wikimedia-Site-requests

Dec 5 2016

wmr added a comment to T60729: Don't add space characters between transcluded pages in Chinese Wikisource.

On English Wikisource, it's policy to remove mid-paragraph line breaks.

Dec 5 2016, 3:01 AM · MW-1.32-notes (WMF-deploy-2018-05-08 (1.32.0-wmf.3)), Patch-For-Review, Chinese-Sites, All-and-every-Wikisource, I18n, ProofreadPage
wmr added a comment to T60729: Don't add space characters between transcluded pages in Chinese Wikisource.

If it's part of ProofreadPage, the message should probably be something more specific e.g. proofreadpage-page-separator. Sound okay?

Dec 5 2016, 2:57 AM · MW-1.32-notes (WMF-deploy-2018-05-08 (1.32.0-wmf.3)), Patch-For-Review, Chinese-Sites, All-and-every-Wikisource, I18n, ProofreadPage
wmr added a comment to T60729: Don't add space characters between transcluded pages in Chinese Wikisource.

The addition of space between lines is for all pages regardless of namespace. So fixing this problem should be involving changing somewhere other than this proofreading extension.

Dec 5 2016, 1:17 AM · MW-1.32-notes (WMF-deploy-2018-05-08 (1.32.0-wmf.3)), Patch-For-Review, Chinese-Sites, All-and-every-Wikisource, I18n, ProofreadPage
wmr added a comment to T60729: Don't add space characters between transcluded pages in Chinese Wikisource.

Thank you.

Dec 5 2016, 1:09 AM · MW-1.32-notes (WMF-deploy-2018-05-08 (1.32.0-wmf.3)), Patch-For-Review, Chinese-Sites, All-and-every-Wikisource, I18n, ProofreadPage
wmr added a comment to T60729: Don't add space characters between transcluded pages in Chinese Wikisource.

@wmr: No progress yet because nobody has written a patch yet. You are very welcome to use developer access to submit a proposed code change as a Git branch directly into Gerrit which makes it easier to review it quickly and provide feedback. Thanks!

Dec 5 2016, 12:48 AM · MW-1.32-notes (WMF-deploy-2018-05-08 (1.32.0-wmf.3)), Patch-For-Review, Chinese-Sites, All-and-every-Wikisource, I18n, ProofreadPage

Dec 4 2016

wmr added a comment to T60729: Don't add space characters between transcluded pages in Chinese Wikisource.

Seems that there needs to be a configurable option for a wiki to have a space or not to have a space between transcluded pages. Presumably set in the MW: namespace.

Dec 4 2016, 3:27 PM · MW-1.32-notes (WMF-deploy-2018-05-08 (1.32.0-wmf.3)), Patch-For-Review, Chinese-Sites, All-and-every-Wikisource, I18n, ProofreadPage
wmr added a comment to T60729: Don't add space characters between transcluded pages in Chinese Wikisource.

Why no progress so far?

Dec 4 2016, 3:27 PM · MW-1.32-notes (WMF-deploy-2018-05-08 (1.32.0-wmf.3)), Patch-For-Review, Chinese-Sites, All-and-every-Wikisource, I18n, ProofreadPage

Nov 26 2014

wmr added a comment to T60729: Don't add space characters between transcluded pages in Chinese Wikisource.

Why no progress so far?

Nov 26 2014, 8:45 AM · MW-1.32-notes (WMF-deploy-2018-05-08 (1.32.0-wmf.3)), Patch-For-Review, Chinese-Sites, All-and-every-Wikisource, I18n, ProofreadPage
wmr added a comment to T75967: Remove word seperator between proofreading pages.

Duplicate of T60729 ?

Nov 26 2014, 8:43 AM · ProofreadPage
wmr updated subscribers of T60729: Don't add space characters between transcluded pages in Chinese Wikisource.
Nov 26 2014, 8:42 AM · MW-1.32-notes (WMF-deploy-2018-05-08 (1.32.0-wmf.3)), Patch-For-Review, Chinese-Sites, All-and-every-Wikisource, I18n, ProofreadPage
wmr merged task T75967: Remove word seperator between proofreading pages into T60729: Don't add space characters between transcluded pages in Chinese Wikisource.
Nov 26 2014, 8:42 AM · ProofreadPage
wmr updated subscribers of T75967: Remove word seperator between proofreading pages.
Nov 26 2014, 3:44 AM · ProofreadPage
wmr created T75967: Remove word seperator between proofreading pages.
Nov 26 2014, 3:39 AM · ProofreadPage