
Yurik (Yuri Astrakhan)
User

User Details

User Since
Oct 7 2014, 6:43 PM (316 w, 2 d)
Availability
Available
LDAP User
Yurik
MediaWiki User
Yurik [ Global Accounts ]

Recent Activity

Sat, Oct 10

Yurik added a comment to T265176: Refactor react.i18n to allow functional usage.

Not exactly, even though this is also possible. There are cases in my app when I could be using multiple Banana contexts in the same place, e.g. showing text/links in different languages, and being able to call createMessage( banana, messageKey, ...placeholders ) with my own custom-configured banana instance would be very useful. Another case is that sometimes I need to change how banana itself functions by overriding the default implementation. An example is the well-known MW qqx language code, which lets MW show debug messages instead of the real i18n resolution; for that I would also need to be able to provide a custom banana handler. Thus, separating <Message> from the actual code that implements react substitutions would be highly useful to me.

Sat, Oct 10, 5:28 AM · React-i18n, I18n

Fri, Oct 9

Yurik updated the task description for T265176: Refactor react.i18n to allow functional usage.
Fri, Oct 9, 7:56 PM · React-i18n, I18n
Yurik added a project to T265176: Refactor react.i18n to allow functional usage: I18n.
Fri, Oct 9, 7:55 PM · React-i18n, I18n
Yurik created T265176: Refactor react.i18n to allow functional usage.
Fri, Oct 9, 7:54 PM · React-i18n, I18n

Mon, Oct 5

Yurik added a comment to T264547: The "take" util unable to take ownership of files.

@bd808 thanks for the explanation. This surfaced because I was doing an rsync directly from my laptop via ssh, and since I cannot log in as dibabel directly, I will need to do some extra steps. Would it be possible for take to link to some documentation in its errors about this? Thx!

Mon, Oct 5, 4:41 PM · Toolforge
Yurik updated the task description for T264547: The "take" util unable to take ownership of files.
Mon, Oct 5, 5:24 AM · Toolforge
Yurik renamed T264547: The "take" util unable to take ownership of files from take util unable to take ownership of files to The "take" util unable to take ownership of files.
Mon, Oct 5, 5:24 AM · Toolforge

Sun, Oct 4

Yurik created T264547: The "take" util unable to take ownership of files.
Sun, Oct 4, 9:16 PM · Toolforge

Sep 1 2020

Yurik added a comment to T260746: Clarify Terms of Use on whether using Toolforge as a proxy for GitHub Actions is permitted.

Should the terms page be updated, possibly with a link to this discussion?

Sep 1 2020, 4:51 AM · Toolforge, cloud-services-team (Kanban)

Aug 29 2020

Yurik added a comment to T261451: Add Wikidata support to jawikivoyage.

Just caught this while comparing sitematrix output against the list of allowed sites values. Is there a reason why these are not coming from the same source? Is there a way to get the list that wikibase is using via api?

Aug 29 2020, 3:46 PM · Wikidata-Campsite (Wikidata-Campsite-Iteration-∞), User-Ladsgroup, Wikidata

Aug 28 2020

Akuckartz awarded T154071: Allow non-CC0 licensed data for datasets a Like token.
Aug 28 2020, 7:24 PM · WMF-Legal, Commons-Datasets

Aug 27 2020

Yurik closed T260787: Add project Dibabel to translatewiki.net as Resolved.

Closing as it is clearly complete and works great. Thanks everyone!

Aug 27 2020, 4:41 PM · Language-Team (Language-2020-July-September), translatewiki.net

Aug 26 2020

Yurik added a comment to T261306: TranslateWiki should show short help when PLURAL is used.

I think the current approach assumes that everyone has read https://translatewiki.net/wiki/FAQ

Aug 26 2020, 10:09 PM · translatewiki.net
Yurik created T261306: TranslateWiki should show short help when PLURAL is used.
Aug 26 2020, 12:11 PM · translatewiki.net

Aug 25 2020

Yurik added a comment to T122086: RFC: Sharing templates and modules between wikis - poor man's version (investigation).

@Candalua there are a number of bugs and features I would like to fix/add before advertising it further. For example, the service does not refresh page state often enough, and it does not show dependency status (e.g. if one module depends on another which is stale, it doesn't warn about it). Everyone is welcome to use it though - shouldn't cause any major issues :)

Aug 25 2020, 1:57 PM · Tech-Ambassadors, covid-19, Pywikibot, Proposal, Pywikibot-RfCs
Yurik added a comment to T260787: Add project Dibabel to translatewiki.net.

@abi_ thanks, done! Let me know when you can push the updates to the repo :)

Aug 25 2020, 1:45 PM · Language-Team (Language-2020-July-September), translatewiki.net
Yurik added a comment to T260787: Add project Dibabel to translatewiki.net.

@abi_ would it be possible to do a massive rename to get rid of the dibabel- prefix in all strings? I just realized that the tool ended up with two of them in the translatewiki, and since this is a fully standalone tool, there is no point in having a prefix everywhere in code either.

Aug 25 2020, 6:58 AM · Language-Team (Language-2020-July-September), translatewiki.net
Yurik added a comment to T122086: RFC: Sharing templates and modules between wikis - poor man's version (investigation).

There is now a tool for this... https://dibabel.toolforge.org/

Aug 25 2020, 6:57 AM · Tech-Ambassadors, covid-19, Pywikibot, Proposal, Pywikibot-RfCs
Yurik closed T252453: Create a Toolforge tool for syncing templates and Lua modules between wikis as Resolved.

Technically this is done. See https://dibabel.toolforge.org/

Aug 25 2020, 6:55 AM · Technical-Tool-Request, covid-19

Aug 24 2020

Yurik added a comment to T260787: Add project Dibabel to translatewiki.net.

Thank you everyone for getting it solved! @abi_ could you make it more often than twice a week? I'm pretty sure most contributors would love to see their changes going live asap (the instant gratification is important in FOSS :) ) -- and if there are no changes, it would simply be a noop. I don't think running a bot is very taxing on the system :) Thx!

Aug 24 2020, 3:27 PM · Language-Team (Language-2020-July-September), translatewiki.net

Aug 20 2020

Yurik added a comment to T260787: Add project Dibabel to translatewiki.net.

@abi_ thanks, I reviewed them, made a few minor corrections to the wiki pages, and added translatewiki as the contributor. Pushing to master is fine. If we make some changes to the master's i18n dir (either en.json or some other one), will you automatically pick that up before pushing?

Aug 20 2020, 3:54 PM · Language-Team (Language-2020-July-September), translatewiki.net
Yurik added a comment to T260746: Clarify Terms of Use on whether using Toolforge as a proxy for GitHub Actions is permitted.

@Jack_who_built_the_house well, if GitHub has your ssh key, in theory any software that can get access to it could establish any kind of connection, including a regular session to run arbitrary commands. But if you establish a tunnel and unset the key right away, any software run afterwards can only create socket connections, not run commands (at least that's my understanding).

Aug 20 2020, 2:11 AM · Toolforge, cloud-services-team (Kanban)
Yurik added a comment to T260746: Clarify Terms of Use on whether using Toolforge as a proxy for GitHub Actions is permitted.

@bd808 thanks for the good example. It does provide a workaround, but it also highlights some issues:

Aug 20 2020, 12:36 AM · Toolforge, cloud-services-team (Kanban)

Aug 19 2020

Yurik added a comment to T260746: Clarify Terms of Use on whether using Toolforge as a proxy for GitHub Actions is permitted.

@bd808 I am not sure if the attack surface is significant, and also I don't believe this is relevant

Aug 19 2020, 11:04 PM · Toolforge, cloud-services-team (Kanban)
Yurik added a comment to T260746: Clarify Terms of Use on whether using Toolforge as a proxy for GitHub Actions is permitted.

I should also say I'm skeptical about using Toolforge resources as a proxy for a proprietary/non-free service, getting the advantages of the Toolforge platform but bypassing the open source code requirements.

Aug 19 2020, 7:33 AM · Toolforge, cloud-services-team (Kanban)
Yurik added a comment to T260746: Clarify Terms of Use on whether using Toolforge as a proxy for GitHub Actions is permitted.

Goal

To automate the build & publish process, making it as simple and stable as possible. Desired workflow:

  • User publishes a new release of their tool
  • System automatically builds, tests, and publishes the result to the wiki
Aug 19 2020, 7:02 AM · Toolforge, cloud-services-team (Kanban)

Aug 8 2020

Yurik awarded T256482: kubectl logs fails on /usr/lib/uwsgi/plugins/python_plugin.so a Haypence token.
Aug 8 2020, 12:06 AM · cloud-services-team (Kanban), Toolforge, Kubernetes

Jul 24 2020

Yurik added a comment to T258827: Return response headers in oauthclient->makeOAuthCall().

@Tgr forwarding an api call from a complex js client - if the response is a non-200, the client may decide to handle it differently (I assume non-200 responses are still returned in some form as content, but they won't be JSON).

Jul 24 2020, 11:35 PM · MediaWiki-extensions-OAuth
Yurik created T258827: Return response headers in oauthclient->makeOAuthCall().
Jul 24 2020, 7:42 PM · MediaWiki-extensions-OAuth

Jul 22 2020

Yurik added a comment to T258596: Gathering data from multiple Wikimedia Commons Tabular Entries.

Sure, Vega allows you to load from more than one data source, as long as you explicitly list them all. Afterwards, you can create a new data source that joins all the other ones (i think it was available in Vega 2.0 that Wikipedia is using)

Jul 22 2020, 6:28 PM · Commons-Datasets, MediaWiki-extensions-Graph

Jul 21 2020

Yurik awarded T165217: HTMLCheckMatrix cannot be preloaded a Like token.
Jul 21 2020, 2:51 AM · MediaWiki-HTMLForm

May 20 2020

Yurik placed T56221: Support for text/syntax/markup driven or WYSIWYG editable charts, diagrams, graphs, flowcharts etc. (Identify, develop, review and deploy extension on Wikimedia wikis to add) up for grabs.
May 20 2020, 7:03 PM · Commons, Wikimedia-Extension-setup
Yurik placed T134237: Graphoid returns a 400 on MW API time-out up for grabs.
May 20 2020, 7:03 PM · Platform Team Legacy (Watching / External), Services (watching), Operations, Graphoid
Yurik placed T139399: Fix Graphoid rendering issue + warning up for grabs.
May 20 2020, 7:02 PM · Graphoid
Yurik placed T166667: Whitelist .wmflabs.org for wikiraw:// protocol up for grabs.
May 20 2020, 7:02 PM · MediaWiki-extensions-Graph

May 19 2020

Volker_E awarded T9356: User-specified HTML IDs can be the same as interface IDs a Mountain of Wealth token.
May 19 2020, 4:52 AM · Parsing-Team--ARCHIVED, MediaWiki-Parser

May 12 2020

Yurik added a comment to T251759: Create a basic editing dialog for editing JSON tabular data as a table.

I think the right way to integrate spreadsheets and .tab editor is the proper copy/paste handling. It should be possible to paste a spreadsheet directly into the editor, and handle when the paste data is a table from spreadsheet, or has considerable size, or has multiple lines, or has tabs (or commas?), and treat it as a table of text, and ask the user how to deal with it: replace data or append to it. Other cases to handle would be shape mismatch (different number of columns), and type mismatch (e.g. paste data has text in the numeric column). I do not think the special cases like "replace just one column while keeping others intact" should be handled -- for that people can just copy/paste the whole table.
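
The paste checks described above could be sketched roughly as follows. This is only an illustration, not the editor's actual code: `parse_pasted_table`, `check_paste`, and the `column_types` list are hypothetical names, and the paste is assumed to arrive as tab-separated text, which is how spreadsheets usually put cells on the clipboard.

```python
import csv
import io


def parse_pasted_table(text):
    """Return rows if the pasted text looks like a table, else None.

    Assumption: a paste counts as a table when it contains tabs or
    spans more than one line.
    """
    if "\t" not in text and "\n" not in text.strip():
        return None
    reader = csv.reader(io.StringIO(text), delimiter="\t")
    return [row for row in reader if row]  # skip empty lines


def check_paste(rows, column_types):
    """List shape mismatches and text found in numeric columns."""
    problems = []
    for line_no, row in enumerate(rows, start=1):
        if len(row) != len(column_types):
            problems.append(
                f"row {line_no}: expected {len(column_types)} columns, got {len(row)}"
            )
            continue
        for value, col_type in zip(row, column_types):
            if col_type == "number" and value != "":
                try:
                    float(value)
                except ValueError:
                    problems.append(f"row {line_no}: {value!r} is not a number")
    return problems


rows = parse_pasted_table("a\t1\nb\tx\nc\n")
print(check_paste(rows, ["string", "number"]))
```

Only when the list of problems is empty would the editor go on to ask whether to replace the existing data or append to it.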

May 12 2020, 1:22 AM · MW-1.35-notes (1.35.0-wmf.32; 2020-05-12), covid-19, Commons-Datasets, JsonConfig
Yurik awarded T251759: Create a basic editing dialog for editing JSON tabular data as a table a Love token.
May 12 2020, 1:11 AM · MW-1.35-notes (1.35.0-wmf.32; 2020-05-12), covid-19, Commons-Datasets, JsonConfig

May 11 2020

Yurik added a comment to T251759: Create a basic editing dialog for editing JSON tabular data as a table.

I just tried it at https://commons.wikimedia.beta.wmflabs.org/wiki/Data:ISO15924/trans.tab -- looks great, and offers an amazing base for further features! Did you remove the "delete row" button?

May 11 2020, 5:45 PM · MW-1.35-notes (1.35.0-wmf.32; 2020-05-12), covid-19, Commons-Datasets, JsonConfig
Yurik added a comment to T250919: Add row/cell annotations to tabular data.

@Tgr i strongly oppose storing wiki markup inside columns because it makes the system far less portable and less stable. Wiki markup only works in the context of a specific wiki, and would render either differently or simply break -- templates, localization settings, and modules are wiki specific.

May 11 2020, 4:48 PM · Commons-Datasets, JsonConfig, covid-19

May 10 2020

Yurik added a comment to T234788: Commons limit on data is 2,048 kilobytes.

@AlexisJazz per my above comments -- it seems the system pretty-prints JSON, checks the size, and only then it stores it in the compact format. To make it work properly, the system should only validate json size after serializing it in compact form.

May 10 2020, 10:21 PM · Commons-Datasets
Yurik created T252322: GitHub mirror is missing Phabricator-based repos.
May 10 2020, 7:21 AM

May 9 2020

Yurik added a comment to T181319: Support external tabular datasets in WDQS.

@NavinoEvans I agree - feel free to take my implementation (which was already working for any CSV-style inputs), and extend/adapt it. Ideally, it should be merged upstream to the Blazegraph, so it should support any kind of CSVs. It may make sense to have either some sort of a wrapper for the tabular datasets as an extension to Blazegraph, or alternatively to extend the jsonconfig's API to be able to get CSV directly (which might be a better solution, as it would allow other, non-blazegraph usages)

May 9 2020, 5:22 PM · Wikimedia-Hackathon-2020, patch-welcome, Commons-Datasets, Discovery, Wikidata, Wikidata-Query-Service
John_Cummings awarded T181319: Support external tabular datasets in WDQS a Love token.
May 9 2020, 9:44 AM · Wikimedia-Hackathon-2020, patch-welcome, Commons-Datasets, Discovery, Wikidata, Wikidata-Query-Service
NavinoEvans awarded T181319: Support external tabular datasets in WDQS a Burninate token.
May 9 2020, 9:36 AM · Wikimedia-Hackathon-2020, patch-welcome, Commons-Datasets, Discovery, Wikidata, Wikidata-Query-Service

May 6 2020

Yurik added a comment to T248897: Create an easy-to-use editing interface for JSON tabular data pages.

Good details, thx. For the localized strings (both in the data and in metadata), I think the better way would be to have just a single "global" language selector somewhere at the top of the dialog, set to the user's language by default. Changing its value would only change what is being shown, but it won't do any data modifications.

May 6 2020, 4:43 PM · covid-19, Commons-Datasets, JsonConfig, Epic

May 5 2020

Yurik added a comment to T248897: Create an easy-to-use editing interface for JSON tabular data pages.

@Tchanders this looks awesome!!! My understanding is that for MVP it should just allow editing of the existing tables, not change the table structure, right? Also, the multi-lingual column might be somewhat difficult to represent visually - how do you think it should be done? Thanks again!

May 5 2020, 6:00 PM · covid-19, Commons-Datasets, JsonConfig, Epic

Apr 28 2020

Yurik added a comment to T122086: RFC: Sharing templates and modules between wikis - poor man's version (investigation).

Most wikis do want to protect highly used templates/modules. E.g. the Module:TNT would be used by most pages - you never want to make it editable by novice users. Thus, the bot would need to have the rights to edit that page.

Apr 28 2020, 9:51 PM · Tech-Ambassadors, covid-19, Pywikibot, Proposal, Pywikibot-RfCs
Yurik updated subscribers of T122086: RFC: Sharing templates and modules between wikis - poor man's version (investigation).
Apr 28 2020, 9:47 PM · Tech-Ambassadors, covid-19, Pywikibot, Proposal, Pywikibot-RfCs
Yurik added a comment to T122086: RFC: Sharing templates and modules between wikis - poor man's version (investigation).

@Tgr I just ran it a bit more, but the issue is that the bot would need a bot flag with admin rights (not possible globally, so one would have to apply to every wiki... painful). The bot source code is in https://github.com/nyurik/dibabel

Apr 28 2020, 7:35 PM · Tech-Ambassadors, covid-19, Pywikibot, Proposal, Pywikibot-RfCs
Yurik added a comment to T249419: RFC: Render data visualizations on the server.

If the job queue can call external services and wait for the rendered result, it would simplify the architecture a bit, allow for easier testing, and I am all for it. This assumes job queue itself can store large data blobs, rather than just short strings.

Apr 28 2020, 3:28 PM · covid-19, TechCom-RFC

Apr 22 2020

Yurik added a comment to T250919: Add row/cell annotations to tabular data.

At this point additional annotations could only be done as extra columns. This would work for many cases, but probably not all. Could you give some examples of where columns won't be enough, and the dedicated annotation system would be required?

Apr 22 2020, 5:49 PM · Commons-Datasets, JsonConfig, covid-19
Yurik added a comment to T248897: Create an easy-to-use editing interface for JSON tabular data pages.

@Tgr are there any good templates/examples of setting up a toolforge tool? Something that would show all the basic building blocks (creating a "site", setting up oauth, connecting to a specific wiki, etc), so that a volunteer could just write the task-specific javascript?

Apr 22 2020, 5:47 PM · covid-19, Commons-Datasets, JsonConfig, Epic
deryckchan awarded T134618: Epic: Implement spreadsheet-like cell editing for tabular data a Pterodactyl token.
Apr 22 2020, 1:00 AM · Epic, Commons-Datasets

Apr 17 2020

Yurik added a comment to T249419: RFC: Render data visualizations on the server.

@Milimetric thx for working on this! A few points that I would like further clarification on:

Apr 17 2020, 11:14 PM · covid-19, TechCom-RFC
Sophivorus awarded T134618: Epic: Implement spreadsheet-like cell editing for tabular data a Love token.
Apr 17 2020, 9:51 PM · Epic, Commons-Datasets

Apr 15 2020

Yurik added a comment to T249419: RFC: Render data visualizations on the server.

One more thing: the current custom-protocol:///?someparams=url-encoded for the data sources was a work around of the older Vega limitation. In Kibana, we used a much more successful approach:

Apr 15 2020, 8:36 PM · covid-19, TechCom-RFC
Yurik added a comment to T249419: RFC: Render data visualizations on the server.

Thx @Tgr and @Milimetric. Reiterating some possibly overlooked points:

  • In addition to data blobs (WDQS, data pages, API calls, etc), graphs could contain images (i.e. Commons or local wiki), and map image snapshots (generated by the maps snapshot service). See examples. If data is "prepackaged", some system would have to call all those services to assemble the needed data.
  • Newer Vega allows data loading as a result of a user action, or as a result of other data loading (e.g. if datasource A returns X, get datasource B)
  • MediaWiki PHP could try to parse the graph spec to get all data sources, and we could say that for the preview image, data must not be dynamic, but that still leaves images -- e.g. if the data has country codes, a graph could get the corresponding country flag by its name, e.g. File:Icons-flag-<countrycode>.png.
  • Many edits are done by non-javascript clients (bots), so requiring the client to submit some data when saving might introduce too many bugs and data mismatches.
  • Vega is not allowed to get any data from outside of the WMF network (uses a custom data/image loader for that).
Apr 15 2020, 8:33 PM · covid-19, TechCom-RFC

Apr 4 2020

Pppery awarded T122086: RFC: Sharing templates and modules between wikis - poor man's version (investigation) a Dislike token.
Apr 4 2020, 1:40 AM · Tech-Ambassadors, covid-19, Pywikibot, Proposal, Pywikibot-RfCs
Pppery awarded T137584: Allow Scribunto code to add a category without changing output a Dislike token.
Apr 4 2020, 12:28 AM · Platform Engineering, Patch-For-Review, MediaWiki-extensions-Scribunto

Feb 26 2020

Yurik added a comment to T174981: Add pageviews total counts to WDQS.

@Gehel let's define this amount of data, just for clarity. My back-of-the-envelope calculations:

Feb 26 2020, 4:52 PM · Analytics-Radar, Discovery, Wikidata-Query-Service, Wikidata

Feb 24 2020

Yurik added a comment to T128281: Privacy link does not point to localized policy.

hi @Jcross, can't recall what this is about, can close I guess

Feb 24 2020, 7:38 PM · WMF-Legal, Privacy

Feb 13 2020

Amire80 awarded T150031: Map with overlay is not being shown in Android app a Heartbreak token.
Feb 13 2020, 8:36 AM · Wikipedia-Android-App-Backlog, Android-app-Bugs

Feb 11 2020

Yurik added a comment to T244845: Handling of nulls in data namespace.

@Fae Python treats null as None when doing dict <-> json
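
A quick way to see this with the standard `json` module (the sample row is made up for illustration):

```python
import json

# JSON null becomes Python None when parsed...
row = json.loads('["Q42", null, 3.5]')
print(row[1] is None)  # True

# ...and None is written back out as null.
round_tripped = json.dumps(row)
print(round_tripped)  # ["Q42", null, 3.5]
```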

Feb 11 2020, 7:17 PM · Commons-Datasets

Jan 13 2020

Yurik updated subscribers of T212843: [EPIC] Access to Wikidata's lexicographical data from Wiktionaries and other WMF sites.

@Lucas_Werkmeister_WMDE thank you for all the hard work on this task! Do you have any approximate timeline of the getEntity() returning all lexeme forms, or is that already implemented? How significant of a challenge is it? I have been spending considerable time updating Lexicator bot to parse multiple Wiktionary languages, and handle multiple linguistic types, but all that work is mostly pointless until Wiktionaries can access that data.

Jan 13 2020, 8:45 PM · Wiktionary, Wikidata, Wikidata Lexicographical data
Yurik added a comment to T242596: Allow .tab and .map files to be added to categories.

This would be solved with T155290

Jan 13 2020, 6:04 PM · Commons-Datasets

Jan 4 2020

Yurik added a comment to T174981: Add pageviews total counts to WDQS.

I would guess this is mostly a devops task - orchestrate execution of an updating script. Here's the working implementation - https://github.com/Sophox/sophox/blob/master/osm2rdf/updatePageViewStats.py

Jan 4 2020, 6:52 PM · Analytics-Radar, Discovery, Wikidata-Query-Service, Wikidata
Yurik added a comment to T174981: Add pageviews total counts to WDQS.

@Tagishsimon this proposal would not edit wikidata. Instead, as part of the WDQS import process, it would upload pageviews in bulk from the pageview dump files directly into the Blazegraph index. It could do it every hour, and computation-wise it will be relatively inexpensive (i ran it as part of Sophox a few times).

Jan 4 2020, 3:02 AM · Analytics-Radar, Discovery, Wikidata-Query-Service, Wikidata

Dec 9 2019

Yurik awarded T220985: Complete support of Lexemes in QuickStatements a Mountain of Wealth token.
Dec 9 2019, 12:22 PM · Wikidata Lexicographical data, Wikidata

Dec 6 2019

Yurik added a comment to T235356: Fatal from ApiGraph->getGraphSpec: "Call to a member function getExtensionData() on boolean".

The fix seems sensible, thx!

Dec 6 2019, 5:54 PM · MW-1.35-notes (1.35.0-wmf.10; 2019-12-10), Editing-team, MediaWiki-extensions-Graph, Wikimedia-production-error

Nov 25 2019

Yurik added a comment to T238554: [Spike] Consider using imposm3 as the OSM replication system.

@MSantos there will be an OpenMapTiles community sync up this Thursday (10:30a ET), let me know if you would like to join in - we will be discussing how to move OMT forward, and possibly accommodate for Wikipedia needs. Email me YuriAstrakhan@gmail.com with your email addr.

Nov 25 2019, 9:20 PM · Product-Infrastructure-Team-Backlog (Kanban), Maps (Maps-data)

Nov 18 2019

Yurik added a comment to T238554: [Spike] Consider using imposm3 as the OSM replication system.

sure, sounds good, so how about this - if you create a page/ticket/... with some basic info and goals, I will add implementation details to it. Would that work?

Nov 18 2019, 7:34 PM · Product-Infrastructure-Team-Backlog (Kanban), Maps (Maps-data)
Yurik added a comment to T238554: [Spike] Consider using imposm3 as the OSM replication system.

@MSantos I am all for WMF to start using the OMT project rather than our first implementation, but I am not sure how valuable it will be to write an RFC -- so far WMF has not been too eager to support a proper map serving effort, relying mostly on semi-volunteer efforts of different enthusiasts to keep it around. Do you think writing an RFC will help in changing that? Or will it be just another dusty page on Phabricator?

Nov 18 2019, 6:41 PM · Product-Infrastructure-Team-Backlog (Kanban), Maps (Maps-data)
Yurik added a comment to T238554: [Spike] Consider using imposm3 as the OSM replication system.

Note that the openmaptiles project is rapidly improving, with the goal of generating tiles "on the fly" -- without the tile pregeneration step, and without mapnik. In other words, a vector tile (MVT) is generated by a single giant PostgreSQL query, and sent to the user on request (with some caching to speed up frequently-viewed regions). Adopting this approach will greatly simplify the current Wikipedia setup - no more Mapnik, no more Cassandra, easily scalable architecture (the more postgres replicas, the bigger the capacity).
P.S. And yes, OpenMapTiles is using Imposm3, together with a number of other good data sources like Natural Earth for low zooms.

Nov 18 2019, 4:43 PM · Product-Infrastructure-Team-Backlog (Kanban), Maps (Maps-data)

Nov 10 2019

Od1n awarded T137584: Allow Scribunto code to add a category without changing output a Like token.
Nov 10 2019, 4:36 PM · Platform Engineering, Patch-For-Review, MediaWiki-extensions-Scribunto

Nov 5 2019

Yurik added a comment to T234788: Commons limit on data is 2,048 kilobytes.

See my above comment, and @Lucas_Werkmeister_WMDE response -- while it stores things in the compact JSON form, the length is checked while it is in the "pretty-printed" format. A way to work around it might be to upload it to the server in the compact form via API, in which case it might get accepted.

Nov 5 2019, 9:27 PM · Commons-Datasets

Nov 2 2019

Arjunaraoc awarded T145688: [epic] Improve OSM-Wikipedia collaboration a Like token.
Nov 2 2019, 5:09 AM · Epic, Maps (Kartographer)

Oct 25 2019

Yurik added a comment to T211881: graphoid: Code stewardship request.

@dr0ptp4kt not just JS -- data sources could be far larger component to the graphs - e.g. one graph could mix together multiple data sources, including some tabular data pages (up to 2MB each), queries to Wikidata (currently broken btw -- lots of users are complaining because millions of population graphs are broken), a few images from commons, and even some mediawiki API calls. A full download could be in tens of megabytes, and some could be slow.

Oct 25 2019, 9:22 PM · Release-Engineering-Team-TODO (201908), Release-Engineering-Team (Code Health), Platform Team Legacy (Watching / External), Services (watching), Operations, Code-Stewardship-Reviews, Graphoid

Oct 13 2019

Yurik updated the task description for T224312: Improve LinguaLibreBot on Wikidata Lexeme.
Oct 13 2019, 6:49 PM · Wikidata, Wikidata Lexicographical data, Lingua Libre
Yurik updated the task description for T224312: Improve LinguaLibreBot on Wikidata Lexeme.
Oct 13 2019, 6:44 PM · Wikidata, Wikidata Lexicographical data, Lingua Libre

Oct 7 2019

Yurik added a comment to T234788: Commons limit on data is 2,048 kilobytes.

@Lucas_Werkmeister_WMDE thanks, but this is very surprising, I was 99.99% certain it was storing it pretty-printed... Either that, or it did the size limit check in the pretty-printed version before storing. Would it be possible to do a direct SQL query for that data, and also to run a MAX( LEN( data )) to see the largest page in the Data namespace on Commons? Thanks for checking!

Oct 7 2019, 9:54 PM · Commons-Datasets
Yurik added a comment to T234788: Commons limit on data is 2,048 kilobytes.

Correct, this is the tabular data hitting the 2MB page limit. One relatively simple solution would be to fix the JsonConfig base class to store data as "compact", rather than pretty-printed JSON (there shouldn't be any externally visible consequences because JSON is always reformatted before saving). That would immediately increase max storage by a significant percentage, especially for .map (geojson tends to have a lot of small arrays, so when they are broken up across lines and prefixed with tons of spaces, the size grows to several times the original). I suspect Wikibase has had to solve a similar problem storing their items in the MW engine.
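
To get a feel for the size difference, here is a small sketch that serializes the same data both ways. The coordinate list is made-up sample data, and the 4-space indent is an assumption about how the pretty-printer formats output, so the real ratio on Commons may differ.

```python
import json

# Made-up GeoJSON-like fragment: many small coordinate arrays.
data = {
    "type": "LineString",
    "coordinates": [[i * 0.5, i * 0.25] for i in range(1000)],
}

# Pretty-printed: every array element goes on its own indented line.
pretty = json.dumps(data, indent=4)
# Compact: no whitespace between tokens at all.
compact = json.dumps(data, separators=(",", ":"))

pretty_size = len(pretty.encode("utf-8"))
compact_size = len(compact.encode("utf-8"))
print(pretty_size, compact_size, round(pretty_size / compact_size, 2))
```

Both strings parse back to the same data, so a size check done on the compact form would not change what readers or API clients see.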

Oct 7 2019, 4:18 PM · Commons-Datasets

Oct 6 2019

Kolossos awarded T138057: Epic: Enable <mapframe> on Wikipedia a Love token.
Oct 6 2019, 9:28 AM · Maps, Epic

Sep 27 2019

Yurik added a comment to T212843: [EPIC] Access to Wikidata's lexicographical data from Wiktionaries and other WMF sites.

@Fnielsen i am not sure I understand what that query does, could you elaborate? Especially I am confused why you look at the forms -- from the perspective of Wiktionary, you request a single Lexeme, not individual forms. (btw, the query times out for me).

Sep 27 2019, 5:15 PM · Wiktionary, Wikidata, Wikidata Lexicographical data
Yurik added a comment to T212843: [EPIC] Access to Wikidata's lexicographical data from Wiktionaries and other WMF sites.

P.S. @Fnielsen does bring up a valid point about various linked lexemes, and that might be useful -- for example, if a lexeme lists another lexeme as being a synonym, it would be good to show it as a word rather than an L-number.

Sep 27 2019, 3:25 PM · Wiktionary, Wikidata, Wikidata Lexicographical data
Yurik added a comment to T212843: [EPIC] Access to Wikidata's lexicographical data from Wiktionaries and other WMF sites.

@Lydia_Pintscher most of the Wiktionary pages have just one corresponding lexeme - and that's all I would expect to load.

Sep 27 2019, 3:17 PM · Wiktionary, Wikidata, Wikidata Lexicographical data

Sep 23 2019

Yurik added a comment to T212843: [EPIC] Access to Wikidata's lexicographical data from Wiktionaries and other WMF sites.

@RexxS you do bring up a valid point about watchlist. The minor difference here is that a lexeme is tied to a specific language, so it is less likely to have content not relevant to that one language / wiktionary. The only exception might be the description of senses in other languages. TBH, I am not sure that adding sense description in a non-native language is a scalable solution -- we are repeating the issue of sitelinks, where every wiki page referenced all other wiki pages on the same subject. But this is a separate discussion, unrelated to this ticket.

Sep 23 2019, 11:26 PM · Wiktionary, Wikidata, Wikidata Lexicographical data

Sep 14 2019

Yurik updated the task description for T232930: Unable to create lexeme talk pages (permission error).
Sep 14 2019, 7:20 PM · Wikidata, Wikidata Lexicographical data
Yurik created T232930: Unable to create lexeme talk pages (permission error).
Sep 14 2019, 7:18 PM · Wikidata, Wikidata Lexicographical data

Sep 13 2019

Yurik added a comment to T212843: [EPIC] Access to Wikidata's lexicographical data from Wiktionaries and other WMF sites.

P.S. to sum up -- Wiktionary needs just a single Lua function for the minimum viable product: getEntity('L100000') that simply returns the whole Lexeme JSON. Everything else is optional.

Sep 13 2019, 7:14 PM · Wiktionary, Wikidata, Wikidata Lexicographical data
Yurik added a comment to T212843: [EPIC] Access to Wikidata's lexicographical data from Wiktionaries and other WMF sites.

I have imported some Russian nouns (~20,000 so far, but will be more soon), plus added links from Wiktionary's pages to the corresponding Lexemes. I think the simplest use case for Lexemes would be to allow Wiktionary Lua script to be able to load Lexeme by its ID. This will instantly make Lexemes useful to Wiktionary because the Lua script will be able to:

  • generate table of the word forms
  • generate etymology and pronunciation sections
  • do the above for every lexeme if more than one is used on the page.
Sep 13 2019, 2:53 AM · Wiktionary, Wikidata, Wikidata Lexicographical data

Sep 11 2019

Yurik added a comment to T232670: `maxlag` is ignored, instead returning `readonly` error.

@Anomie thx for the explanation. Several weeks ago my bot was banned for a short time because it didn't have the maxlag param. Are you saying that it was a mistake because WMF MW doesn't actually pay any attention to it? Also, would it be possible to update the documentation to indicate what a proper bot should do when running on WMF servers? Thanks!

Sep 11 2019, 9:07 PM · Platform Engineering, MediaWiki-API
Yurik updated the task description for T232670: `maxlag` is ignored, instead returning `readonly` error.
Sep 11 2019, 8:27 PM · Platform Engineering, MediaWiki-API
Yurik created T232670: `maxlag` is ignored, instead returning `readonly` error.
Sep 11 2019, 8:25 PM · Platform Engineering, MediaWiki-API
Yurik created T232557: Lexeme's Grammatical features are created in random order.
Sep 11 2019, 2:27 AM · Wikidata, Wikidata Lexicographical data

Sep 3 2019

Yurik added a comment to T231741: Allow displaying the one pageview number of a page in plain text form.

In theory it should be fairly straightforward to create a <graph> that outputs a single number, but that would still be an image, not text (and it might look slightly off - e.g. fuzzier or in different font)

Sep 3 2019, 6:19 PM · MediaWiki-extensions-Graph, MediaWiki-Templates

Aug 21 2019

Yurik added a comment to T98940: graphoid fails if page_props is out of sync with parser cache, or on old revisions of a page.

However, that seems pretty unlikely, because Graphoid only sends these API requests in order to graph requests that come from parsed HTML, so if we're getting an API request we probably recently displayed the page and the parser cache should still be warm.

Aug 21 2019, 4:35 AM · MW-1.34-notes (1.34.0-wmf.20; 2019-08-27), Graphoid
Yurik closed T222806: Security Review for Vega 5 and Vega-Lite JavaScript Libraries as Resolved.

Thanks, closing for now, waiting for the Vega team and the students.

Aug 21 2019, 3:23 AM · secscrum, Security Readiness Reviews, Upstream, JavaScript, Maps, MediaWiki-extensions-Graph

Aug 18 2019

Yurik added a comment to T98940: graphoid fails if page_props is out of sync with parser cache, or on old revisions of a page.

@Catrope thanks for tackling it! I always thought parser cache is non-persisted, so if a page does not get any edits in 2 months, the relevant data might not be there?

Aug 18 2019, 5:13 PM · MW-1.34-notes (1.34.0-wmf.20; 2019-08-27), Graphoid

Aug 16 2019

Yurik added a comment to T134617: Implement CSV/TSV import/export for tabular data set.

This is awesome, thank you @TheDJ and @JeanFred ! One kinda important issue -- it breaks on localized columns, e.g. Data:I18n/No_globals.tab -- CSV outputs empty values, and Excel shows English (I think).

Aug 16 2019, 9:51 PM · Wikimania-Hackathon-2019, Commons-Datasets