Page MenuHomePhabricator

Yurik (Yuri Astrakhan)
User

Projects (8)

Today

  • Clear sailing ahead.

Tomorrow

  • Clear sailing ahead.

Sunday

  • Clear sailing ahead.

User Details

User Since
Oct 7 2014, 6:43 PM (522 w, 2 d)
Availability
Available
LDAP User
Yurik
MediaWiki User
Yurik [ Global Accounts ]

Recent Activity

Apr 25 2024

Pppery awarded T47116: Add all global sites as valid interwiki links a Dislike token.
Apr 25 2024, 11:14 PM · Wikimedia-Interwiki-links

Nov 18 2023

valerio.bozzolan awarded T181319: Support external tabular datasets in WDQS a Doubloon token.
Nov 18 2023, 3:31 PM · Wikimedia-Hackathon-2020, patch-welcome, Commons-Datasets, Discovery-ARCHIVED, Wikidata-Query-Service, Wikidata

Oct 24 2023

Bean49 awarded T141864: Add parser function for associated wikidata item ID a Like token.
Oct 24 2023, 9:03 AM · Wikibase-Lua, Wikidata Integration in Wikimedia projects, Wikidata, MediaWiki-extensions-WikibaseClient

May 12 2023

Yurik added a comment to T336595: Restrict editing of Vega spec to a small set of users.

@Tgr I am a bit surprised that Vega cannot be made safe -- to my knowledge, the primary authors of Vega (especially Jeff) were specifically trying to make it as safe as possible.

May 12 2023, 5:43 PM · Security, MediaWiki-extensions-Graph

May 10 2023

Yurik added a comment to T334940: All Graphs broken on Wikimedia wikis (due to security issue T336556).

A simpler solution might be similar to what I did in Elasticsearch's Kibana -- the url does not have to be a string - it can be an object too. As such, it can easily be "strongly typed", e.g. "url": { "type": "tabular", "page": "{{{table}}}" }. This allows for no-escaping, easy to test approach. BTW, @Jdlrobson I had to deal with a few edge cases with fitting a graph into a box - you may want to examine the code there for that too.

May 10 2023, 12:30 AM · User-zeljkofilipin, Regression, User-notice, Tech Ambassadors & Translators, MediaWiki-extensions-Graph

Apr 28 2023

Yurik added a comment to T334940: All Graphs broken on Wikimedia wikis (due to security issue T336556).

Correct - 2->3 was by far the most breaking. The following major version updates had mostly to do with the Vega lib API changes rather than the Vega syntax changes.

Apr 28 2023, 6:02 PM · User-zeljkofilipin, Regression, User-notice, Tech Ambassadors & Translators, MediaWiki-extensions-Graph

Apr 19 2023

Yurik added a comment to T165118: Support Vega 5.0+.

VegaLite is an easier way to draw graphs, and supports a lot of simpler cases. The only downside is a bigger JS download

Apr 19 2023, 7:55 PM · Community-Wishlist-Survey-2023, Community-Wishlist-Survey-2022, MediaWiki-extensions-Graph
Yurik added a comment to T334895: XSS via Graph extension.

I think it should be fairly straightforward/easy to just use the latest Vega v5 with the disabled unsafe-eval, see https://github.com/vega/vega/blob/main/docs/usage/interpreter.md and let the community fix the broken graphs. It offers a clear and fast path forward, and solves a lot of bit rot issues.

Apr 19 2023, 12:25 AM · SecTeam-wikimedia-project-event, SecTeam-Processed, WMDE-TechWish-Sprint-2023-04-05, Editing-team, Vuln-XSS, MediaWiki-extensions-Graph, Security, Security-Team
Yurik added a comment to T334940: All Graphs broken on Wikimedia wikis (due to security issue T336556).

I would recommend to simply update graph ext to use the latest vega v5, and to make it use the slower interpreter (csp safe) instead of the eval based one.

The idea is good, but we will not be able to convert all graphs quickly.

Apr 19 2023, 12:20 AM · User-zeljkofilipin, Regression, User-notice, Tech Ambassadors & Translators, MediaWiki-extensions-Graph

Apr 18 2023

Yurik added a comment to T334940: All Graphs broken on Wikimedia wikis (due to security issue T336556).

I would recommend to simply update graph ext to use the latest vega v5, and to make it use the slower interpreter (csp safe) instead of the eval based one.

Apr 18 2023, 11:43 PM · User-zeljkofilipin, Regression, User-notice, Tech Ambassadors & Translators, MediaWiki-extensions-Graph
Yurik removed a member for Maps: Yurik.
Apr 18 2023, 4:21 PM

Jan 11 2023

Moebeus awarded T159106: Show P31 in the Wikidata search results a Like token.
Jan 11 2023, 12:33 PM · Discovery-ARCHIVED, patch-welcome, MediaWiki-extensions-WikibaseRepository, Wikidata

Aug 22 2022

Yurik added a comment to T249419: RFC: Render data visualizations on the server.

Adaption is stalled by having an ancient version that is very limited in functionality and documentation. Is there a plan to get to the proper vega version?

Aug 22 2022, 6:28 PM · Wikimedia-Performance-recommendation, JavaScript, MediaWiki-extensions-Graph, covid-19, TechCom-RFC

Feb 18 2022

awight awarded T148605: RFC: Refactor Kartotherian to use an established plugin system a Like token.
Feb 18 2022, 8:56 AM · Proposal, Maps (Kartotherian)

Jan 24 2022

Yurik added a comment to T122086: Use bot to share templates and modules between wikis.

Yes, the service needs some reworking. I might just revert it to an older more stable version though.

Jan 24 2022, 9:14 PM · Tech Ambassadors & Translators, covid-19, Pywikibot, Proposal, Pywikibot-RfCs

Aug 18 2021

Yurik added a comment to T289101: Bring WMF map tile feature sets into line with OSM default feature sets.

@Pikne this is not accurate -- while it does help to reduce the number of OSM features to some extent, it is still far better to include more. What we do want is not to keep small number of features, but rather style map in such a way that it doesn't get in the way of the overlayed content. Here are some sample styles OpenMapTiles has, and I think Positron style would be a better fit than the bright style.:

Aug 18 2021, 1:44 PM · Maps (Map-Styles)
Yurik added a comment to T289101: Bring WMF map tile feature sets into line with OSM default feature sets.

Eventually I do hope Wikipedia maps will use OpenMapTiles project (with some customization like no admin borders if so desired). There are a lot of efforts by numerous individuals spent on optimizing OSM data extraction, and eventually it hopefully will cover every OSM feature, possibly as extensions (there is a proposal for such arrangement as well).

Aug 18 2021, 5:19 AM · Maps (Map-Styles)

May 19 2021

IKhitron awarded T127682: Make code editor understand JSON with comments and trailing commas a Love token.
May 19 2021, 5:09 PM · Upstream, CodeEditor

Jan 16 2021

Marsupium awarded T138057: Epic: Enable <mapframe> on Wikipedia a Love token.
Jan 16 2021, 10:03 AM · Maps, Epic

Jan 8 2021

So9q awarded T181319: Support external tabular datasets in WDQS a Like token.
Jan 8 2021, 7:16 PM · Wikimedia-Hackathon-2020, patch-welcome, Commons-Datasets, Discovery-ARCHIVED, Wikidata-Query-Service, Wikidata

Dec 31 2020

Moebeus awarded T181319: Support external tabular datasets in WDQS a Like token.
Dec 31 2020, 7:15 PM · Wikimedia-Hackathon-2020, patch-welcome, Commons-Datasets, Discovery-ARCHIVED, Wikidata-Query-Service, Wikidata

Oct 10 2020

Yurik added a comment to T265176: Refactor react.i18n to allow functional usage.

Not exactly, even though this is also possible. There are cases in my app when i could be using multiple Banana contexts in the same place, e.g. showing text/links in different languages, and being able to call createMessage( banana, messageKey, ...placeholders ) with my own custom-configured banana instance would be very useful. Another case is that sometimes I need to override the way banana itself functions by overriding default implementation -- example is the well known MW qqx language code that allows MW to show debug messages instead of the real i18n resolution - for that I would also need to be able to provide custom banana handler. Thus, separating <Message> from the actual code that implements react substitutions would be highly useful to me.

Oct 10 2020, 5:28 AM · React-i18n, I18n

Oct 9 2020

Yurik updated the task description for T265176: Refactor react.i18n to allow functional usage.
Oct 9 2020, 7:56 PM · React-i18n, I18n
Yurik added a project to T265176: Refactor react.i18n to allow functional usage: I18n.
Oct 9 2020, 7:55 PM · React-i18n, I18n
Yurik created T265176: Refactor react.i18n to allow functional usage.
Oct 9 2020, 7:54 PM · React-i18n, I18n

Oct 5 2020

Yurik added a comment to T264547: The "take" util unable to take ownership of files.

@bd808 thanks for the explanation. This surfaced because i was doing an rsync directly from my laptop via ssh, and since i cannot login as dibabel directly, i will need to do some extra steps. Would it be possible for take to link to some documentation in its errors about this? Thx!

Oct 5 2020, 4:41 PM · Toolforge
Yurik updated the task description for T264547: The "take" util unable to take ownership of files.
Oct 5 2020, 5:24 AM · Toolforge
Yurik renamed T264547: The "take" util unable to take ownership of files from take util unable to take ownership of files to The "take" util unable to take ownership of files.
Oct 5 2020, 5:24 AM · Toolforge

Oct 4 2020

Yurik created T264547: The "take" util unable to take ownership of files.
Oct 4 2020, 9:16 PM · Toolforge

Sep 1 2020

Yurik added a comment to T260746: Clarify Terms of Use on whether using Toolforge as a proxy for GitHub Actions is permitted.

Should the terms page be updated, possibly with a link to this discussion?

Sep 1 2020, 4:51 AM · Toolforge, cloud-services-team (Kanban)

Aug 29 2020

Yurik added a comment to T261451: Add Wikidata support to jawikivoyage.

Just caught this while comparing sitematrix output against the list of allowed sites values. Is there a reason why these are not coming from the same source? Is there a way to get the list that wikibase is using via api?

Aug 29 2020, 3:46 PM · Wikidata-Campsite (Wikidata-Campsite-Iteration-∞ (On Hold)), User-Ladsgroup, Wikidata

Aug 28 2020

Akuckartz awarded T154071: Allow non-CC0 licensed data for datasets a Like token.
Aug 28 2020, 7:24 PM · WMF-Legal, Commons-Datasets

Aug 27 2020

Yurik closed T260787: Add project Dibabel to translatewiki.net as Resolved.

Closing as it is clearly complete and works great. Thanks everyone!

Aug 27 2020, 4:41 PM · Language-Team (Language-2020-July-September), translatewiki.net

Aug 26 2020

Yurik added a comment to T261306: TranslateWiki should show short help when PLURAL is used.

I think the current approach assumes that everyone has read https://translatewiki.net/wiki/FAQ

Aug 26 2020, 10:09 PM · translatewiki.net
Yurik created T261306: TranslateWiki should show short help when PLURAL is used.
Aug 26 2020, 12:11 PM · translatewiki.net

Aug 25 2020

Yurik added a comment to T122086: Use bot to share templates and modules between wikis.

@Candalua there is a number of bugs and features I would like to fix/add before advertising it further. For example, the service does not refresh page state too often enough, it does not show dependency status (e.g. if one module depends on another which is stale, it doesn't warn about it). Everyone is welcome to use it though - shouldn't cause any major issues :)

Aug 25 2020, 1:57 PM · Tech Ambassadors & Translators, covid-19, Pywikibot, Proposal, Pywikibot-RfCs
Yurik added a comment to T260787: Add project Dibabel to translatewiki.net.

@abi_ thanks, done! Let me know when you can push the updates to the repo :)

Aug 25 2020, 1:45 PM · Language-Team (Language-2020-July-September), translatewiki.net
Yurik added a comment to T260787: Add project Dibabel to translatewiki.net.

@abi_ would it be possible to do a massive rename to get rid of the dibabel- prefix in all strings? I just realized that the tool ended up with two of them in the translatewiki, and since this is a fully standalone tool, there is no point in having a prefix everywhere in code either.

Aug 25 2020, 6:58 AM · Language-Team (Language-2020-July-September), translatewiki.net
Yurik added a comment to T122086: Use bot to share templates and modules between wikis.

There is now a tool for this... https://dibabel.toolforge.org/

Aug 25 2020, 6:57 AM · Tech Ambassadors & Translators, covid-19, Pywikibot, Proposal, Pywikibot-RfCs
Yurik closed T252453: Create a Toolforge tool for syncing templates and Lua modules between wikis as Resolved.

Technically this is done. See https://dibabel.toolforge.org/

Aug 25 2020, 6:55 AM · Technical-Tool-Request, covid-19

Aug 24 2020

Yurik added a comment to T260787: Add project Dibabel to translatewiki.net.

Thank you everyone for getting it solved! @abi_ could you make it more often than twice a week? I'm pretty sure most contributors would love to see their changes going live asap (the instant gratification is important is FOSS :) ) -- and if there are no changes, it would simply be a noop, i don't think running a bot is very taxing on the system :) Thx!

Aug 24 2020, 3:27 PM · Language-Team (Language-2020-July-September), translatewiki.net

Aug 20 2020

Yurik added a comment to T260787: Add project Dibabel to translatewiki.net.

@abi_ thanks, I reviewed them, made a few minor corrections to the wiki pages, and added translatewiki as the contributor. Pushing to master is fine. If we make some changes to the master's i18n dir (either en.json or some other one), will you automatically pick that up before pushing?

Aug 20 2020, 3:54 PM · Language-Team (Language-2020-July-September), translatewiki.net
Yurik added a comment to T260746: Clarify Terms of Use on whether using Toolforge as a proxy for GitHub Actions is permitted.

@Jack_who_built_the_house well, if GitHub has your ssh key, in theory any software that can get access to it could establish any kind of connection, including a regular session to run arbitrary commands. But if you establish a tunnel and unset the key right away, any subsequently ran software can only create socket connections, not run commands (at least that's my understanding).

Aug 20 2020, 2:11 AM · Toolforge, cloud-services-team (Kanban)
Yurik added a comment to T260746: Clarify Terms of Use on whether using Toolforge as a proxy for GitHub Actions is permitted.

@bd808 thanks for the good example. It does provide a workaround, but it also highlights some issues:

Aug 20 2020, 12:36 AM · Toolforge, cloud-services-team (Kanban)

Aug 19 2020

Yurik added a comment to T260746: Clarify Terms of Use on whether using Toolforge as a proxy for GitHub Actions is permitted.

@bd808 I am not sure if the attack surface is significant, and also I don't believe this is relevant

Aug 19 2020, 11:04 PM · Toolforge, cloud-services-team (Kanban)
Yurik added a comment to T260746: Clarify Terms of Use on whether using Toolforge as a proxy for GitHub Actions is permitted.

I should also say I'm skeptical about using Toolforge resources as a proxy for a proprietary/non-free service, getting the advantages of the Toolforge platform but bypassing the open source code requirements.

Aug 19 2020, 7:33 AM · Toolforge, cloud-services-team (Kanban)
Yurik added a comment to T260746: Clarify Terms of Use on whether using Toolforge as a proxy for GitHub Actions is permitted.

Goal

To automate the build & publish process, making it as simple and stable as possible. Desired workflow:

  • User publishes a new release of their tool
  • System automatically builds, tests, and publishes the result to the wiki
Aug 19 2020, 7:02 AM · Toolforge, cloud-services-team (Kanban)

Aug 8 2020

Yurik awarded T256482: kubectl logs fails on /usr/lib/uwsgi/plugins/python_plugin.so a Haypence token.
Aug 8 2020, 12:06 AM · cloud-services-team, Toolforge, Kubernetes

Jul 24 2020

Yurik added a comment to T258827: Return response headers in oauthclient->makeOAuthCall().

@Tgr forwarding an api call from a complex js client - if the response is a non-200, the client may decide to handle it somehow different (I assume non-200 responses are still returned in some form as content, but they won't be JSON).

Jul 24 2020, 11:35 PM · MediaWiki-extensions-OAuth
Yurik created T258827: Return response headers in oauthclient->makeOAuthCall().
Jul 24 2020, 7:42 PM · MediaWiki-extensions-OAuth

Jul 22 2020

Yurik added a comment to T258596: Gathering data from multiple Wikimedia Commons Tabular Entries.

Sure, Vega allows you to load from more than one data source, as long as you explicitly list them all. Afterwards, you can create a new data source that joins all the other ones (i think it was available in Vega 2.0 that Wikipedia is using)

Jul 22 2020, 6:28 PM · Commons-Datasets, MediaWiki-extensions-Graph

Jul 21 2020

Yurik awarded T165217: HTMLCheckMatrix cannot be preloaded a Like token.
Jul 21 2020, 2:51 AM · MediaWiki-HTMLForm

May 20 2020

Yurik placed T56221: Support for text/syntax/markup driven or WYSIWYG editable charts, diagrams, graphs, flowcharts etc. (Identify, develop, review and deploy extension on Wikimedia wikis to add) up for grabs.
May 20 2020, 7:03 PM · Commons, Wikimedia-Extension-setup
Yurik placed T134237: Graphoid returns a 400 on MW API time-out up for grabs.
May 20 2020, 7:03 PM · Platform Team Legacy (Watching / External), Services (watching), SRE, Graphoid
Yurik placed T139399: Fix Graphoid rendering issue + warning up for grabs.
May 20 2020, 7:02 PM · Graphoid
Yurik placed T166667: Whitelist .wmflabs.org for wikiraw:// protocol up for grabs.
May 20 2020, 7:02 PM · MediaWiki-extensions-Graph

May 19 2020

Volker_E awarded T9356: User-specified HTML IDs can be the same as interface IDs a Mountain of Wealth token.
May 19 2020, 4:52 AM · HTML5, MediaWiki-Parser

May 12 2020

Yurik added a comment to T251759: Create a basic editing dialog for editing JSON tabular data as a table.

I think the right way to integrate spreadsheets and .tab editor is the proper copy/paste handling. It should be possible to paste a spreadsheet directly into the editor, and handle when the paste data is a table from spreadsheet, or has considerable size, or has multiple lines, or has tabs (or commas?), and treat it as a table of text, and ask the user how to deal with it: replace data or append to it. Other cases to handle would be shape mismatch (different number of columns), and type mismatch (e.g. paste data has text in the numeric column). I do not think the special cases like "replace just one column while keeping others intact" should be handled -- for that people can just copy/paste the whole table.

May 12 2020, 1:22 AM · MW-1.35-notes (1.35.0-wmf.32; 2020-05-12), covid-19, Commons-Datasets, JsonConfig
Yurik awarded T251759: Create a basic editing dialog for editing JSON tabular data as a table a Love token.
May 12 2020, 1:11 AM · MW-1.35-notes (1.35.0-wmf.32; 2020-05-12), covid-19, Commons-Datasets, JsonConfig

May 11 2020

Yurik added a comment to T251759: Create a basic editing dialog for editing JSON tabular data as a table.

I just tried it at https://commons.wikimedia.beta.wmflabs.org/wiki/Data:ISO15924/trans.tab -- looks great, and offers an amazing base for further features! Did you remove the "delete row" button?

May 11 2020, 5:45 PM · MW-1.35-notes (1.35.0-wmf.32; 2020-05-12), covid-19, Commons-Datasets, JsonConfig
Yurik added a comment to T250919: Add row/cell annotations to tabular data.

@Tgr i strongly oppose storing wiki markup inside columns because it makes the system far less portable and less stable. Wiki markup only works in the context of a specific wiki, and would render either differently or simply break -- templates, localization settings, and modules are wiki specific.

May 11 2020, 4:48 PM · Commons-Datasets, JsonConfig, covid-19

May 10 2020

Yurik added a comment to T234788: Commons limit on data is 2,048 kilobytes.

@AlexisJazz per my above comments -- it seems the system pretty-prints JSON, checks the size, and only then it stores it in the compact format. To make it work properly, the system should only validate json size after serializing it in compact form.

May 10 2020, 10:21 PM · Commons-Datasets
Yurik created T252322: GitHub mirror is missing Phabricator-based repos.
May 10 2020, 7:21 AM

May 9 2020

Yurik added a comment to T181319: Support external tabular datasets in WDQS.

@NavinoEvans I agree - feel free to take my implementation (which was already working for any CSV-style inputs), and extend/adapt it. Ideally, it should be merged upstream to the Blazegraph, so it should support any kind of CSVs. It may make sense to have either some sort of a wrapper for the tabular datasets as an extension to Blazegraph, or alternatively to extend the jsonconfig's API to be able to get CSV directly (which might be a better solution, as it would allow other, non-blazegraph usages)

May 9 2020, 5:22 PM · Wikimedia-Hackathon-2020, patch-welcome, Commons-Datasets, Discovery-ARCHIVED, Wikidata-Query-Service, Wikidata
John_Cummings awarded T181319: Support external tabular datasets in WDQS a Love token.
May 9 2020, 9:44 AM · Wikimedia-Hackathon-2020, patch-welcome, Commons-Datasets, Discovery-ARCHIVED, Wikidata-Query-Service, Wikidata
NavinoEvans awarded T181319: Support external tabular datasets in WDQS a Burninate token.
May 9 2020, 9:36 AM · Wikimedia-Hackathon-2020, patch-welcome, Commons-Datasets, Discovery-ARCHIVED, Wikidata-Query-Service, Wikidata

May 6 2020

Yurik added a comment to T248897: Create an easy-to-use editing interface for JSON tabular data pages.

Good details, thx. For the localized strings (both in the data and in metadata), I think the better way would be to have just a single "global" language selector somewhere at the top of the dialog, set to the user's language by default. Changing its value would only change what is being shown, but it won't do any data modifications.

May 6 2020, 4:43 PM · covid-19, Commons-Datasets, JsonConfig, Epic

May 5 2020

Yurik added a comment to T248897: Create an easy-to-use editing interface for JSON tabular data pages.

@Tchanders this looks awesome!!! My understanding is that for MVP it should just allow editing of the existing tables, not change the table structure, right? Also, the multi-lingual column might be somewhat difficult to represent visually - how do you think it should be done? Thanks again!

May 5 2020, 6:00 PM · covid-19, Commons-Datasets, JsonConfig, Epic

Apr 28 2020

Yurik added a comment to T122086: Use bot to share templates and modules between wikis.

Most wikis do want to protect highly used templates/modules. E.g. the Module:TNT would be used by most pages - you never want to make it editable by novice users. Thus, the bot would need to have the rights to edit that page.

Apr 28 2020, 9:51 PM · Tech Ambassadors & Translators, covid-19, Pywikibot, Proposal, Pywikibot-RfCs
Yurik updated subscribers of T122086: Use bot to share templates and modules between wikis.
Apr 28 2020, 9:47 PM · Tech Ambassadors & Translators, covid-19, Pywikibot, Proposal, Pywikibot-RfCs
Yurik added a comment to T122086: Use bot to share templates and modules between wikis.

@Tgr I just ran it a bit more, but the issue is that the bot would need a bot flag with admin rights (not possible globally, so one would have to apply to every wiki... painful). The bot source code is in https://github.com/nyurik/dibabel

Apr 28 2020, 7:35 PM · Tech Ambassadors & Translators, covid-19, Pywikibot, Proposal, Pywikibot-RfCs
Yurik added a comment to T249419: RFC: Render data visualizations on the server.

If the job queue can call external services and wait for the rendered result, it would simplify the architecture a bit, allow for easier testing, and I am all for it. This assumes job queue itself can store large data blobs, rather than just short strings.

Apr 28 2020, 3:28 PM · Wikimedia-Performance-recommendation, JavaScript, MediaWiki-extensions-Graph, covid-19, TechCom-RFC

Apr 22 2020

Yurik added a comment to T250919: Add row/cell annotations to tabular data.

At this point additional annotations could only be done as extra columns. This would work for many cases, but probably not all. Could you give some examples of where columns won't be enough, and the dedicated annotation system would be required?

Apr 22 2020, 5:49 PM · Commons-Datasets, JsonConfig, covid-19
Yurik added a comment to T248897: Create an easy-to-use editing interface for JSON tabular data pages.

@Tgr are there any good templates/examples of setting up a toolforge tool? Something that would show all the basic building blocks (creating a "site", setting up oauth, connecting to a specific wiki, etc), so that a volunteer could just write the task-specific javascript?

Apr 22 2020, 5:47 PM · covid-19, Commons-Datasets, JsonConfig, Epic
deryckchan awarded T134618: Epic: Implement spreadsheet-like cell editing for tabular data a Pterodactyl token.
Apr 22 2020, 1:00 AM · Epic, Commons-Datasets

Apr 17 2020

Yurik added a comment to T249419: RFC: Render data visualizations on the server.

@Milimetric thx for working on this! A few points that I would like further clarification on:

Apr 17 2020, 11:14 PM · Wikimedia-Performance-recommendation, JavaScript, MediaWiki-extensions-Graph, covid-19, TechCom-RFC
Sophivorus awarded T134618: Epic: Implement spreadsheet-like cell editing for tabular data a Love token.
Apr 17 2020, 9:51 PM · Epic, Commons-Datasets

Apr 15 2020

Yurik added a comment to T249419: RFC: Render data visualizations on the server.

One more thing: the current custom-protocol:///?someparams=url-encoded for the data sources was a work around of the older Vega limitation. In Kibana, we used a much more successful approach:

Apr 15 2020, 8:36 PM · Wikimedia-Performance-recommendation, JavaScript, MediaWiki-extensions-Graph, covid-19, TechCom-RFC
Yurik added a comment to T249419: RFC: Render data visualizations on the server.

Thx @Tgr and @Milimetric. Reiterating some possibly overlooked points:

  • In addition to data blobs (WDQS, data pages, API calls, etc), graphs could contain images (i.e. Commons or local wiki), and map image snapshots (generated by the maps snapshot service). See examples. If data is "prepackaged", some system would have to call all those services to assemble the needed data.
  • Newer Vega allows data loading as a result of a user action, or as a result of other data loading (e.g. if datasource A returns X, get datasource B)
  • MediaWiki PHP could try parse the graph spec to get all data sources, and we could say that for the preview image, data must not be dynamic, but that still leaves images -- e.g. if the data has country codes, a graph could get corresponding country flag by its name, e.g. File:Icons-flag-<countrycode>.png.
  • Many edits are done by non-javascript clients (bots), so requiring the client to submit some data when saving might introduce too many bugs and data mismatches.
  • Vega is not allowed to get any data from outside of the WMF network (uses a custom data/image loader for that).
Apr 15 2020, 8:33 PM · Wikimedia-Performance-recommendation, JavaScript, MediaWiki-extensions-Graph, covid-19, TechCom-RFC

Apr 4 2020

Pppery awarded T122086: Use bot to share templates and modules between wikis a Dislike token.
Apr 4 2020, 1:40 AM · Tech Ambassadors & Translators, covid-19, Pywikibot, Proposal, Pywikibot-RfCs
Pppery awarded T137584: Allow Scribunto code to add a category without changing output a Dislike token.
Apr 4 2020, 12:28 AM · MediaWiki CodeJam Dec 2023, Patch-For-Review, Platform Engineering, Scribunto

Feb 26 2020

Yurik added a comment to T174981: Add pageviews total counts to WDQS.

@Gehel lets define this amount of data, just for clarity. My back-of-the-envelope calculations:

Feb 26 2020, 4:52 PM · Analytics-Radar, Discovery-ARCHIVED, Wikidata, Wikidata-Query-Service

Feb 24 2020

Yurik added a comment to T128281: Privacy link does not point to localized policy.

hi @Jcross, can't recall what this is about, can close I guess

Feb 24 2020, 7:38 PM · WMF-Legal, Privacy

Feb 13 2020

Amire80 awarded T150031: Map with overlay is not being shown in Android app a Heartbreak token.
Feb 13 2020, 8:36 AM · Content-Transform-Team, Wikipedia-Android-App-Backlog, Android-app-Bugs

Feb 11 2020

Yurik added a comment to T244845: Handling of nulls in data namespace.

@Fae Python treats null as None when doing dict <-> json

Feb 11 2020, 7:17 PM · Commons-Datasets

Jan 13 2020

Yurik updated subscribers of T212843: [EPIC] Access to Wikidata's lexicographical data from Wiktionaries and other WMF sites.

@Lucas_Werkmeister_WMDE thank you for all the hard work on this task! Do you have any approximate timeline of the getEntity() returning all lexeme forms, or is that already implemented? How significant of a challenge is it? I have been spending considerable time updating Lexicator bot to parse multiple Wiktionary languages, and handle multiple linguistic types, but all that work is mostly pointless until Wiktionaries can access that data.

Jan 13 2020, 8:45 PM · All-and-every-Wiktionary, Wikidata, Wikidata Lexicographical data
Yurik added a comment to T242596: Allow .tab and .map files to be added to categories.

This would be solved with T155290

Jan 13 2020, 6:04 PM · Commons-Datasets

Jan 4 2020

Yurik added a comment to T174981: Add pageviews total counts to WDQS.

I would guess this is mostly a devops task - orchestrate execution of an updating script. Here's the working implementation - https://github.com/Sophox/sophox/blob/master/osm2rdf/updatePageViewStats.py

Jan 4 2020, 6:52 PM · Analytics-Radar, Discovery-ARCHIVED, Wikidata, Wikidata-Query-Service
Yurik added a comment to T174981: Add pageviews total counts to WDQS.

@Tagishsimon this proposal would not edit wikidata. Instead, as part of the WDQS import process, it would upload pageviews in bulk from the pageview dump files directly into the Blazegraph index. It could do it every hour, and computation-wise it will be relatively inexpensive (i ran it as part of Sophox a few times).

Jan 4 2020, 3:02 AM · Analytics-Radar, Discovery-ARCHIVED, Wikidata, Wikidata-Query-Service

Dec 9 2019

Yurik awarded T220985: Complete support of Lexemes in QuickStatements a Mountain of Wealth token.
Dec 9 2019, 12:22 PM · Wikidata, Wikidata Lexicographical data

Dec 6 2019

Yurik added a comment to T235356: Fatal from ApiGraph->getGraphSpec: "Call to a member function getExtensionData() on boolean".

The fix seems sensible, thx!

Dec 6 2019, 5:54 PM · MW-1.35-notes (1.35.0-wmf.10; 2019-12-10), Editing-team, MediaWiki-extensions-Graph, Wikimedia-production-error

Nov 25 2019

Yurik added a comment to T238554: [Spike] Consider using imposm3 as the OSM replication system.

@MSantos there will be an OpenMapTiles community sync up this Thursday (10:30a ET), let me know if you would like to join in - we will be discussing how to move OMT forward, and possibly accommodate for Wikipedia needs. Email me YuriAstrakhan@gmail.com with your email addr.

Nov 25 2019, 9:20 PM · Product-Infrastructure-Team-Backlog-Deprecated (Kanban), Maps (Maps-data)

Nov 18 2019

Yurik added a comment to T238554: [Spike] Consider using imposm3 as the OSM replication system.

sure, sounds good, so how about this - if you create a page/ticket/... with some basic info and goals, I will add implementation details to it. Would that work?

Nov 18 2019, 7:34 PM · Product-Infrastructure-Team-Backlog-Deprecated (Kanban), Maps (Maps-data)
Yurik added a comment to T238554: [Spike] Consider using imposm3 as the OSM replication system.

@MSantos I am all for WMF to start using the OMT project rather than our first implementation, but I am not sure how valuable it will be to write an RFC -- so far WMF has not been too eager to support a proper map serving efforts, relying mostly on semi-volunteer efforts of different enthusiasts to keep it around. Do you think writing RFC will help in changing that? Or will it be just another dusty page on Phabricator?

Nov 18 2019, 6:41 PM · Product-Infrastructure-Team-Backlog-Deprecated (Kanban), Maps (Maps-data)
Yurik added a comment to T238554: [Spike] Consider using imposm3 as the OSM replication system.

Note that the openmaptiles project is rapidly improving, with the goal of generating tiles "on the fly" -- without the tile pregeneration step, and without mapnik. In other words, a vector tile (MVT) is generated by a single giant PostgreSQL query, and send to the user on request (with some caching to speed up frequently-viewed regions). Adapting this approach will greatly simplify current Wikipedia setup - no more Mapnik, no more Cassandra, easily scalable architecture (the more postgres replicas, the bigger the capacity).
P.S. And yes, OpenMapTiles is using Imposm3, together with a number of other good data sources like Natural Earth for low zooms.

Nov 18 2019, 4:43 PM · Product-Infrastructure-Team-Backlog-Deprecated (Kanban), Maps (Maps-data)

Nov 10 2019

Od1n awarded T137584: Allow Scribunto code to add a category without changing output a Like token.
Nov 10 2019, 4:36 PM · MediaWiki CodeJam Dec 2023, Patch-For-Review, Platform Engineering, Scribunto

Nov 5 2019

Yurik added a comment to T234788: Commons limit on data is 2,048 kilobytes.

See my above comment, and @Lucas_Werkmeister_WMDE response -- while it stores things in the compact JSON form, the length is checked while it is in the "pretty-printed" format. A way to work around it might be to upload it to the server in the compact form via API, in which case it might get accepted.

Nov 5 2019, 9:27 PM · Commons-Datasets

Nov 2 2019

Arjunaraoc awarded T145688: [epic] Improve OSM-Wikipedia collaboration a Like token.
Nov 2 2019, 5:09 AM · Epic, Maps (Kartographer)

Oct 25 2019

Yurik added a comment to T211881: graphoid: Code stewardship request.

@dr0ptp4kt not just JS -- data sources could be far larger component to the graphs - e.g. one graph could mix together multiple data sources, including some tabular data pages (up to 2MB each), queries to Wikidata (currently broken btw -- lots of users are complaining because millions of population graphs are broken), a few images from commons, and even some mediawiki API calls. A full download could be in tens of megabytes, and some could be slow.

Oct 25 2019, 9:22 PM · Release-Engineering-Team-TODO (201908), Release-Engineering-Team (Code Health), Platform Team Legacy (Watching / External), Services (watching), SRE, Code-Stewardship-Reviews, Graphoid

Oct 13 2019

Yurik updated the task description for T224312: LinguaLibreBot : Improve handling of Wikidata Lexeme.
Oct 13 2019, 6:49 PM · Wikidata, Wikidata Lexicographical data, Lingua-Libre-Legacy