Page MenuHomePhabricator

Prepare lexeme workshop at WikidataCon 2019
Closed, ResolvedPublic

Description

Prepare the lexeme workshop at WikidataCon 2019.

https://wikidatacon.wikidata.link/wiki/Item:Q86

26 October 2019
11:30

Event Timeline

I was thinking some graphics on the growth of lexemes, forms, and senses would be good - do we already have that somewhere?

Not that I know of, but they could be generated based on Quarry results, such as this one with no. of new lexemes per day https://quarry.wmflabs.org/query/39180. I'm not sure how to query for forms/senses/usage examples off the top of my head, but I'm sure it's possible.

Also would be cool to have a growth graph with several of the most popular languages.

I played around with visualizing the Quarry output some time ago so I've got something to work with :)

I just did some exploring but I don't think Quarry will help with forms and senses - at least they're not "pages" in themselves with their own namespace. Actually I couldn't figure out where they were in the database schema at all... Anyway, I think I can get some rough numbers from looking at the stats page as it has changed over time, I will work on this.

I am uploading files with data on the counts of forms and senses by date for the last year (also totals in last column). There may have been some issues with this - it comes from the Lexicographical statistics pages that are generated from WDQS queries, so there were a few periods where I think the numbers were off. Anyway, it should be close to correct for most of the time period. So we can plot this along a bit of a timeline for the last year I think?

Ohh, these are really cool :) You can see how Basque goes crazy, and some time later Russian joins it...

Here's a draft of slides for our workshop. Please feel free to edit this. Also I think you wanted to cover a bit more basics of how lexemes are put together - maybe that should go first? This was mostly just gathering statistics and data and then a page of questions at the end...