Prepare the lexeme workshop at WikidataCon 2019.
https://wikidatacon.wikidata.link/wiki/Item:Q86
26 October 2019
11:30
Alicia_Fagerving_WMSE | |
Oct 11 2019, 6:33 AM |
F30875443: lexeme_workshop.pptx | |
Oct 22 2019, 1:46 PM |
F30872929: senses_lang_dates.csv | |
Oct 21 2019, 2:22 PM |
F30872928: forms_lang_date.csv | |
Oct 21 2019, 2:22 PM |
Prepare the lexeme workshop at WikidataCon 2019.
https://wikidatacon.wikidata.link/wiki/Item:Q86
26 October 2019
11:30
Status | Subtype | Assigned | Task | ||
---|---|---|---|---|---|
Resolved | Alicia_Fagerving_WMSE | T229798 WMSE participation in WikidataCon 2019 | |||
Resolved | Alicia_Fagerving_WMSE | T230864 Outline a lexeme workshop at WikidataCon 2019 | |||
Resolved | Alicia_Fagerving_WMSE | T235247 Prepare lexeme workshop at WikidataCon 2019 |
I was thinking some graphics on the growth of lexemes, forms, and senses would be good - do we already have that somewhere?
Not that I know of, but they could be generated based on Quarry results, such as this one with no. of new lexemes per day https://quarry.wmflabs.org/query/39180. I'm not sure how to query for forms/senses/usage examples off the top of my head, but I'm sure it's possible.
Also would be cool to have a growth graph with several of the most popular languages.
I played around with visualizing the Quarry output some time ago so I've got something to work with :)
I just did some exploring but I don't think Quarry will help with forms and senses - at least they're not "pages" in themselves with their own namespace. Actually I couldn't figure out where they were in the database schema at all... Anyway, I think I can get some rough numbers from looking at the stats page as it has changed over time, I will work on this.
I am uploading files with data on the counts of forms and senses by date for the last year (also totals in last column). There may have been some issues with this - it comes from the Lexicographical statistics pages that are generated from WDQS queries, so there were a few periods where I think the numbers were off. Anyway, it should be close to correct for most of the time period. So we can plot this along a bit of a timeline for the last year I think?
Ohh, these are really cool :) You can see how Basque goes crazy, and some time later Russian joins it...
Here's a draft of slides for our workshop. Please feel free to edit this. Also I think you wanted to cover a bit more basics of how lexemes are put together - maybe that should go first? This was mostly just gathering statistics and data and then a page of questions at the end...
Slides on Commons: https://commons.wikimedia.org/wiki/File:2019-10-26_WikidataCon_Lexemes_in_Wikidata.pdf
Entry in the program: https://www.wikidata.org/wiki/Wikidata:WikidataCon_2019/Program/Sessions/Wikidata_lexemes:_now_and_in_the_future
Global metrics: T236649