Page MenuHomePhabricator

Ingest more statements when retrieving lexemes, forms, & senses; add ranks to statements
Closed, ResolvedPublic

Description

Description

Currently, When transforming a retrieved Lexeme, Lexeme form, or Lexeme sense into the corresponding Wikidata type, we only ingest nested statements whose values are strings. All other statements are ignored. In this task, we add statements whose values are any of these:

  • Lexeme
  • Lexeme form
  • Lexeme sense
  • Item
  • Monolingual text

In addition, we had a new field rank to each statement that gets ingested, using the type & instances created in T378678.


Completion checklist

Event Timeline

I am interested in parsing the Wikidata Lexeme Dump and have experimented with it in the past. Maybe it is more efficient regarding needed resources per request to create lists in Wikifunctions as Z-Objects with the content what is needed per Lexeme. Updating the list of Lexemes in Wikifunctions is necessary in this case as new Lexemes will be added or updates to existing lexemes occur.

DMartin-WMF renamed this task from Ingest more statements when retrieving lexemes, forms, and senses to Ingest more statements when retrieving lexemes, forms, & senses; add ranks to statements.Nov 4 2024, 5:04 AM
DMartin-WMF updated the task description. (Show Details)

Change #1093349 had a related patch set uploaded (by Jforrester; author: Jforrester):

[operations/deployment-charts@master] wikifunctions: Upgrade orchestrator from 2024-11-18-142635 to 2024-11-19-132736

https://gerrit.wikimedia.org/r/1093349

Change #1093349 merged by jenkins-bot:

[operations/deployment-charts@master] wikifunctions: Upgrade orchestrator from 2024-11-18-142635 to 2024-11-19-132736

https://gerrit.wikimedia.org/r/1093349

Change #1115032 had a related patch set uploaded (by Cory Massaro; author: Cory Massaro):

[operations/deployment-charts@master] wikifunctions: Upgrade orchestrator from version: 2025-01-22-203140 to 2025-01-28-144249

https://gerrit.wikimedia.org/r/1115032

Change #1115032 abandoned by Cory Massaro:

[operations/deployment-charts@master] wikifunctions: Upgrade orchestrator from version: 2025-01-22-203140 to 2025-01-28-144249

Reason:

already done

https://gerrit.wikimedia.org/r/1115032