Support linking to lexemes in statements on Commons
Open, Needs Triage · Public · Feature

Description

Feature summary (what you would like to be able to do and where):

It should be possible to create statements linking to lexemes.

Use case(s) (list the steps that you performed to discover that problem, and describe the actual underlying problem which you want to solve. Do not describe only a solution):

It's not currently possible to store links from files such as pronunciation audio files (e.g. File:Ja-nihon(日本).ogg) or images of words (e.g. File:Nihon.png) to the corresponding lexemes in Wikidata as structured data.

Benefits (why should this be implemented?):

Event Timeline

Nikki renamed this task from "Support linking to lexemes in statements" to "Support linking to lexemes in statements on Commons". Mar 22 2022, 2:41 AM

Note that this needs flexibility: in some cases, one would like to be able to link to a specific form. For example, Sv-anden (wild duck).ogg should link to L251549#F2.

I would link to the lexeme, and use "object form" as a qualifier only when it's really necessary. There are several reasons for this: most of the time, it'll be possible to detect which forms the file matches (when P9533 (audio transcription) on the file matches the form representation); search works better for lexemes; there can be multiple forms with the same representation; and there won't always be a matching form (e.g. in languages with extremely large numbers of possible forms).

I can think of two situations where linking specific forms is necessary:

• When a lexeme has forms which are written the same but pronounced differently, e.g. Knie in German.

• When the text being spoken includes other words (like "a" or "the"), e.g. https://commons.wikimedia.org/wiki/File:Le_chat.ogg.

  • That file could have "pronunciation of lexeme: L511" with qualifier "object form: L511-F4".
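To make the matching idea above concrete, here is a rough Python sketch of how a tool might decide whether the form can be inferred from the transcription or whether an explicit "object form" qualifier is needed. Everything in it is an assumption for illustration: the `Form` class, the function names, the dictionary shape of the statement and the placeholder IDs (`L0-F1` etc.) are hypothetical and are not the actual Wikibase or Commons data model or API.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass(frozen=True)
class Form:
    """Hypothetical, simplified stand-in for a lexeme form."""
    form_id: str          # placeholder ID such as "L0-F1"
    representation: str   # how the form is written


def matching_forms(transcription: str, forms: List[Form]) -> List[str]:
    """Return IDs of forms whose representation equals the transcription
    (ignoring case and surrounding whitespace)."""
    wanted = transcription.strip().casefold()
    return [f.form_id for f in forms
            if f.representation.strip().casefold() == wanted]


def classify(transcription: str, forms: List[Form]) -> str:
    """Say whether the form can be inferred from the transcription alone."""
    matches = matching_forms(transcription, forms)
    if len(matches) == 1:
        return "inferred"    # exactly one form matches, no qualifier needed
    if len(matches) > 1:
        return "ambiguous"   # same spelling, several forms: qualifier may be needed
    return "no-match"        # extra words spoken, or no such form exists


def build_statement(lexeme_id: str, form_id: Optional[str] = None) -> dict:
    """Build an illustrative statement; the qualifier is added only on request."""
    statement = {"property": "pronunciation of lexeme", "value": lexeme_id}
    if form_id is not None:
        statement["qualifiers"] = {"object form": form_id}
    return statement


# Hypothetical data: two forms written the same, one written differently.
forms = [Form("L0-F1", "Knie"), Form("L0-F2", "Knie"), Form("L0-F3", "Knies")]
print(classify("Knies", forms))      # one matching representation
print(classify("Knie", forms))       # two forms share this representation
print(classify("das Knie", forms))   # transcription contains an extra word
print(build_statement("L0", "L0-F2"))
```

In this sketch only the "ambiguous" and "no-match" cases would need a human (or another signal) to pick the form, which is the point of keeping the qualifier optional.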