
Support for monolingual text in the data extension service for Commons
Closed, Resolved (Public)

Description

Monolingual text values are not yet returned when performing data extension on Commons files via the Commons Reconciliation Service in OpenRefine.

You can test with this small dataset:

(all audio files that are pronunciation audio, found via this query)

  • Load this set in OpenRefine
  • Reconcile the file names using the Commons Reconciliation Service
  • Try to retrieve the values of the Audio Transcription (P9533) property. Currently this produces an empty column.
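
For reference, the last step corresponds to a data extension query sent by OpenRefine to the service. Below is a minimal sketch of that query, assuming the request shape defined by the Reconciliation Service API (`ids` plus a list of `properties`). The helper name `build_extend_query` and the entity ID `M12345` are made up for illustration and are not taken from the test dataset.

```python
import json


def build_extend_query(entity_ids, property_ids):
    """Build a data extension query in the Reconciliation Service API shape.

    Hypothetical helper for illustration; not part of the service itself.
    """
    return {
        "ids": list(entity_ids),
        "properties": [{"id": pid} for pid in property_ids],
    }


# "M12345" is a placeholder entity ID, not a real file from the dataset.
query = build_extend_query(["M12345"], ["P9533"])

# OpenRefine sends the query as a JSON string in the "extend" parameter.
payload = json.dumps(query)
```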

For inspiration: the Wikidata reconciliation service returns the string(s) without language code.
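
To illustrate the two options (with or without language code), here is a rough sketch of how a monolingual text value could be turned into a data extension cell. It assumes the standard Wikibase JSON shape for `monolingualtext` datavalues (`{"text": ..., "language": ...}`) and the `{"str": ...}` cell shape from the Reconciliation Service API. The function name and the `text (language)` output format are assumptions for illustration only; this is not the code from the actual patch.

```python
def monolingual_to_cell(datavalue, include_language=True):
    """Convert a Wikibase monolingualtext datavalue into a string cell.

    Hypothetical helper; the "text (language)" format is an assumption.
    Returns None when the datavalue carries no text.
    """
    value = datavalue.get("value") or {}
    text = value.get("text")
    if not text:
        return None
    language = value.get("language")
    if include_language and language:
        return {"str": f"{text} ({language})"}
    # Plain string only, as described above for the Wikidata service.
    return {"str": text}


# Made-up example value, not taken from a real Commons file.
example = {
    "type": "monolingualtext",
    "value": {"text": "hello", "language": "en"},
}
with_code = monolingual_to_cell(example)
without_code = monolingual_to_cell(example, include_language=False)
```

Keeping the language code behind a flag makes it easy to compare both behaviours on the same dataset.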

Event Timeline

Change 757428 had a related patch set uploaded (by Eugene233; author: Eugene233):

[labs/tools/commons-recon-service@main] Support for monolingual text in the data extension service for Commons

https://gerrit.wikimedia.org/r/757428

Change 757428 merged by jenkins-bot:

[labs/tools/commons-recon-service@main] Support for monolingual text in the data extension service for Commons

https://gerrit.wikimedia.org/r/757428

Tested, and this works! Thank you 😎

image.png (187×473 px, 27 KB)

I like that the language code is included!