With the introduction of a new dump format the DCAT script should be updated to also include .nt
Description
Details
Subject | Repo | Branch | Lines +/- | |
---|---|---|---|---|
Allow format to be overridden in mediatype object | operations/dumps/dcat | master | +34 -17 |
Status | Subtype | Assigned | Task | ||
---|---|---|---|---|---|
Open | None | T88728 Improve Wikimedia dumping infrastructure | |||
Open | None | T88991 improve Wikidata dumps [tracking] | |||
Resolved | Smalyshev | T154531 Bad formatting for quotes in .nt export | |||
Resolved | Smalyshev | T144103 Create .nt (NTriples) dumps for wikidata data | |||
Resolved | Lokal_Profil | T154914 Add .nt to DCAT-AP for Wikidata dumps |
Event Timeline
Holding off on this until we get a clarification about the future of the DCAT-AP "extension".
.nt dumps is still used, and may even have more than one dump in the future, so it should be supported.
Change 425993 had a related patch set uploaded (by Lokal Profil; owner: Lokal Profil):
[operations/dumps/dcat@master] Allow format to be overridden in mediatype object
The patch doesn't add any new formats (truthy-nt is handled by T163328: Add the truthy nt dump to dcat-AP) but it allows for multiple .nt flavours in the future per T154914#3883832
Well, for this particular task we first need to actually have .nt dump :) Then I guess we'd need to merge https://gerrit.wikimedia.org/r/425993
I plan to have some progress on T144103 soon-ish. Unless of course anybody else wants to volunteer :)
@ArielGlenn
I think somebody needs to merge https://gerrit.wikimedia.org/r/c/operations/dumps/dcat/+/425993 for this? I don't have +2 there. I reviewed it and for me it looks ok (except for one typo) but it'd be nice if somebody who knows what's going on there reviewed it too.
After it's merged I assume we need to add another config there.
I looked at it and agree the format variable thing is a little annoying to read; other than that I have no comments. I shoudn't be the other reviewer though. Let me see if @hoo has time to have a look at the patch.
Hoo hopes to look at it sometime this week; if there's no movement by Friday I'll see what can be done.
Change 425993 merged by jenkins-bot:
[operations/dumps/dcat@master] Allow format to be overridden in mediatype object
I merged the patch and will deploy it probably in the next days… but without a configuration change, this is a no-op for now.