Page MenuHomePhabricator

Add bzip2 support to DCAT-AP description of dumps
Closed, ResolvedPublic

Description

Since 2015-11-09 bzip2 compressed version of the Wikidata entity dumps are published alongside the gzip one. This compressed version is however not described by the DCAT-AP dump description.

Event Timeline

Lokal_Profil raised the priority of this task from to Needs Triage.
Lokal_Profil updated the task description. (Show Details)
Lokal_Profil subscribed.

Unlike different dump formats new compression formats are not as easy to add to the DCAT script.

Change 262422 had a related patch set uploaded (by Lokal Profil):
Support multiple compression formats for dumps

https://gerrit.wikimedia.org/r/262422

Change 262422 had a related patch set uploaded (by Lokal Profil):
Support multiple compression formats for dumps

https://gerrit.wikimedia.org/r/262422

And https://gerrit.wikimedia.org/r/262423 for the config.json change

Change 262422 merged by Hoo man:
Support multiple compression formats for dumps

https://gerrit.wikimedia.org/r/262422

Merged into master, but still needs to be deployed (and I want localization updates unstuck before doing that). I'll keep this open so that we don't forget about it.

I also added a follow-up patch to deal with (some of) the concerns about parameter passing.

https://gerrit.wikimedia.org/r/#/c/263169/

Merged into master, but still needs to be deployed (and I want localization updates unstuck before doing that). I'll keep this open so that we don't forget about it.

We also need to remember to merge+deploy the operations/puppet patch before the operations/dumps/dcat patch is deployed.

@hoo: Can https://gerrit.wikimedia.org/r/#/c/262423/ be merged? Should work fine even before the operations/dumps/dcat patch is deployed.

@hoo: Can https://gerrit.wikimedia.org/r/#/c/262423/ be merged? Should work fine even before the operations/dumps/dcat patch is deployed.

I talked to Ariel about this today… I guess it will be taken care of either today or tomorrow.

I merged this today. Please check that everything's working as you expect.

I merged this today. Please check that everything's working as you expect.

Thanks. Nothing should visibly change until https://gerrit.wikimedia.org/r/#/c/262422/ is deployed. And for that we are waiting to see if the next l10n-bot patch is succesfully submitted. I updated the sv translation so there should be a push in the works.

Hum.. so l10n-bot doesn't seem to be pushing the last changes. Any thoughts @Nikerabbit?

I am semi-regularly running it on Mondays and Thursdays, but I had to skip it this Thursday.

I am semi-regularly running it on Mondays and Thursdays, but I had to skip it this Thursday.

That explains it :)

I just saw that the latest l10n-bot patch was submitted correctly so the setings seem to be ok now.

I guess that was the last blocker for deploying operations/dumps/dcat?

hoo claimed this task.

I just deployed 92ab37d94e and regenerated the RDF.