Passes for me locally, so not sure what's wrong exactly.
Sat, Jun 22
But packed production version does not have UrlShortener.js? I think it's the code on query.personaldata.io that is using unpacked version that is blocked. I am not sure we can (or should) do anything here - maybe just the code on query.personaldata.io should rename the file?
Given that it only keeps 5 versions which now take under 2G, I think we shouldn't worry about it too much for now.
STRAFTER( STR( ?item ), STR( wd: ) ) looks like sensible solution for this.
Fri, Jun 21
I think this is what 2385879c4d64e027a4122aee20655e365628f6ef in WikibaseCirrusSearch did?
OK, I did a survey of existing SPARQL endpoints and looks like they all omit datatypes on plain literals on JSON. So I'll fix WDQS to do the same.
This: https://issues.apache.org/jira/browse/JENA-1077 suggests JENA also thinks types should be omitted on output for plain/language literals.
The rules for the XML and JSON results formats are that the implicit datatype of langtagged literals is omitted.
Could you give some details on what recommendation API is actually doing there? Maybe post a link to the relevant code and probably explanation of which queries it is running and how often? Generally 429 means it is sending too many queries too often. Which shouldn't really happen on the test. But need some details.
I would start with running with -v, looking at which URLs it is trying to access and looking what these URLs actually produce.
Thu, Jun 20
I've added my reviews and updated https://gerrit.wikimedia.org/r/c/operations/puppet/+/516444 to base on refactoring patch.
I wrote a note on wikidata list. The standards seem to be conflicting here, so I'll try to research what's more correct - or at least most accepted - practice here. For now, it stays as it is, but if it turns out common practice is to omit the types, we'd have to patch.
Since wmf.10 hasn't been deployed yet (T220735) and probably won't be this week - sometime after that happens. I'll update the ticket then. It probably will take several days (after the train has been resumed), so realistically count on starting using it somewhere in July. I understand it's taking long, but combination of no deployments and then failed deployments is an unfortunate circumstance we have to adjust for.
I've sent a note to wikidata and xmldatadumps-l lists.
I think we need to drop a note to wikidata-l, maybe also add something to Weekly notes (@Lea_Lacroix_WMDE ?). Not sure what else.
This will require DB reload, so will be handled when that happens.
Tue, Jun 18
Given that we have no reliable reproduction, it's unlikely we can do anything here. If we ever get one please reopen or open a new one.
No idea about JSON dumps, but I don't see any reason not to. I don't need them, but since it's Wikibase, it makes sense to have them too.
It's a bit messy since different RDF/SPARQL/etc. standards disagree on how literals look like - newer RDF 1.1 says everything has datatype, but allows to skip datatype in some syntax, and some older standards still omit datatype...
The query works fine, so I think the problem is in the UI. Maybe some broke due to Sesame upgrade. I'll take a look.
In the Sesame/RDF, everything has data type, even string literals. Is it causing any problems? Additional field in JSON should not be problematic.
../../extensions/Wikibase/tests/phan/stubs/cirrussearch.php was in the ignore list, but since it has moved...
Mon, Jun 17
Sat, Jun 15
Fri, Jun 14
Thu, Jun 13
Deployments usually happen on Mondays. See https://wikitech.wikimedia.org/wiki/Deployments
Wed, Jun 12
Tue, Jun 11
It's the same issue as T197447. We probably need to think about more systemic solution for this... Changing ICU collation may increase data size and lead to other complications, so maybe filtering the data and removing characters like U+0200F?
Seems to work now.
Can we get away with a 'dumpwikibaserdf.sh' and some well-chosen variables for both wikidata and commons (or at worst two much much shorter scripts) instead of a whole new dumpcommonsrdf.sh?
Mon, Jun 10
Sat, Jun 8
Fri, Jun 7
Will WDQS fail in spectacular ways if it requests objects over the uncached endpoints?