Page MenuHomePhabricator

Allow federated queries with the MiMoTextBase SPARQL endpoint
Closed, ResolvedPublic

Description

Could you please white list the following sparql endpoint:
https://query.mimotext.uni-trier.de/proxy/wdqs/bigdata/namespace/wdq/sparql

This is a sparql endpoint used at https://data.mimotext.uni-trier.de, a wikibase installation (“MiMoTextBase”), built up in the project “Mining and Modeling Text” (MiMoText), maintained by the Trier Center for Digital Humanities, University of Trier.

WDQS: https://query.mimotext.uni-trier.de/
MiMoText ID: https://www.wikidata.org/wiki/Property:P12047
MiMoText project: https://www.wikidata.org/wiki/Q122476591
SPARQL tutorial: https://docs.mimotext.uni-trier.de

Event Timeline

Gehel triaged this task as High priority.Nov 20 2023, 3:13 PM
Gehel moved this task from Incoming to Operations/SRE on the Wikidata-Query-Service board.
Gehel added a project: Data-Platform-SRE.
Gehel lowered the priority of this task from High to Medium.Nov 22 2023, 9:25 AM
Gehel moved this task from Incoming to Ready for Work on the Data-Platform-SRE board.

I would like to share the information that our project will expire at the end of 2023. We would like to include federated queries in the direction of "Wikidata -> MiMoTextBase" (the other direction is already integrated) into our SPARQL tutorial. If it is possible to know approximately how long it will take, that would be interesting for us. Thanks a lot for your help!

@HinMar Sorry for missing this request - our bad! I see your earlier comment mentioned the project expiring by end of 2023. Is the project still ongoing and therefore we should still whitelist this new endpoint or should I instead close this ticket out?

@RKemper : Thank you for your message. The project has ended, but we still kindly ask you to whitelist this endpoint. We at the Trier Center for Digital Humanities will continue to work with LOD beyond this one project and would be delighted to be able to run federated queries starting from Wikidata directed towards the MiMoTextBase. We have developed an approach in MiMoText that we now want to transfer and adapt for other domains in a new project (“LODinG” – Linked Open Data in the Humanities). We are still very interested in the 'wikiverse' and in gaining as much experience as possible in the area of 'federation', which we see as the absolute key to the LOD vision. We are also planning to provide a showcase and if the whitelisting could be done relatively soon, we would like to include this new 'federation direction'. Can you estimate how long it will take?

Change 995090 had a related patch set uploaded (by Ryan Kemper; author: Ryan Kemper):

[operations/puppet@production] wdqs: whitelist MiMoTextBase SPARQL endpoint

https://gerrit.wikimedia.org/r/995090

@RKemper : Thank you for your message. The project has ended, but we still kindly ask you to whitelist this endpoint. We at the Trier Center for Digital Humanities will continue to work with LOD beyond this one project and would be delighted to be able to run federated queries starting from Wikidata directed towards the MiMoTextBase. We have developed an approach in MiMoText that we now want to transfer and adapt for other domains in a new project (“LODinG” – Linked Open Data in the Humanities). We are still very interested in the 'wikiverse' and in gaining as much experience as possible in the area of 'federation', which we see as the absolute key to the LOD vision. We are also planning to provide a showcase and if the whitelisting could be done relatively soon, we would like to include this new 'federation direction'. Can you estimate how long it will take?

@HinMar Thanks, that context is helpful. I'm merging the patch now; if you could provide an example query to validate that things are working as intended that'd be great!

Change 995090 merged by Ryan Kemper:

[operations/puppet@production] wdqs: whitelist MiMoTextBase SPARQL endpoint

https://gerrit.wikimedia.org/r/995090

@HinMar Thanks, that context is helpful. I'm merging the patch now; if you could provide an example query to validate that things are working as intended that'd be great!

@RKemper : Thank you very much for whitelisting our endpoint. We are delighted and excited and have tried a few things to make sure that everything is configured correctly in our Wikibase instance. So far we have various results that we cannot yet explain.
With the following query, we (several of us in the team) initially had the error message "Unknown error: Service URI https://query.mimotext.uni-trier.de/proxy/wdqs/bigdata/namespace/wdq/sparql is not allowed" and then after a few repetitions (=retrying the execute query-button) it worked sometimes, but it is unstable: https://w.wiki/97Ev.
The following query does not work so far: https://w.wiki/97QT. (If you exclude lines 9-11, thus the SPARQL service part, it works: https://w.wiki/97R$).
Another query works after several repetitions: https://w.wiki/97Qd. If you completely replace a variable in the query (e.g. ?voltaire with ?author), it initially reappears "Unknown error: Service URI https://query.mimotext.uni-trier.de/proxy/wdqs/bigdata/namespace/wdq/sparql is not allowed", until it then works again: https://w.wiki/97Qi. However, a rather cumbersome construction via the reference URL is used here. The query described above using the MiMoText ID (wdt:P12047) more directly (https://w.wiki/97QT) does not work.
I hope this already helps a little in narrowing down the problem(s).

@HinMar Okay, I think we've got the endpoints properly allowed. Queries appear to be working for me. Are you seeing the same?

@RKemper : Yes, thank you for fixing the problem with the error message "endpoint is not allowed" in the first 5-10 runs. The queries now work right the first time, that’s really great! (e.g. https://w.wiki/9Ki5 or https://purl.org/mmt/federation/tq1) Could our endpoint be listed on this page: https://www.mediawiki.org/wiki/Wikidata_Query_Service/User_Manual/SPARQL_Federation_endpoints?

Gehel subscribed.

@RKemper : Yes, thank you for fixing the problem with the error message "endpoint is not allowed" in the first 5-10 runs. The queries now work right the first time, that’s really great! (e.g. https://w.wiki/9Ki5 or https://purl.org/mmt/federation/tq1) Could our endpoint be listed on this page: https://www.mediawiki.org/wiki/Wikidata_Query_Service/User_Manual/SPARQL_Federation_endpoints?

This is a wiki, feel free to add the documentation yourself! I've added a line to the table, but please review that the link to project is the right one.

Thanks a lot, I wasn't sure about the process in this case. (Also thank you for adding the line, I have adjusted the link.)