Allow cancelling / aborting queries server-side
Closed, DeclinedPublic
Actions

Assigned To

None

Authored By

	WikidataFacts
	May 28 2016, 1:30 PM

Description

Original problem

Most of the time, I expect a query to run in less than a second. If it takes longer than that, I assume that I’ve accidentally made an expensive query. In that case, a) I need to make it less expensive (e. g. a LIMIT clause), and b) I don’t actually need WDQS to run the query until it hits the time limit. And that’s always a crappy feeling when I realize that WDQS is now going to unnecessarily labor for 30 seconds (full load, presumably). Adding a button to cancel / abort a running query would take load off the service and allow the user to run the improved query earlier.

Solution this task covers

Use the Blazegraph CANCEL POST request to stop the query from running on one of the query servers.

Details

	Subject	Repo	Branch	Lines +/-
	Add ability to cancel queries	wikidata/query/gui	master	+94 -43
	Add URL to cancel running query	operations/puppet	production	+11 -0

Customize query in gerrit

Related Objects
Search...

		Status	Subtype	Assigned	Task
		Declined		None	T136479 Allow cancelling / aborting queries server-side
		Resolved		Smalyshev	T112151 Support POST for SPARQL query endpoint

Event Timeline

There are a very large number of changes, so older changes are hidden. Show Older Changes

Restricted Application added projects: Wikidata, Discovery-ARCHIVED. · View Herald TranscriptMay 28 2016, 1:30 PM

Restricted Application added subscribers: Zppix, Aklapper. · View Herald Transcript

I wanted to ask for pretty much the same thing. :) I often make stupid mistakes that mean my query will either time out or return far too many results. It used to be possible to just click "Run" again, but now that button gets disabled until it returns results or times out, so once I've fixed the mistake, there's a long delay before I can run my new query.

I'm not really bothered about specifically having a button to stop a query, but I would really like to be able to start the new query without a long delay. For me, it would be ideal if the "Run" button were re-enabled when the query changes (and if the user runs the new query before the old one finishes/times out, silently cancel the old query).

Looks like there is a way to do it: https://wiki.blazegraph.com/wiki/index.php/REST_API#queryId

Smalyshev moved this task from Incoming to GUI on the Wikidata-Query-Service board.May 31 2016, 3:08 AM

@Smalyshev: Good to know. So the client would send the cancel request while the query GET request is still pending, and that would then return… 204 No Content? 408 Request Timeout? (I suppose it doesn’t matter – the client knows it canceled the query, it doesn’t need to check the response to detect that.)

@Nikki: One workaround is to open a new query tab ;) Ctrl+L,C,T,V, Enter.

Do we know that such queries have a big performance hit or cause undue strain on the servers? I agree that in an ideal world this feature would be useful, but if these queries are not causing any significant issues then practically speaking it may not be worth investing engineering time in such a feature.

Another note: cancel requires POST support as it seems, which currently is not supported. Also, since we have more than one server, we need some kind of affinity on Varnish side, which I am not sure whether we have or not.

@Deskana one query is probably no big deal. Lots of heavy queries can be a problem. But those are usually not from GUI but from bots.

I usually end up right clicking the tab and duplicating it, that way I don't lose what I currently have on the clipboard. :) I suppose refreshing the page would work too but I never trust browsers/pages to not lose something if I do that.

Also, I didn't mention why what I suggested would be ideal (for me): Most of the options (deciding to sit and wait (and succeeding in doing that without getting distracted by something else ;)), clicking a stop button, refreshing or opening a new tab) need some kind of deliberate action from me, which interrupts what I'm doing. What I suggested would be unobtrusive - I could just continue editing and running the query until I get the results I want without it interrupting me.

When I hit that problem I just press F5 to reload the page, because the query in the editor is still preserved.

I am not sure if canceling the query is worth the effort at the moment, because of the strict Blazegraph execution limit.
In the future when that limit will be greater we might want to think about canceling queries.

Smalyshev added a subtask: T112151: Support POST for SPARQL query endpoint.Jun 10 2016, 7:18 PM

I usually introduce some error in query and then press Run :)

Smalyshev triaged this task as Medium priority.Sep 12 2016, 10:37 PM

Smalyshev mentioned this in T147114: "Run" button not re-enabled after some queries fail.Oct 2 2016, 8:30 PM

Change 316042 had a related patch set uploaded (by Smalyshev):
Add URL to cancel running query

https://gerrit.wikimedia.org/r/316042

gerritbot added a project: Patch-For-Review.Oct 15 2016, 12:54 AM

Change 316044 had a related patch set uploaded (by Smalyshev):
Add ability to cancel queries

https://gerrit.wikimedia.org/r/316044

Diffusion mentioned this in rWDQG8bceea85096a: Add ability to cancel queries.Oct 15 2016, 1:20 AM

Diffusion mentioned this in rWDQGf40c95831ba5: Add ability to cancel queries.Oct 17 2016, 11:22 PM

Diffusion mentioned this in rWDQG20d538864a46: Add ability to cancel queries.Oct 17 2016, 11:28 PM

Smalyshev merged a task: T148495: Avoid freezing when making a too big SPARQL request.Oct 18 2016, 4:23 PM

Smalyshev added subscribers: VIGNERON, Lea_Lacroix_WMDE.

Wesalius subscribed.Oct 30 2016, 6:44 PM

Smalyshev changed the status of subtask T112151: Support POST for SPARQL query endpoint from Stalled to Open.Jan 13 2017, 12:48 AM

Smalyshev closed subtask T112151: Support POST for SPARQL query endpoint as Resolved.Feb 22 2017, 8:33 PM

edwardbetts subscribed.Mar 5 2017, 11:53 AM

We can abandon query, but we can not really cancel it, since we do not know to which server the query went and have no guarantee cancel query will go to the same server.

If this ticket includes cancelling the query on the server, should we make a new ticket for simply being able to run a new query before the existing one has timed out? It used to be possible before the run button started being disabled, so it seems like making that possible again would be easy to do even if we can't actually cancel the query on the server yet. It would be nice if it could cancel the existing query when starting a new query, but I don't think it really makes a difference to the user.

My concern here is that it will encourage very bad pattern of behavior - running a bad query, seeing it takes long, abandoning it, tweaking, running it again, etc. so that at the end the server ends up with 20 heavy queries running. Yes, I know you could just open a new window or reload, but this is higher barrier to producing lots of heavy queries than just clicking a button.

Smalyshev moved this task from GUI to Need investigation on the Wikidata-Query-Service board.Aug 31 2017, 8:54 PM

Daniel_Mietchen awarded a token.Sep 1 2017, 1:57 AM

Daniel_Mietchen subscribed.

Daniel_Mietchen mentioned this in T175448: Provide mechanism(s) to avoid WDQS timeouts for certain pre-approved queries.Sep 25 2017, 11:30 PM

Smalyshev removed a project: Patch-For-Review.Feb 15 2018, 1:32 AM

Lydia_Pintscher moved this task from incoming to needs discussion or investigation on the Wikidata board.Mar 6 2018, 9:24 AM

MichaelSchoenitzer awarded a token.Mar 12 2018, 1:00 AM

NoInkling awarded a token.Jan 7 2019, 2:35 AM

NoInkling rescinded a token.

NoInkling awarded a token.

NoInkling rescinded a token.

NoInkling awarded a token.

NoInkling subscribed.

abian merged a task: T235940: Provide mean to cancel a SPARQL request from Wikidata Query UI.Oct 19 2019, 2:54 PM

abian added a subscriber: Maxlath.

Actually even if we don't know which server the query went (there really should be some indication), we can send the cancellation to all servers. Though this will not scale once we have hundreds of query server.

(About task status: If this should be fixed upstream, add Upstream to this task)

WikidataFacts changed the task status from Open to Stalled.Oct 19 2019, 3:52 PM

@WikidataFacts: Does this task requires upstream fix?

The scalability of WDQS should not depend on users canceling queries manually. We have other ideas to improve performances.

@Gehel very true. That said, it won't hurt performances either and what about all the other problems of not being able to stop a query? (I hate when I have to restart my browser and/or computer who froze just because I dumbly forgot to remove a wdt:P279*)

This does not address the issue raised in {T136479#2338442} - I'm going to file a new task for it.

Bugreporter mentioned this in T245643: Stop current query from running on Query Service.Feb 19 2020, 5:04 PM

Change 316044 abandoned by DCausse:
[wikidata/query/gui@master] Add ability to cancel queries

Reason:
no longer in progress, please reopen if not true

https://gerrit.wikimedia.org/r/316044

Lucas_Werkmeister_WMDE mentioned this in T95917: Investigate continuing long result sets.Nov 2 2020, 11:57 AM

Jane023 subscribed.Jan 10 2022, 7:43 PM

Spinster subscribed.Jan 10 2022, 9:17 PM

Reopening because it was declined for a reason unrelated to the original request. Users want to be able to cancel queries so that they can create new ones easily and not have to wait - not because this will increase performance (and what ideas do you have?).

Idk this looks like it would require an intermediate database to keep track of which query ids are being sent to which servers.

Declining again because T136479#3458656

In T136479#8722821, @Lectrician1 wrote:

Reopening because it was declined for a reason unrelated to the original request. Users want to be able to cancel queries so that they can create new ones easily and not have to wait - not because this will increase performance (and what ideas do you have?).

This is T245643: Stop current query from running on Query Service.

Yeah @Aklapper this task is requesting for the ability for users to cancel a query on the query server, not starting another query when another one is running. This task should be open.

Reopening for the above reason. Revising the task description to better-convey what this is requesting.

Lectrician1 renamed this task from Allow cancelling / aborting queries to Allow cancelling / aborting queries server-side.Apr 1 2023, 1:52 PM

Lectrician1 updated the task description. (Show Details)

Solution this task covers

Use the Blazegraph CANCEL POST request to stop the query from running on one of the query servers.

This is not going to happen, as was already explained in T136479#3429293. Declining, once again. Please stop reopening this task.

I mean it's totally possible. You just set up a tracking database to determine which requests are sent to each server. Isn't this up to WMDE to decline?

Maxlath unsubscribed.Apr 1 2023, 8:46 PM

It’s up to WMF to decline, as they are responsible for the query service backend, and WMF have already declined it, see T136479#5897103.

What does that comment even mean?

Allow cancelling / aborting queries server-sideClosed, DeclinedPublicActions

Description

Details

Related ObjectsSearch...

Event Timeline

Allow cancelling / aborting queries server-side
Closed, DeclinedPublic
Actions

Related Objects
Search...