Examine cases where Blazegraph generates results that time out and don’t make it back to the user
Closed, Resolved, Public

Description

As someone working on WDQS, I want to better understand the limits of the service and the use cases that we don't support but that still consume resources.

Looking at queries that time out but could otherwise provide results should give us some idea of what users are trying to achieve but cannot, given the current limitations of the service. Since those queries are by definition expensive, it might also give us some ideas on how to reduce the load on the servers.

Things to do:

  • extract a random subset of the queries that timed out
  • analyze this subset to understand what kinds of queries these are and whether we should be doing something about them
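
As a rough illustration of the extraction step, here is a minimal sketch in plain Python. It assumes the query log can be loaded as records with a status code and a query time in seconds, and it borrows the heuristic mentioned later in this task (status 500 and query time above 60s) to flag a timeout. The `QueryLogEntry` type, its field layout, and the function names are hypothetical, not the actual log schema or tooling.

```python
import random
from dataclasses import dataclass

# Assumed threshold, taken from the "query_time > 60s" heuristic in this task.
TIMEOUT_SECONDS = 60.0


@dataclass
class QueryLogEntry:
    # Hypothetical record shape; the real log schema may differ.
    query: str
    status_code: int
    query_time: float  # assumed to be expressed in seconds


def looks_timed_out(entry):
    """Heuristic only: an HTTP 500 combined with a run time above the threshold."""
    return entry.status_code == 500 and entry.query_time > TIMEOUT_SECONDS


def sample_timed_out(entries, k, seed=None):
    """Return up to k randomly chosen entries that look like timeouts."""
    timed_out = [e for e in entries if looks_timed_out(e)]
    rng = random.Random(seed)  # a fixed seed makes the subset reproducible
    return rng.sample(timed_out, min(k, len(timed_out)))
```

Passing a fixed `seed` keeps the subset stable between runs, which helps when several people review the same sample.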

Event Timeline

The search team will extract the subset of queries and do a first-pass analysis (timeboxed to one day), and then hand it over to @Lydia_Pintscher and @JAllemandou to look at as part of T264194.

Gehel triaged this task as High priority. Oct 28 2020, 1:28 PM

For now, we use status_code = 500 and query_time > 60s to decide that a query timed out. Example notebook: