More like this queries are the least performant queries on the cluster: despite making up only ~5% of total query volume, they consume more than 20% of server resources.
We should be able to measure the impact, in terms of server load, by reusing the load testing setup that was put together for T117714. gor allows middleware to be written in any language (Python is readily available on the elastic servers) to pre-process each request before it is sent. This middleware can rewrite the more like this queries, and we can then measure the difference in server load and any increase in throughput.
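A rough sketch of what such a middleware could look like follows. It assumes gor's middleware protocol of one hex-encoded payload per line on stdin and stdout, where the first line of each decoded payload starts with the payload type (1 for a request). `MLT_MARKER` and `EXTRA_PARAMS` are hypothetical placeholders, not the real parameters, and the rewrite only touches the request line of GET-style requests.

```python
#!/usr/bin/env python3
"""Sketch of a gor middleware that rewrites more like this requests."""
import binascii
import sys

# Hypothetical placeholders: substitute the real marker and URI parameters.
MLT_MARKER = b"morelike"
EXTRA_PARAMS = b"exampleMltParam=1"

# Assumed gor payload types: 1 = request, 2 = original response,
# 3 = replayed response. Only requests are rewritten.
REQUEST_TYPE = b"1"


def rewrite_payload(payload: bytes) -> bytes:
    """Append EXTRA_PARAMS to the URI of more like this requests."""
    # The first line is gor's own header; the HTTP message follows it.
    header, sep, http = payload.partition(b"\n")
    if not sep or header.split(b" ")[0] != REQUEST_TYPE:
        return payload
    request_line, line_end, rest = http.partition(b"\r\n")
    parts = request_line.split(b" ")
    if len(parts) != 3 or MLT_MARKER not in parts[1]:
        return payload
    joiner = b"&" if b"?" in parts[1] else b"?"
    parts[1] = parts[1] + joiner + EXTRA_PARAMS
    return header + b"\n" + b" ".join(parts) + line_end + rest


def main() -> None:
    """Read hex-encoded payloads from stdin, write rewritten ones to stdout."""
    for line in sys.stdin.buffer:
        line = line.strip()
        if not line:
            continue
        payload = binascii.unhexlify(line)
        sys.stdout.buffer.write(binascii.hexlify(rewrite_payload(payload)) + b"\n")
        sys.stdout.buffer.flush()

# To use this as a middleware, call main() when the file is run as a script.
```

Running the same replay once with and once without the middleware should give the two load profiles to compare. Responses and requests that do not contain the marker pass through unchanged.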
We also need to determine whether the results are any better or worse with this change. We can initially run a number of queries through the relevance lab to see whether there are any differences. Next we can run an A/B test measuring user click-through rates to see if users like the new suggestions any better than the old ones.
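One possible way to compare click-through rates between the two buckets is a two-proportion z-test. The sketch below is only an illustration of that comparison, not a description of how the test will actually be analyzed; the function name and its arguments are hypothetical.

```python
import math


def two_proportion_z_test(clicks_a, views_a, clicks_b, views_b):
    """Return (z, two-sided p-value) for the difference in click-through rate.

    Bucket A is the control (old suggestions), bucket B the test (new ones).
    A positive z means bucket B had the higher click-through rate.
    """
    rate_a = clicks_a / views_a
    rate_b = clicks_b / views_b
    # Pooled rate under the null hypothesis that both buckets behave the same.
    pooled = (clicks_a + clicks_b) / (views_a + views_b)
    std_err = math.sqrt(pooled * (1 - pooled) * (1 / views_a + 1 / views_b))
    if std_err == 0:
        raise ValueError("no variation in clicks; the test is undefined")
    z = (rate_b - rate_a) / std_err
    # Two-sided p-value from the standard normal distribution.
    p_value = math.erfc(abs(z) / math.sqrt(2))
    return z, p_value
```

A small p-value only says the two rates differ; the sign of z says which bucket did better.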
Currently, any request that wants to try the new results can append the following URI query parameters to get the new functionality:
This fits nicely into one of our Q3 goals, evaluating current cirrus options and adjusting to what makes the most sense.