We are considering using the morelike API as part of our work on inter-article recommendations. This API is currently used for RelatedArticles, but only on demand: it is called when people scroll to the bottom of an article. Consequently, this API is not yet ready to be used on a wider scale. One potential treatment is to have it fire when users activate search, which would result in significantly higher traffic. So! We need to get a better sense of current traffic as well as expected future traffic, which we can then pass on to the Search team to help them decide how they might scale this API.
Questions we are trying to answer
- What is the current level of traffic this API experiences?
- What is the expected level of traffic it might experience in each of the scenarios below? Investigate: how much traffic the search widget currently generates; how likely this content is to be cached, and any other caching implications; how many requests we see per unit time; and the characteristics of that traffic (e.g. are there surges or other interesting patterns?).
  - Morelike results shown in the search bar for anonymous users
  - Morelike results shown on page load for anonymous users
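
As a rough illustration of how the factors above (request volume, caching, surges) combine into numbers the Search team could use, here is a minimal Python sketch. The function names and every input value are hypothetical placeholders, not measurements; real inputs would come from the analysis this task asks for.

```python
from statistics import mean


def summarize_traffic(counts_per_minute):
    """Summarize per-minute request counts as mean and peak requests per second.

    `counts_per_minute` is a hypothetical list of request counts, one per minute.
    A peak-to-mean ratio well above 1 suggests surges.
    """
    avg = mean(counts_per_minute)
    peak = max(counts_per_minute)
    return {
        "mean_rps": avg / 60,
        "peak_rps": peak / 60,
        "peak_to_mean": peak / avg,
    }


def estimate_backend_rate(requests_per_second, cache_hit_ratio):
    """Estimate requests per second that miss the cache and reach the backend.

    Assumes a single cache layer with a uniform hit ratio, which is a
    simplification of how caching would behave in practice.
    """
    if not 0.0 <= cache_hit_ratio <= 1.0:
        raise ValueError("cache_hit_ratio must be between 0 and 1")
    return requests_per_second * (1.0 - cache_hit_ratio)


if __name__ == "__main__":
    # Placeholder numbers for illustration only.
    summary = summarize_traffic([60, 120, 60, 240])
    print(summary)
    print(estimate_backend_rate(summary["peak_rps"], cache_hit_ratio=0.75))
```

Running the same calculation once per scenario (search bar, page load) with measured inputs would give comparable mean and peak figures for each.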
Acceptance Criteria
- Work with a Data Analyst to obtain information about current and potential future usage
- Work with Search team where necessary to get additional context
- Perform the analysis and obtain concrete numbers for potential future traffic, keeping in mind the above factors
