At a meeting on Aug 23, 2017, we agreed on the following action items for running a third test:
- Trey: review Discernatron queries for quality (https://phabricator.wikimedia.org/P5909)
- Erik: develop backend infrastructure to support lots of queries and lots of results per query (T174387)
- Erik: run the A/B test
- Mikhail: analyze the A/B test
- Erik: extract Discernatron judgements as comparison data
(We dropped the items to add and evaluate extra non-Discernatron results, since they would only add extra work.)