The AQS 2.0 Druid-based testing endpoints will need data from our production data lake, to execute tests against. See T316849: Audit tests for Druid-based endpoints for a list of tests, and an initial analysis of the necessary data.
Write queries to extract this data in a format suitable for ingestion into the Druid testing environment described in T311190: Establish testing procedure for Druid-based endpoints. Document the resulting queries, including the procedure required to execute them and ingest the data into the testing environment.
This task does not cover publishing and making accessible a Docker image for the Druid testing environment containing the ingested data. That work should be covered in a separate task.
Completion criteria:
- queries are written and successfully executed against production
- testing data is ingested into the Druid testing environment (locally, not published anywhere)
- all steps required to perform the above are documented