I noticed tonight that the webrequest_sampled_128 dataset in Druid has a data problem for the hour between 2023-05-20 00:00:00 and 2023-05-20 01:00:00.
When grouping by webrequest_source, it looks like almost all of the text data is missing, while the upload data seems OK:
For comparison, the data in the live dataset populated by Benthos looks totally fine.
Did something break in the pipeline for that hour?
I'm also curious to know whether there is any monitoring in place to detect this kind of situation.
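
To make the monitoring question concrete, here is a minimal sketch of the kind of check I have in mind. It is not an existing tool: the function name `find_drops`, the `min_ratio` threshold, and the input shape (hourly row counts per webrequest_source, already fetched from Druid by some other means) are all assumptions, and the numbers in the usage example are made up for illustration only.

```python
from statistics import median


def find_drops(hourly_counts, min_ratio=0.5):
    """Flag (source, hour) pairs whose count is far below that source's other hours.

    hourly_counts: dict mapping webrequest_source -> {hour label -> row count}.
    min_ratio: hypothetical threshold; an hour is flagged when its count is
               below min_ratio times the median of the source's other hours.
    """
    flagged = []
    for source, by_hour in hourly_counts.items():
        for hour, count in by_hour.items():
            # Baseline is the median of the other hours for the same source,
            # so a single bad hour does not skew its own baseline.
            others = [c for h, c in by_hour.items() if h != hour]
            if not others:
                continue
            baseline = median(others)
            if baseline > 0 and count < min_ratio * baseline:
                flagged.append((source, hour))
    return sorted(flagged)


# Made-up counts, only to illustrate the shape of the input.
example_counts = {
    "text": {"2023-05-19T23": 1000, "2023-05-20T00": 40, "2023-05-20T01": 980},
    "upload": {"2023-05-19T23": 500, "2023-05-20T00": 510, "2023-05-20T01": 495},
}
print(find_drops(example_counts))
```

Comparing each source against its own neighbouring hours, rather than against the total, would catch a case like this one where only text drops and upload stays normal.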