See parent task for context. The goal of this task is to have the appropriate tooling and processes to make investigating data pipeline issues not too tedious.
Minimal implementation could be just dumping a json list of related events based on pageID / request ID / ...
A more complex implementation might have superset dashboards, and nice interfaces