The rdf-streaming-updater uses Flink, which creates checkpoints in Thanos-swift object storage. A recent audit by @dcausse found ~1 TB of data stored there. After removing stale/unnecessary data, total usage dropped to ~20 GB.
This suggests that we need to be more aggressive about removing data, particularly because we will soon be moving the Search Update Pipeline to Flink.
Creating this ticket to:
- Create monitoring/alerts for object storage usage
  These were already created by @dcausse; see this dashboard for an example of metrics use, and the alerts live here.
- Decide whether or not we need an automated cleanup process
  We have decided to script this process. Automation is possible in the future, but out of scope for this ticket.
- Design/implement cleanup.
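
As a starting point for the cleanup design, here is a minimal sketch of the selection step only: given a listing of stored objects, pick the ones that look stale. Everything in it is an assumption for illustration: the `StoredObject` type, the `select_stale` and `reclaimable_bytes` helpers, the 30-day retention window, and the "always keep the newest 3" rule are hypothetical and not taken from the existing tooling. It does not talk to Swift or Flink at all; the listing and the actual deletes would come from whatever client the script ends up using.

```python
from dataclasses import dataclass
from datetime import datetime, timedelta, timezone


@dataclass(frozen=True)
class StoredObject:
    """Hypothetical listing entry; not an actual Swift or Flink type."""
    name: str
    last_modified: datetime
    size_bytes: int


def select_stale(objects, now, max_age=timedelta(days=30), keep_newest=3):
    """Return objects older than max_age, never touching the keep_newest most recent.

    The defaults are placeholders, not an agreed retention policy.
    """
    newest_first = sorted(objects, key=lambda o: o.last_modified, reverse=True)
    protected = {o.name for o in newest_first[:keep_newest]}
    cutoff = now - max_age
    return [
        o for o in newest_first
        if o.name not in protected and o.last_modified < cutoff
    ]


def reclaimable_bytes(stale):
    """Total size of the objects that a cleanup run would remove."""
    return sum(o.size_bytes for o in stale)


if __name__ == "__main__":
    # Dry run over made-up data: print what would be deleted, delete nothing.
    now = datetime.now(timezone.utc)
    listing = [
        StoredObject("checkpoints/chk-100", now - timedelta(days=90), 5_000_000),
        StoredObject("checkpoints/chk-200", now - timedelta(days=45), 7_000_000),
        StoredObject("checkpoints/chk-300", now - timedelta(days=1), 9_000_000),
    ]
    stale = select_stale(listing, now, keep_newest=1)
    for obj in stale:
        print(f"would delete {obj.name} ({obj.size_bytes} bytes)")
    print(f"reclaimable: {reclaimable_bytes(stale)} bytes")
```

Keeping the selection as a pure function makes a dry-run mode cheap and lets the retention rule be reviewed separately from the delete calls. Age alone may not be a safe criterion: before deleting, a real script would need to confirm that no retained checkpoint or savepoint still references the object.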