We received alerts related to /srv on hosts titan1001, titan1002, and titan2002 (titan2001 has a larger RAID).
The alerts started increasing around October 3rd.
A couple of weeks earlier, we had also noticed an increase in the bytes transferred from Swift, most likely due to: https://gerrit.wikimedia.org/r/c/operations/puppet/+/1184566
We tried deleting older files on /srv/thanos-store (after depooling host titan1001 and stopping thanos-store), but within a few minutes of restarting, the cache was repopulated to exactly the same size as before.
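For reference, the repopulation can be confirmed by measuring the cache directory before the cleanup and again a few minutes after the restart. The snippet below is only a minimal sketch, not something we ran as part of this investigation: the helper name `dir_size_bytes` is hypothetical, and it reports apparent file sizes rather than allocated blocks, so the number may differ slightly from what `du` or the alert reports.

```python
import os
import sys


def dir_size_bytes(root):
    """Return the total apparent size, in bytes, of the files under root."""
    total = 0
    for dirpath, _dirnames, filenames in os.walk(root):
        for name in filenames:
            path = os.path.join(dirpath, name)
            try:
                # lstat does not follow symlinks, so linked targets are not counted.
                total += os.lstat(path).st_size
            except OSError:
                # The file disappeared between listing and stat; skip it.
                continue
    return total


if __name__ == "__main__":
    # The default path is the one mentioned above; pass another path to override it.
    root = sys.argv[1] if len(sys.argv) > 1 else "/srv/thanos-store"
    print(f"{root}: {dir_size_bytes(root) / 2**30:.2f} GiB")
```

Running it once before deleting files and once after the restart gives two figures to compare directly.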
Moreover, we noticed some inconsistencies in the storage configuration between the titan hosts:
- titan1001 and titan1002 already have some spare disks, but titan2002 has none;
- titan2001 is striping multiple partitions on the same disk.






