There were alerts about thanos compact restarting, it looks like Thanos can't fully download a large block to process for downsampling, and fails on the partial block:
Aug 05 14:52:22 thanos-fe2001 thanos-compact[217300]: level=info ts=2021-08-05T14:52:22.348331551Z caller=downsample.go:257 msg="downloaded block" id=01FCAJMWC7R34X9SAW9BBG3KP4 duration=27m27.437833254s Aug 05 14:52:32 thanos-fe2001 thanos-compact[217300]: level=warn ts=2021-08-05T14:52:32.719729769Z caller=intrumentation.go:54 msg="changing probe status" status=not-ready reason="error executing compaction: firs> Aug 05 14:52:32 thanos-fe2001 thanos-compact[217300]: level=info ts=2021-08-05T14:52:32.720338438Z caller=http.go:65 service=http/server component=compact msg="internal server is shutting down" err="error executi> Aug 05 14:52:33 thanos-fe2001 thanos-compact[217300]: level=info ts=2021-08-05T14:52:33.221321273Z caller=http.go:84 service=http/server component=compact msg="internal server is shutdown gracefully" err="error e> Aug 05 14:52:33 thanos-fe2001 thanos-compact[217300]: level=info ts=2021-08-05T14:52:33.221462386Z caller=intrumentation.go:66 msg="changing probe status" status=not-healthy reason="error executing compaction: fi> Aug 05 14:52:33 thanos-fe2001 thanos-compact[217300]: level=error ts=2021-08-05T14:52:33.222682204Z caller=main.go:197 err="read TOC: read TOC: invalid checksum\nopen index file\ngithub.com/thanos-io/thanos/pkg
A few of the blocks that failed
root@thanos-fe2001:~# grep -e 01FCAZKHWX3M2PZSKPAGXHT1F6 -e 01FCAJMWC7R34X9SAW9BBG3KP4 -e 01FCAVKB28XZVJPD42YSK70T9X thanos-bucket | 01FCAJMWC7R34X9SAW9BBG3KP4 | 22-07-2021 00:00:00 | 05-08-2021 00:00:00 | 336h0m0s | -296h0m0s | 6,195,310 | 95,136,077,478 | 811,225,626 | 4 | false | prometheus=ops,replica=a,site=codfw | 0s | compactor | | 01FCAVKB28XZVJPD42YSK70T9X | 22-07-2021 00:00:00 | 05-08-2021 00:00:00 | 336h0m0s | -296h0m0s | 6,187,613 | 95,153,384,053 | 811,187,071 | 4 | false | prometheus=ops,replica=b,site=codfw | 0s | compactor | | 01FCAZKHWX3M2PZSKPAGXHT1F6 | 22-07-2021 00:00:00 | 05-08-2021 00:00:00 | 336h0m0s | -296h0m0s | 7,262,333 | 119,362,059,700 | 1,010,579,012 | 4 | false | prometheus=ops,replica=b,site=eqiad | 0s | compactor |
Notice it took 27m to download a ~200G block, which is too long, and from swift's perspective the client (thanos compact) gave up and disconnected