Yesterday we lost poolcounter1001, the poolcounter host that thumbor uses. As a result, thumbor failed to generate thumbnails and upload emitted 500s. We should investigate why this happened and make sure thumbor fails open when poolcounter isn't available.
Description
| Status | Subtype | Assigned | Task |
|---|---|---|---|
| Resolved | | Gilles | T121388 Service-based thumbnailing re-architecture in production with Thumbor |
| Resolved | | Gilles | T151066 Implement PoolCounter support in Thumbor |
| Resolved | | Gilles | T169313 Investigate poolcounter failure leading to thumbor failing to generate thumbs |
Event Timeline
It seems that the issue simply came from the poolcounter host having a valid IP but being unreachable, so a connection to the poolcounter port could not be opened.
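For illustration, a minimal sketch of what fail-open behaviour could look like. This is not thumbor's actual code: `run_with_optional_lock`, its parameters, and the 0.5 second timeout are hypothetical, and the PoolCounter protocol exchange itself is left out. The idea is only that a connection attempt bounded by a short timeout turns an unreachable host into "skip throttling" rather than a failed request.

```python
import socket


def run_with_optional_lock(work, host, port, timeout=0.5,
                           connect=socket.create_connection):
    """Run work() and return (result, throttled).

    Hypothetical helper: if the lock service cannot be reached within
    `timeout` seconds, fail open and run the work without throttling.
    """
    try:
        conn = connect((host, port), timeout=timeout)
    except OSError:
        # OSError covers timeouts, refused connections and unroutable
        # hosts, so a valid but unreachable IP ends up here.
        return work(), False
    try:
        # A real client would acquire and release the lock over `conn`
        # here; that exchange is omitted from this sketch.
        return work(), True
    finally:
        conn.close()
```

The `connect` parameter exists only so the unreachable case can be exercised without a network. With the default, the worst case adds `timeout` seconds to a request instead of failing it.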