We have flapping Icinga monitoring for recommendation_api on scb machines a lot. The pattern is always:
18:53 < icinga-wm> PROBLEM - recommendation_api endpoints health on scb1003 is CRITICAL: /{domain}/v1/translation/articles/{source}{/seed} (normal source and target) is CRITICAL: Test normal source and target returned the unexpected status 429 (expecting: 200) 18:54 < icinga-wm> RECOVERY - recommendation_api endpoints health on scb1003 is OK: All endpoints are healthy
So it gets 429 for a while and then back to 200. It's like the service is rate-limiting the Icinga check.