Page MenuHomePhabricator

Create general guidelines & processes to ensure thorough fault testing of services
Closed, DeclinedPublic

Description

Error handling paths tend to be less tested, which often leads to surprises and outages. To counteract this, we need to be especially thorough in integration testing, both before deployment, as well as during normal operation.

To ensure this happens, we should write up guidelines describing best practices, describe testing processes, and pointing out classic gotchas & ways to avoid them.

Event Timeline

GWicke triaged this task as Medium priority.Jun 8 2016, 8:43 PM

This follow-up task from an incident report has not been updated recently. If it is no longer valid, please add a comment explaining why. If it is still valid, please prioritize it appropriately relative to your other work. If you have any questions, feel free to ask me (Greg Grossmeier).

Pchelolo subscribed.

Yeah, that is probably good enough. We will look again at documentation after k8s migration is complete. This one can be closed for now.