Error handling paths tend to be less tested, which often leads to surprises and outages. To counteract this, we need to be especially thorough in integration testing, both before deployment, as well as during normal operation.
To ensure this happens, we should write up guidelines describing best practices, describe testing processes, and pointing out classic gotchas & ways to avoid them.