Have a way to monitor the crash rate and alert for sharp increases
Closed, DeclinedPublic
Actions

Assigned To

None

Authored By

	• Mholloway
	Sep 23 2016, 5:07 PM

Description

Follow-up from the unmarshalling crash incident: T144990: [CRASH] Content Service shouldn't send empty objects

We should find a way to monitor crash spikes so that we don't have a situation like T144940 where the outage goes on for hours before we hear about it. Unless I'm missing something, HockeyApp doesn't offer anything like this; the most it will do is email crash reports as they come in.

I can imagine standing up a simple service, maybe in tool labs, that would receive crash events at the same time as they're sent to HockeyApp and alert us however we want if the rate increases above a certain threshold.

Other ideas?

Related Objects

Mentioned Here: T117378: [Product Owner] Add crash event logging
T144940: (Community Rev) StartCE Insights testing - (Due Oct 21)
T144990: [CRASH] Content Service shouldn't send empty objects

Event Timeline

• Mholloway created this task.Sep 23 2016, 5:07 PM

Restricted Application added a subscriber: Aklapper. · View Herald TranscriptSep 23 2016, 5:07 PM

Dbrant moved this task from Needs Triage to Tech Debt Backlog on the Wikipedia-Android-App-Backlog board.Sep 26 2016, 12:42 PM

• Mholloway triaged this task as High priority.Sep 30 2016, 8:36 PM

• Niedzielski added a project: Technical-Debt.Nov 9 2016, 8:49 PM

I think Grafana provides email alerts. If T117378 is implemented, this would be easy.

Hi @Mholloway @Dbrant @Niedzielski ,

Have you tried exploring Crashlytics yet?

We've been using it in our codebase (>50 modules and around 0.13 million users) for more than 2 years now and it has been really helpful. Comes at zero cost and provides a lot of features such as alerts, attaching a good amount of useful information with each crash (or even non-fatal errors) and offering various other crash insights.

If you guys like it, I am ready to volunteer for integrating it in the current android app :)

Thanks, but HockeyApp has been perfectly adequate so far (aside from spike alerts). Switching to another provider for crash logging would necessitate a legal/privacy review, for which we currently don't have the bandwidth or necessity. As for general analytics, we prefer (and are basically required to) keep them in-house.

Jcasariego subscribed.Dec 26 2017, 8:17 PM

• Charlotte closed this task as Declined.Apr 9 2019, 5:26 PM

Have a way to monitor the crash rate and alert for sharp increasesClosed, DeclinedPublicActions

Description

Related Objects

Event Timeline

Have a way to monitor the crash rate and alert for sharp increases
Closed, DeclinedPublic
Actions