Based on my understanding of the task, the only viable way to test it appears to be implementing it in a live environment. While this may seem spooky, I don't think we have another environment that can simulate this well enough to test the sampling.
Jan 26 2024
Jan 19 2024
Jan 17 2024
Jan 16 2024
We are back to regular on-call
Jan 15 2024
batphone enabled for MLK
Jan 12 2024
Hi @colewhite, a friendly reminder that the silence is expiring on 2024-02-01.
Lowering priority due to lack of activity. We can revisit this if it continues to be a pressing matter.
Based on the lack of recent feedback indicating that this issue persists, we have decided to downgrade its severity. By doing so, we can focus our resources on more pressing concerns that require immediate attention.
I'm resolving this one on the understanding that this has already been deployed. Please reopen if that's not the case and there's work pending. Thanks!
Jan 11 2024
Jan 10 2024
Jan 9 2024
Keeping this task open for further snapshots as the project evolves.
Jan 8 2024
In T307958#9390852, @fgiunchedi wrote: This is essentially what https://alerts.wikimedia.org/triage/ displays now, for hide_alerts_older_than: '1200h' alerts. The app also offers the user a button to open a task
Batphone has been removed, and the business-hours on-call rota is enabled again in Splunk on-call.
Dec 23 2023
@MatthewVernon, it does look weird. I think it's just the UI: when I added Batphone to the escalation path instead of the EMEA/Americas rotation, it seems to have expanded the Batphone list as the escalation, with folks shown as on- or off-call according to their individual schedules set within the Batphone rotation.
Dec 20 2023
Dec 14 2023
Dec 6 2023
It seems like this is not a priority, so we'll postpone it for now. We can revisit it when the time comes.
Dec 5 2023
Raising priority based on recent conversations with the team and the intent to address this in the near future as part of risk mitigations to the logging pipeline.
It seems like the issue is resolved.
Nov 29 2023
Nov 28 2023
In T349159#9361905, @Krinkle wrote: Arc Lamp metrics are visualised in Grafana but the alerts are defined in AlertManager, not in Grafana.
Nov 27 2023
Adding to ongoing quarter for visibility.