Review Logstash/Kibana's architecture and installation and identify next steps and gaps to be addressed.
Description
Description
Status | Subtype | Assigned | Task | ||
---|---|---|---|---|---|
Resolved | fgiunchedi | T198753 Modernize logging, alerting and metrics monitoring infrastructure - Adopt Logstash (2018-19 Q1 Goal) | |||
Resolved | fgiunchedi | T198754 Logstash/Kibana architecture review | |||
Resolved | fgiunchedi | T63785 Improve HA for logstash cluster |
Event Timeline
Comment Actions
On Friday 6th, we had a meeting with Infrastructure Foundations and former logstash maintainers (e.g. Search, Bryan Davis) to go over the current architecture and pain points. Notes at https://etherpad.wikimedia.org/p/logstash-sre-q1-fy2018-2019 and I'm summarizing below:
- One of the biggest hurdles is explosion of indices due to logstash mapping every json field found in logs (T180051)
- ApiFeatureUsage uses logstash and its logs end up in cirrus ES cluster instead
- The upgrade to ES 6 will remove mapping types, we'll need to put some thought on how to address that
- Current architecture is four years old, nowadays we'd probably use some queues like kafka
- ApiFeatureUsage has some tech debt, namely it can block because it outputs to cirrussearch ES cluster and would need a buffer inbetween instead
Comment Actions
Non exhaustive list of things that we'll need to address:
- More insight into logstash/kibana activity via prometheus metrics (elasticsearch already has prometheus metrics)
- udp2log input AFAICT receives data from mwlog for iegreview/scholarships/scap, I think we can deprecate this input and port said applications to other logging transports (e.g. for scap T86969)
Comment Actions
In particular for logstash we'll also need to add suitable id configuration to tell different components apart in the monitoring API (e.g. https://www.elastic.co/guide/en/logstash/current/plugins-filters-grok.html#plugins-filters-grok-id)
Comment Actions
The architecture and gaps review has been carried out as part of the logging infrastructure design document