For example:
May 01, 2017 16:00 Service Ok[2017-05-01 16:27:32] SERVICE ALERT: db1092;MariaDB Slave Lag: s5;OK;HARD;10;OK slave_sql_lag Replication lag: 0.00 seconds Service Warning[2017-05-01 16:25:32] SERVICE ALERT: db1092;MariaDB Slave Lag: s5;WARNING;HARD;10;WARNING slave_sql_lag Replication lag: 216.50 seconds Program Start[2017-05-01 16:21:20] Icinga 1.11.6 starting... (PID=48297) Program Restart[2017-05-01 16:21:19] Caught SIGHUP, restarting... Service Ok[2017-05-01 16:15:19] SERVICE ALERT: db1092;MariaDB Slave IO: s5;OK;HARD;3;OK slave_io_state Slave_IO_Running: Yes May 01, 2017 15:00 Service Critical[2017-05-01 15:56:29] SERVICE ALERT: db1092;MariaDB Slave Lag: s5;CRITICAL;HARD;10;CRITICAL slave_sql_lag Replication lag: 639.81 seconds Service Critical[2017-05-01 15:55:29] SERVICE ALERT: db1092;MariaDB Slave Lag: s5;CRITICAL;SOFT;9;CRITICAL slave_sql_lag Replication lag: 579.44 seconds Service Critical[2017-05-01 15:54:29] SERVICE ALERT: db1092;MariaDB Slave Lag: s5;CRITICAL;SOFT;8;CRITICAL slave_sql_lag Replication lag: 519.56 seconds Service Critical[2017-05-01 15:53:29] SERVICE ALERT: db1092;MariaDB Slave Lag: s5;CRITICAL;SOFT;7;CRITICAL slave_sql_lag Replication lag: 459.30 seconds Service Critical[2017-05-01 15:52:29] SERVICE ALERT: db1092;MariaDB Slave Lag: s5;CRITICAL;SOFT;6;CRITICAL slave_sql_lag Replication lag: 399.35 seconds Service Critical[2017-05-01 15:51:29] SERVICE ALERT: db1092;MariaDB Slave Lag: s5;CRITICAL;SOFT;5;CRITICAL slave_sql_lag Replication lag: 339.26 seconds Service Warning[2017-05-01 15:50:29] SERVICE ALERT: db1092;MariaDB Slave Lag: s5;WARNING;SOFT;4;WARNING slave_sql_lag Replication lag: 279.25 seconds Service Warning[2017-05-01 15:49:29] SERVICE ALERT: db1092;MariaDB Slave Lag: s5;WARNING;SOFT;3;WARNING slave_sql_lag Replication lag: 219.23 seconds Service Critical[2017-05-01 15:49:19] SERVICE ALERT: db1092;MariaDB Slave IO: s5;CRITICAL;HARD;3;CRITICAL slave_io_state Slave_IO_Running: No, Errno: 2003, Errmsg: error reconnecting to master 'repl@db1063.eqiad.wmnet:3306' - retry-time: 60 maximum-retries: 86400 message: Can't connect to MySQL server on 'db1063.eqiad.wmnet' (111 "Connection refused") Service Warning[2017-05-01 15:48:29] SERVICE ALERT: db1092;MariaDB Slave Lag: s5;WARNING;SOFT;2;WARNING slave_sql_lag Replication lag: 159.44 seconds Service Critical[2017-05-01 15:48:19] SERVICE ALERT: db1092;MariaDB Slave IO: s5;CRITICAL;SOFT;2;CRITICAL slave_io_state Slave_IO_Running: No, Errno: 2003, Errmsg: error reconnecting to master 'repl@db1063.eqiad.wmnet:3306' - retry-time: 60 maximum-retries: 86400 message: Can't connect to MySQL server on 'db1063.eqiad.wmnet' (111 "Connection refused") Service Warning[2017-05-01 15:47:29] SERVICE ALERT: db1092;MariaDB Slave Lag: s5;WARNING;SOFT;1;WARNING slave_sql_lag Replication lag: 99.46 seconds Service Critical[2017-05-01 15:47:19] SERVICE ALERT: db1092;MariaDB Slave IO: s5;CRITICAL;SOFT;1;CRITICAL slave_io_state Slave_IO_Running: No, Errno: 2003, Errmsg: error reconnecting to master 'repl@db1063.eqiad.wmnet:3306' - retry-time: 60 maximum-retries: 86400 message: Can't connect to MySQL server on 'db1063.eqiad.wmnet' (111 "Connection refused") Service entered a period of scheduled downtime[2017-05-01 15:44:54] SERVICE DOWNTIME ALERT: db1092;MariaDB Slave SQL: s5;STARTED; Service has entered a period of scheduled downtime Service entered a period of scheduled downtime[2017-05-01 15:44:54] SERVICE DOWNTIME ALERT: db1092;MariaDB Slave Lag: s5;STARTED; Service has entered a period of scheduled downtime Service entered a period of scheduled downtime[2017-05-01 15:44:54] SERVICE DOWNTIME ALERT: db1092;MariaDB Slave IO: s5;STARTED; Service has entered a period of scheduled downtime
Despite the scheduled downtime until 2017-05-03, at 16:21:32 UTC, we get
<icinga-wm> PROBLEM - MariaDB Slave Lag: s5 on db1092 is CRITICAL: CRITICAL slave_sql_lag Replication lag: 732.04 seconds
While not as bad as not sending alerts when they happen, this is very annoying and can cause bigger issues due to the false positives. This has been happening at least for a week.