Setup monitoring for database servers in beta cluster
Open, Stalled, LowPublic
Actions

Assigned To

None

Authored By

	yuvipanda
	Jan 17 2015, 10:09 AM

Description

part of making sure that everything in the MW pipeline is monitored in beta as well.

Related Objects
Search...

Status	Assigned	Task
Open	None	T53494 Use Beta cluster as a true canary for code deployments (epic)
Stalled	None	T53497 Setup monitoring for Beta Cluster (tracking)
Stalled	None	T87093 Setup monitoring for database servers in beta cluster

Event Timeline

yuvipanda created this task.Jan 17 2015, 10:09 AM

yuvipanda claimed this task.

yuvipanda raised the priority of this task from to Medium.

yuvipanda updated the task description. (Show Details)

yuvipanda added a project: Beta-Cluster-Infrastructure.

yuvipanda added subscribers: greg, scfc, Krinkle and 8 others.

Restricted Application added a subscriber: Aklapper. · View Herald TranscriptJan 17 2015, 10:09 AM

greg moved this task from To Triage to Next: Maintenance on the Beta-Cluster-Infrastructure board.Jan 29 2015, 5:37 AM

greg renamed this task from Setup monitoring for database servers in betalabs to Setup monitoring for database servers in beta cluster.Mar 10 2015, 8:47 PM

greg set Security to None.

From T97120

The beta cluster MySQL servers turned out to be down for a few hours (T96905) and there is no monitoring for it.

We would need on both instances (deployment-db1 and deployment-db2) a check to ensure the mysql process is running.

The command line looks like:

/usr/sbin/mysqld --basedir=/usr --datadir=/mnt/sqldata \
  --plugin-dir=/usr/lib/mysql/plugin --user=mysql \
  --log-error=/mnt/sqldata/deployment-db1.err \
  --pid-file=/mnt/sqldata/deployment-db1.pid \
  --socket=/tmp/mysql.sock --port=3306

I guess we can just monitor whether /usr/bin/mysqld is present.

yuvipanda removed yuvipanda as the assignee of this task.Jun 7 2015, 4:51 PM

Per beta cluster weekly triage:

The MySQL databases only got down a couple times over 4 years and we quickly noticed it when it happened. Lack of monitoring is surely annoying but is not that much of a big deal, hence lowering priority.

yuvipanda unsubscribed.Jun 15 2015, 8:20 PM

hashar changed the task status from Open to Stalled.Oct 30 2015, 10:51 PM

greg moved this task from Next: Maintenance to Backlog on the Beta-Cluster-Infrastructure board.Aug 5 2016, 8:56 PM

The previous comments don't explain what/who exactly this task is stalled on ("If a report is waiting for further input (e.g. from its reporter or a third party) and can currently not be acted on"). Hence resetting task status.

If this task should not be worked on and fixing this is not worth the efforts, then task status should have the "Declined" status.)

Reflecting reality of team resourcing.

Setup monitoring for database servers in beta clusterOpen, Stalled, LowPublicActions

Description

Related ObjectsSearch...

Event Timeline

Setup monitoring for database servers in beta cluster
Open, Stalled, LowPublic
Actions

Related Objects
Search...