We received an alert this morning about the cephosd Ceph cluster being in a warning state.
Checking the status:
btullis@cephosd1001:~$ sudo ceph -s
  cluster:
    id:     6d4278e1-ea45-4d29-86fe-85b44c150813
    health: HEALTH_WARN
            1 clients failing to respond to capability release
            1 MDSs report slow requests

  services:
    mon: 5 daemons, quorum cephosd1001,cephosd1002,cephosd1003,cephosd1004,cephosd1005 (age 3M)
    mgr: cephosd1003(active, since 8d), standbys: cephosd1002, cephosd1005, cephosd1004, cephosd1001
    mds: 3/3 daemons up, 2 standby
    osd: 100 osds: 100 up (since 3M), 100 in (since 10M)
    rgw: 5 daemons active (5 hosts, 1 zones)

  data:
    volumes: 3/3 healthy
    pools:   17 pools, 4481 pgs
    objects: 801.74k objects, 520 GiB
    usage:   30 TiB used, 1.1 PiB / 1.1 PiB avail
    pgs:     4481 active+clean

  io:
    client:   57 KiB/s rd, 448 KiB/s wr, 58 op/s rd, 82 op/s wr

One MDS (the metadata server daemon for CephFS) reports slow requests, and one client is failing to respond to capability release.
We should investigate this.
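
As a starting point, here is a minimal sketch of a first triage pass. It is not a record of what was actually run on cephosd. The function name `mds_triage` and the `CEPH` override variable are made up for this example, and the MDS daemon name has to be taken from the `ceph health detail` or `ceph fs status` output. Depending on the Ceph release, the last two commands may need to be issued as `ceph daemon mds.<name> ...` on the MDS host instead of via `ceph tell`.

```shell
#!/bin/sh
# Hypothetical helper, not part of Ceph.
# Set CEPH=echo to print the commands instead of running them.
CEPH="${CEPH:-sudo ceph}"

# Run a first triage pass against one MDS daemon.
# $1 is the MDS daemon name (placeholder; look it up first).
mds_triage() {
    mds="$1"
    # Shows which MDS and which client the two warnings refer to.
    $CEPH health detail
    # Shows which daemon holds which rank for each filesystem.
    $CEPH fs status
    # Lists the requests currently in flight on that MDS.
    $CEPH tell "mds.$mds" dump_ops_in_flight
    # Lists client sessions, to match the client id from health detail.
    $CEPH tell "mds.$mds" session ls
}
```

Calling `mds_triage <daemon-name>` on a host with an admin keyring runs the four commands in order. The two warnings are often related, since a client that holds on to capabilities can block requests on the MDS, but that is an assumption to verify here rather than a conclusion.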
