Page MenuHomePhabricator

evaluate new drives in cloudcephosd102[123]
Closed, ResolvedPublic

Description

These hosts have new drives, let's learn what we can about whether they are now better or worse.

Related Objects

StatusSubtypeAssignedTask
Resolveddcaro
Resolvedtaavi
Resolveddcaro

Event Timeline

Andrew triaged this task as Medium priority.Feb 19 2025, 4:10 PM

I did a first check of the current values for the smartcl reported counters, all look good so far (no more Offline_Uncorrectable_Errors), there's one drive with some Command_Timeout, but it's very low, so no worries.

I'm going to try to have a look at some performance metrics, to see if at least there's no regression there either.

Just checked the number of operations/s (as a proxy for performance):

  • For cloudcephosd1021, comparing with 1018, there's a bit of an improvement:

image.png (727×1 px, 238 KB)

  • For cloudcephosd1022, comparing (more or less) with 1018, things seem to be like before (no improvement, but not worse):

image.png (672×1 px, 240 KB)

  • For cloudcephosd1023, compared (more or less) with 1020, more or less the same (difference gravitating towards 0):

image.png (727×1 px, 265 KB)

So I think it's ok to keep these in prod :)