Upgrade Cassandra clusters to v4.1.7
Closed, Resolved · Public

Description

  • aqs
  • cassandra-dev
  • ml-cache
  • restbase
  • sessionstore
Post-upgrade
  • Copy 4.1.7 packages to component/cassandra41
  • Update dev target versions to 4.x

CHANGELOG

v4.1.7

Merged from 4.0:

v4.1.6

Merged from 4.0:

Merged from 3.0:


Configuration files w/ changes (minor)

  • conf/commitlog_archiving.properties
  • conf/jvm-server.options
  • conf/hotspot_compiler
  • conf/credentials.sample

See: https://gerrit.wikimedia.org/r/1100549

Event Timeline

Eevans triaged this task as Medium priority. Nov 20 2024, 9:04 PM
Eevans added a project: Cassandra.
Reedy renamed this task from "Upgrade Casssandra clusters to v4.1.7" to "Upgrade Cassandra clusters to v4.1.7". Nov 21 2024, 5:05 AM

Cassandra 4.1.7 has been uploaded to the apt repo's cassandradev component.

Change #1100549 had a related patch set uploaded (by Eevans; author: Eevans):

[operations/puppet@production] cassandra: configurations merged from upstream 4.1.7

https://gerrit.wikimedia.org/r/1100549

Change #1100859 had a related patch set uploaded (by Eevans; author: Eevans):

[operations/puppet@production] cassandra_dev: upgrade Cassandra to 'dev' (aka 4.1.7)

https://gerrit.wikimedia.org/r/1100859

Change #1100860 had a related patch set uploaded (by Eevans; author: Eevans):

[operations/puppet@production] cassandra: pin 'dev' to cassandra 4.1.7

https://gerrit.wikimedia.org/r/1100860

Change #1100549 merged by Eevans:

[operations/puppet@production] cassandra: configurations merged from upstream 4.1.7

https://gerrit.wikimedia.org/r/1100549

Change #1100860 merged by Eevans:

[operations/puppet@production] cassandra: pin 'dev' to cassandra 4.1.7

https://gerrit.wikimedia.org/r/1100860

Change #1100859 merged by Eevans:

[operations/puppet@production] cassandra_dev: upgrade Cassandra to 'dev' (aka 4.1.7)

https://gerrit.wikimedia.org/r/1100859

Change #1101576 had a related patch set uploaded (by Eevans; author: Eevans):

[operations/puppet@production] aqs1010: canary Cassandra 4.1.7

https://gerrit.wikimedia.org/r/1101576

Change #1101576 merged by Eevans:

[operations/puppet@production] aqs1010: canary Cassandra 4.1.7

https://gerrit.wikimedia.org/r/1101576

Mentioned in SAL (#wikimedia-operations) [2024-12-09T19:58:49Z] <eevans@cumin1002> START - Cookbook sre.cassandra.roll-restart for nodes matching aqs1010.eqiad.wmnet: Upgrading to Cassandra 4.1.7 — T380420 - eevans@cumin1002

Mentioned in SAL (#wikimedia-operations) [2024-12-09T20:07:27Z] <eevans@cumin1002> END (PASS) - Cookbook sre.cassandra.roll-restart (exit_code=0) for nodes matching aqs1010.eqiad.wmnet: Upgrading to Cassandra 4.1.7 — T380420 - eevans@cumin1002

Change #1102377 had a related patch set uploaded (by Eevans; author: Eevans):

[operations/puppet@production] aqs: Upgrade Cassandra to 4.1.7

https://gerrit.wikimedia.org/r/1102377

Icinga downtime and Alertmanager silence (ID=1e594cac-6111-4c7c-8e83-1c9bdf32ac87) set by eevans@cumin1002 for 7 days, 0:00:00 on 1 host(s) and their services with reason: Hardware replacement

aqs1014.eqiad.wmnet

Change #1102377 merged by Eevans:

[operations/puppet@production] aqs: Upgrade Cassandra to 4.1.7

https://gerrit.wikimedia.org/r/1102377

Mentioned in SAL (#wikimedia-operations) [2024-12-12T14:55:37Z] <eevans@cumin1002> START - Cookbook sre.cassandra.roll-restart for nodes matching A:aqs-codfw: Upgrading to Cassandra 4.1.7 — T380420 - eevans@cumin1002

Mentioned in SAL (#wikimedia-operations) [2024-12-12T15:03:28Z] <eevans@cumin1002> END (ERROR) - Cookbook sre.cassandra.roll-restart (exit_code=97) for nodes matching A:aqs-codfw: Upgrading to Cassandra 4.1.7 — T380420 - eevans@cumin1002

Mentioned in SAL (#wikimedia-operations) [2024-12-12T15:09:25Z] <eevans@cumin1002> START - Cookbook sre.cassandra.roll-restart for nodes matching A:aqs-codfw: Upgrading to Cassandra 4.1.7 — T380420 - eevans@cumin1002

Mentioned in SAL (#wikimedia-operations) [2024-12-12T16:37:44Z] <eevans@cumin1002> END (PASS) - Cookbook sre.cassandra.roll-restart (exit_code=0) for nodes matching A:aqs-codfw: Upgrading to Cassandra 4.1.7 — T380420 - eevans@cumin1002

Mentioned in SAL (#wikimedia-operations) [2024-12-12T16:52:47Z] <eevans@cumin1002> START - Cookbook sre.cassandra.roll-restart for nodes matching A:aqs-eqiad: Upgrading to Cassandra 4.1.7 — T380420 - eevans@cumin1002

Mentioned in SAL (#wikimedia-operations) [2024-12-12T18:22:28Z] <eevans@cumin1002> END (PASS) - Cookbook sre.cassandra.roll-restart (exit_code=0) for nodes matching A:aqs-eqiad: Upgrading to Cassandra 4.1.7 — T380420 - eevans@cumin1002

@klausman is ml-cache in use? How would you like to proceed here? Do you want me to upgrade?

Change #1104716 had a related patch set uploaded (by Eevans; author: Eevans):

[operations/puppet@production] restbase1028: canary Cassandra 4.1.7

https://gerrit.wikimedia.org/r/1104716

Change #1104716 merged by Eevans:

[operations/puppet@production] restbase1028: canary Cassandra 4.1.7

https://gerrit.wikimedia.org/r/1104716

Mentioned in SAL (#wikimedia-operations) [2024-12-16T19:07:12Z] <eevans@cumin1002> START - Cookbook sre.cassandra.roll-restart for nodes matching restbase1028.eqiad.wmnet: Upgrading to Cassandra 4.1.7 — T380420 - eevans@cumin1002

Mentioned in SAL (#wikimedia-operations) [2024-12-16T19:16:34Z] <eevans@cumin1002> END (PASS) - Cookbook sre.cassandra.roll-restart (exit_code=0) for nodes matching restbase1028.eqiad.wmnet: Upgrading to Cassandra 4.1.7 — T380420 - eevans@cumin1002

Change #1105011 had a related patch set uploaded (by Eevans; author: Eevans):

[operations/puppet@production] restbase: Upgrade Cassandra to 4.1.7

https://gerrit.wikimedia.org/r/1105011

Change #1105011 merged by Eevans:

[operations/puppet@production] restbase: Upgrade Cassandra to 4.1.7

https://gerrit.wikimedia.org/r/1105011

Mentioned in SAL (#wikimedia-operations) [2024-12-17T15:30:42Z] <eevans@cumin1002> START - Cookbook sre.cassandra.roll-restart for nodes matching A:restbase-eqiad: Upgrading to Cassandra 4.1.7 — T380420 - eevans@cumin1002

Mentioned in SAL (#wikimedia-operations) [2024-12-17T17:46:05Z] <eevans@cumin1002> END (PASS) - Cookbook sre.cassandra.roll-restart (exit_code=0) for nodes matching A:restbase-eqiad: Upgrading to Cassandra 4.1.7 — T380420 - eevans@cumin1002

Mentioned in SAL (#wikimedia-operations) [2024-12-17T18:11:25Z] <eevans@cumin1002> START - Cookbook sre.cassandra.roll-restart for nodes matching A:restbase-codfw: Upgrading to Cassandra 4.1.7 — T380420 - eevans@cumin1002

Mentioned in SAL (#wikimedia-operations) [2024-12-17T20:23:41Z] <eevans@cumin1002> END (PASS) - Cookbook sre.cassandra.roll-restart (exit_code=0) for nodes matching A:restbase-codfw: Upgrading to Cassandra 4.1.7 — T380420 - eevans@cumin1002

Change #1105386 had a related patch set uploaded (by Eevans; author: Eevans):

[operations/puppet@production] sessionstore: Upgrade Cassandra to v4.1.7

https://gerrit.wikimedia.org/r/1105386

Change #1105386 merged by Eevans:

[operations/puppet@production] sessionstore: Upgrade Cassandra to v4.1.7

https://gerrit.wikimedia.org/r/1105386

Mentioned in SAL (#wikimedia-operations) [2024-12-18T17:32:33Z] <eevans@cumin1002> START - Cookbook sre.cassandra.roll-restart for nodes matching sessionstore2*.codfw.wmnet: Upgrading to Cassandra 4.1.7 — T380420 - eevans@cumin1002

Mentioned in SAL (#wikimedia-operations) [2024-12-18T17:50:20Z] <eevans@cumin1002> END (PASS) - Cookbook sre.cassandra.roll-restart (exit_code=0) for nodes matching sessionstore2*.codfw.wmnet: Upgrading to Cassandra 4.1.7 — T380420 - eevans@cumin1002

Mentioned in SAL (#wikimedia-operations) [2024-12-18T18:00:17Z] <eevans@cumin1002> START - Cookbook sre.cassandra.roll-restart for nodes matching sessionstore1*.eqiad.wmnet: Upgrading to Cassandra 4.1.7 — T380420 - eevans@cumin1002

Mentioned in SAL (#wikimedia-operations) [2024-12-18T18:18:09Z] <eevans@cumin1002> END (PASS) - Cookbook sre.cassandra.roll-restart (exit_code=0) for nodes matching sessionstore1*.eqiad.wmnet: Upgrading to Cassandra 4.1.7 — T380420 - eevans@cumin1002

Eevans changed the task status from Open to Stalled. Dec 18 2024, 7:33 PM
Eevans updated the task description.

With the completion of sessionstore, the only remaining cluster is ml-cache, which I think will have to wait until after the holidays.

@klausman ping! Would you like me to upgrade ml-cache to 4.1.7?

Yes, please go ahead! The machines are currently not in use.

Change #1109750 had a related patch set uploaded (by Eevans; author: Eevans):

[operations/puppet@production] ml-cache: upgrade Cassandra to 4.1.7

https://gerrit.wikimedia.org/r/1109750

Change #1109750 merged by Eevans:

[operations/puppet@production] ml-cache: upgrade Cassandra to 4.1.7

https://gerrit.wikimedia.org/r/1109750

Mentioned in SAL (#wikimedia-operations) [2025-01-10T19:23:33Z] <eevans@cumin1002> START - Cookbook sre.cassandra.roll-restart for nodes matching A:ml-cache-eqiad: Upgrading to Cassandra 4.1.7 — T380420 - eevans@cumin1002

Mentioned in SAL (#wikimedia-operations) [2025-01-10T19:41:15Z] <eevans@cumin1002> END (PASS) - Cookbook sre.cassandra.roll-restart (exit_code=0) for nodes matching A:ml-cache-eqiad: Upgrading to Cassandra 4.1.7 — T380420 - eevans@cumin1002

Mentioned in SAL (#wikimedia-operations) [2025-01-10T19:43:29Z] <eevans@cumin1002> START - Cookbook sre.cassandra.roll-restart for nodes matching A:ml-cache-codfw: Upgrading to Cassandra 4.1.7 — T380420 - eevans@cumin1002

Mentioned in SAL (#wikimedia-operations) [2025-01-10T20:01:13Z] <eevans@cumin1002> END (PASS) - Cookbook sre.cassandra.roll-restart (exit_code=0) for nodes matching A:ml-cache-codfw: Upgrading to Cassandra 4.1.7 — T380420 - eevans@cumin1002

Change #1109767 had a related patch set uploaded (by Eevans; author: Eevans):

[operations/puppet@production] cassandra: rotate target_version 'dev' to '4.x'

https://gerrit.wikimedia.org/r/1109767

Change #1109768 had a related patch set uploaded (by Eevans; author: Eevans):

[operations/puppet@production] cassandra: set target_dev to 4.x (no-op)

https://gerrit.wikimedia.org/r/1109768

Change #1109767 merged by Eevans:

[operations/puppet@production] cassandra: rotate target_version 'dev' to '4.x'

https://gerrit.wikimedia.org/r/1109767

Change #1109768 merged by Eevans:

[operations/puppet@production] cassandra: set target_dev to 4.x (no-op)

https://gerrit.wikimedia.org/r/1109768

Mentioned in SAL (#wikimedia-operations) [2025-01-15T16:58:33Z] <eevans@cumin1002> START - Cookbook sre.cassandra.roll-restart for nodes matching A:cassandra-dev: Upgrading to Cassandra 4.1.7 — T380420 - eevans@cumin1002

Mentioned in SAL (#wikimedia-operations) [2025-01-15T17:18:44Z] <eevans@cumin1002> END (PASS) - Cookbook sre.cassandra.roll-restart (exit_code=0) for nodes matching A:cassandra-dev: Upgrading to Cassandra 4.1.7 — T380420 - eevans@cumin1002

Eevans claimed this task.

Done!!

Mentioned in SAL (#wikimedia-operations) [2025-02-20T17:29:51Z] <eevans@cumin1002> START - Cookbook sre.cassandra.roll-restart for nodes matching A:ml-cache-eqiad: Upgrading to Cassandra 4.1.7 — T380420 - eevans@cumin1002

FYI, this was (apparently) a PEBKAC of some sort while on the cumin host (a command from my shell history re-ran?). The cluster nodes were restarted, but not to apply 4.1.7 (that has already happened). Apologies for the noise.

Mentioned in SAL (#wikimedia-operations) [2025-02-20T17:47:33Z] <eevans@cumin1002> END (PASS) - Cookbook sre.cassandra.roll-restart (exit_code=0) for nodes matching A:ml-cache-eqiad: Upgrading to Cassandra 4.1.7 — T380420 - eevans@cumin1002
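
The SAL entries in this timeline follow a regular shape (timestamp, user@host, START or END with a result, cookbook name, exit code, node query), so the length of each roll-restart can be recovered from them. The following is a minimal sketch in Python, assuming a line format inferred only from the entries in this task; the regular expression and the helper names (parse_sal, run_durations) are illustrative and not part of any Wikimedia tooling. The sample lines are the aqs-codfw entries from 2024-12-12, truncated after the reason text.

```python
import re
from datetime import datetime, timezone

# Pattern inferred from the SAL lines in this task (an assumption, not a format spec).
SAL_RE = re.compile(
    r"\[(?P<ts>[^\]]+)\] <(?P<who>[^>]+)> "
    r"(?P<phase>START|END)(?: \((?P<result>[A-Z]+)\))? - "
    r"Cookbook (?P<cookbook>\S+)"
    r"(?: \(exit_code=(?P<exit_code>\d+)\))? "
    r"for nodes matching (?P<query>.+?): "
)


def parse_sal(line):
    """Return a dict of fields for a cookbook SAL line, or None if it does not match."""
    match = SAL_RE.search(line)
    if match is None:
        return None
    entry = match.groupdict()
    entry["ts"] = datetime.strptime(entry["ts"], "%Y-%m-%dT%H:%M:%SZ").replace(
        tzinfo=timezone.utc
    )
    if entry["exit_code"] is not None:
        entry["exit_code"] = int(entry["exit_code"])
    return entry


def run_durations(lines):
    """Pair each END with the most recent START for the same node query."""
    open_runs = {}
    runs = []
    for line in lines:
        entry = parse_sal(line)
        if entry is None:
            continue
        if entry["phase"] == "START":
            # A repeated START for the same query simply overwrites the earlier one.
            open_runs[entry["query"]] = entry["ts"]
        elif entry["query"] in open_runs:
            started = open_runs.pop(entry["query"])
            runs.append(
                (entry["query"], entry["result"], entry["exit_code"], entry["ts"] - started)
            )
    return runs


SAMPLE = [
    "Mentioned in SAL (#wikimedia-operations) [2024-12-12T14:55:37Z] <eevans@cumin1002> START - Cookbook sre.cassandra.roll-restart for nodes matching A:aqs-codfw: Upgrading to Cassandra 4.1.7",
    "Mentioned in SAL (#wikimedia-operations) [2024-12-12T15:03:28Z] <eevans@cumin1002> END (ERROR) - Cookbook sre.cassandra.roll-restart (exit_code=97) for nodes matching A:aqs-codfw: Upgrading to Cassandra 4.1.7",
    "Mentioned in SAL (#wikimedia-operations) [2024-12-12T15:09:25Z] <eevans@cumin1002> START - Cookbook sre.cassandra.roll-restart for nodes matching A:aqs-codfw: Upgrading to Cassandra 4.1.7",
    "Mentioned in SAL (#wikimedia-operations) [2024-12-12T16:37:44Z] <eevans@cumin1002> END (PASS) - Cookbook sre.cassandra.roll-restart (exit_code=0) for nodes matching A:aqs-codfw: Upgrading to Cassandra 4.1.7",
]

if __name__ == "__main__":
    for query, result, exit_code, elapsed in run_durations(SAMPLE):
        print(f"{query}: {result} (exit_code={exit_code}) after {elapsed}")
```

Pairing by node query keeps the failed first attempt on aqs-codfw and its successful retry as two separate runs, which is how they appear in the timeline.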