Page MenuHomePhabricator

Upgrade restbase cluster to Cassandra 4.1.1
Closed, ResolvedPublic

Description

Upgrade restbase to Cassandra 4.1.1


Upgrade steps
  1. Override cassandra:settings in host-specific hiera files (hieradata/hosts/restbase*.yaml) and set:
    1. internode_encryption: all
    2. server_encryption_optional: true
    3. legacy_ssl_storage_port_enabled: true
    4. target_version: '4.x'
  2. sudo rm /etc/cassandra-[a-z]/service-enabled
  3. Merge Puppet changeset
  4. sudo run-puppet-agent
  5. Restart each instance (id=X; sudo touch /etc/cassandra-$id/service-enabled && sudo systemctl restart cassandra-$id)
  6. check-restbase

codfw
  • row b
    • restbase2013.codfw.wmnet
    • restbase2014.codfw.wmnet
    • restbase2019.codfw.wmnet
    • restbase2021.codfw.wmnet
    • restbase2024.codfw.wmnet
  • row c
    • restbase2015.codfw.wmnet
    • restbase2016.codfw.wmnet
    • restbase2020.codfw.wmnet
    • restbase2022.codfw.wmnet
    • restbase2025.codfw.wmnet
  • row d
    • restbase2012.codfw.wmnet
    • restbase2017.codfw.wmnet
    • restbase2018.codfw.wmnet
    • restbase2023.codfw.wmnet
    • restbase2026.codfw.wmnet
    • restbase2027.codfw.wmnet
eqiad
  • row a
    • restbase1016.eqiad.wmnet
    • restbase1019.eqiad.wmnet
    • restbase1020.eqiad.wmnet
    • restbase1021.eqiad.wmnet
    • restbase1028.eqiad.wmnet
    • restbase1031.eqiad.wmnet
  • row b
    • restbase1017.eqiad.wmnet
    • restbase1022.eqiad.wmnet
    • restbase1023.eqiad.wmnet
    • restbase1024.eqiad.wmnet
    • restbase1029.eqiad.wmnet
    • restbase1032.eqiad.wmnet
  • row d
    • restbase1018.eqiad.wmnet
    • restbase1025.eqiad.wmnet
    • restbase1026.eqiad.wmnet
    • restbase1027.eqiad.wmnet
    • restbase1030.eqiad.wmnet
    • restbase1033.eqiad.wmnet

Post-upgrade steps
  • Move per-host hiera settings back to role
  • Set profile::cassandra::monitor_tls_port: 7000
  • Set legacy_ssl_storage_port_enabled: false (remove assignment)
  • Set server_encryption_optional: false (remove assignment)

Details

ProjectBranchLines +/-Subject
operations/puppetproduction+3 -0
operations/puppetproduction+0 -2
operations/puppetproduction+4 -785
operations/puppetproduction+138 -0
operations/puppetproduction+138 -0
operations/puppetproduction+23 -0
operations/puppetproduction+23 -0
operations/puppetproduction+23 -0
operations/puppetproduction+23 -0
operations/puppetproduction+23 -0
operations/puppetproduction+138 -0
operations/puppetproduction+115 -0
operations/puppetproduction+23 -0
operations/puppetproduction+24 -0
operations/puppetproduction+23 -0
operations/puppetproduction+23 -0
operations/puppetproduction+23 -0
operations/puppetproduction+23 -0
Show related patches Customize query in gerrit

Event Timeline

Eevans triaged this task as Medium priority.Jun 15 2023, 8:48 PM
Eevans created this task.

The restbase test suite passes with Cassandra 4.1.1 (see: pull/1327), and deployment-restbase04 has been upgraded as well (afaict everything there is working as expected). If there are no objections, I will plan to begin upgrading the production cluster, perhaps as early as the week of the 31st.

Eevans updated the task description. (Show Details)

Change 944959 had a related patch set uploaded (by Eevans; author: Eevans):

[operations/puppet@production] restbase: Upgrade restbase2013 to Cassandra 4.1.1

https://gerrit.wikimedia.org/r/944959

Change 944960 had a related patch set uploaded (by Eevans; author: Eevans):

[operations/puppet@production] restbase: Upgrade restbase2014 to Cassandra 4.1.1

https://gerrit.wikimedia.org/r/944960

Change 944961 had a related patch set uploaded (by Eevans; author: Eevans):

[operations/puppet@production] restbase: Upgrade restbase2019 to Cassandra 4.1.1

https://gerrit.wikimedia.org/r/944961

Change 944962 had a related patch set uploaded (by Eevans; author: Eevans):

[operations/puppet@production] restbase: Upgrade restbase2021 to Cassandra 4.1.1

https://gerrit.wikimedia.org/r/944962

Change 944963 had a related patch set uploaded (by Eevans; author: Eevans):

[operations/puppet@production] restbase: Upgrade restbase2024 to Cassandra 4.1.1

https://gerrit.wikimedia.org/r/944963

@hnowlan I've posted gerrits for the first 5 machines (codfw, row B).

My thinking was to start with one node (say restbase2013) as a canary, and wait a healthy spell before moving forward (24 hours?) If everything goes well, we could gradually pick up the pace and do the remaining nodes over the next 2-3 days. Does this sounds OK to you? If so, I suggest targeting Monday Aug 7 to begin.

Change 944959 merged by Eevans:

[operations/puppet@production] restbase: Upgrade restbase2013 to Cassandra 4.1.1

https://gerrit.wikimedia.org/r/944959

Mentioned in SAL (#wikimedia-operations) [2023-08-14T14:04:59Z] <urandom> upgrading Cassandra to 4.1.1, restbase2013-{a,b,c} — T339298

Change 948574 had a related patch set uploaded (by Eevans; author: Eevans):

[operations/puppet@production] restbase: Upgrade restbase1016 to Cassandra 4.1.1

https://gerrit.wikimedia.org/r/948574

Change 948575 had a related patch set uploaded (by Eevans; author: Eevans):

[operations/puppet@production] restbase: Upgrade restbase1019 to Cassandra 4.1.1

https://gerrit.wikimedia.org/r/948575

Change 948576 had a related patch set uploaded (by Eevans; author: Eevans):

[operations/puppet@production] restbase: Upgrade restbase1020 to Cassandra 4.1.1

https://gerrit.wikimedia.org/r/948576

Change 948577 had a related patch set uploaded (by Eevans; author: Eevans):

[operations/puppet@production] restbase: Upgrade restbase1021 to Cassandra 4.1.1

https://gerrit.wikimedia.org/r/948577

Change 948578 had a related patch set uploaded (by Eevans; author: Eevans):

[operations/puppet@production] restbase: Upgrade restbase1028 to Cassandra 4.1.1

https://gerrit.wikimedia.org/r/948578

Change 948579 had a related patch set uploaded (by Eevans; author: Eevans):

[operations/puppet@production] restbase: Upgrade restbase1031 to Cassandra 4.1.1

https://gerrit.wikimedia.org/r/948579

Change 948574 merged by Eevans:

[operations/puppet@production] restbase: Upgrade restbase1016 to Cassandra 4.1.1

https://gerrit.wikimedia.org/r/948574

Eevans updated the task description. (Show Details)

Mentioned in SAL (#wikimedia-operations) [2023-08-14T15:36:23Z] <urandom> upgrading Cassandra to 4.1.1, restbase1016-{a,b,c} — T339298

Eevans updated the task description. (Show Details)

Change 944960 merged by Eevans:

[operations/puppet@production] restbase: Upgrade restbase2014 to Cassandra 4.1.1

https://gerrit.wikimedia.org/r/944960

Mentioned in SAL (#wikimedia-operations) [2023-08-14T19:11:11Z] <urandom> upgrading Cassandra to 4.1.1, restbase2014-{a,b,c} — T339298

Change 944961 merged by Eevans:

[operations/puppet@production] restbase: Upgrade restbase2019 to Cassandra 4.1.1

https://gerrit.wikimedia.org/r/944961

Mentioned in SAL (#wikimedia-operations) [2023-08-14T19:24:23Z] <urandom> upgrading Cassandra to 4.1.1, restbase2019-{a,b,c} — T339298

Change 944962 merged by Eevans:

[operations/puppet@production] restbase: Upgrade restbase2021 to Cassandra 4.1.1

https://gerrit.wikimedia.org/r/944962

Mentioned in SAL (#wikimedia-operations) [2023-08-14T19:34:35Z] <urandom> upgrading Cassandra to 4.1.1, restbase2021-{a,b,c} — T339298

Change 944963 merged by Eevans:

[operations/puppet@production] restbase: Upgrade restbase2024 to Cassandra 4.1.1

https://gerrit.wikimedia.org/r/944963

Mentioned in SAL (#wikimedia-operations) [2023-08-14T19:43:07Z] <urandom> upgrading Cassandra to 4.1.1, restbase2024-{a,b,c} — T339298

Change 948654 had a related patch set uploaded (by Eevans; author: Eevans):

[operations/puppet@production] restbase: upgrade to Cassandra 4.1.1, codfw/row C (5 hosts)

https://gerrit.wikimedia.org/r/948654

Change 948654 merged by Eevans:

[operations/puppet@production] restbase: upgrade to Cassandra 4.1.1, codfw/row C (5 hosts)

https://gerrit.wikimedia.org/r/948654

Mentioned in SAL (#wikimedia-operations) [2023-08-14T19:57:02Z] <urandom> upgrading Cassandra to 4.1.1, restbase20[15,16,20,22,25]-{a,b,c} (codfw/row C) — T339298

Change 948657 had a related patch set uploaded (by Eevans; author: Eevans):

[operations/puppet@production] restbase: upgrade to Cassandra 4.1.1, codfw/row D (6 hosts)

https://gerrit.wikimedia.org/r/948657

Change 948657 merged by Eevans:

[operations/puppet@production] restbase: upgrade to Cassandra 4.1.1, codfw/row D (6 hosts)

https://gerrit.wikimedia.org/r/948657

Mentioned in SAL (#wikimedia-operations) [2023-08-14T20:17:10Z] <urandom> upgrading Cassandra to 4.1.1, restbase20[12,17-18,23,26-27]-{a,b,c} (codfw/row C) — T339298

Change 948575 merged by Eevans:

[operations/puppet@production] restbase: Upgrade restbase1019 to Cassandra 4.1.1

https://gerrit.wikimedia.org/r/948575

Change 948576 merged by Eevans:

[operations/puppet@production] restbase: Upgrade restbase1020 to Cassandra 4.1.1

https://gerrit.wikimedia.org/r/948576

Change 948577 merged by Eevans:

[operations/puppet@production] restbase: Upgrade restbase1021 to Cassandra 4.1.1

https://gerrit.wikimedia.org/r/948577

Change 948578 merged by Eevans:

[operations/puppet@production] restbase: Upgrade restbase1028 to Cassandra 4.1.1

https://gerrit.wikimedia.org/r/948578

Change 948579 merged by Eevans:

[operations/puppet@production] restbase: Upgrade restbase1031 to Cassandra 4.1.1

https://gerrit.wikimedia.org/r/948579

Mentioned in SAL (#wikimedia-operations) [2023-08-14T20:33:46Z] <urandom> upgrading Cassandra to 4.1.1, restbase10[19-21,28,31]-{a,b,c} (eqiad/row A) — T339298

Change 948661 had a related patch set uploaded (by Eevans; author: Eevans):

[operations/puppet@production] restbase: upgrade to Cassandra 4.1.1, eqiad/row B (6 hosts)

https://gerrit.wikimedia.org/r/948661

Change 948661 merged by Eevans:

[operations/puppet@production] restbase: upgrade to Cassandra 4.1.1, eqiad/row B (6 hosts)

https://gerrit.wikimedia.org/r/948661

Mentioned in SAL (#wikimedia-operations) [2023-08-14T21:27:43Z] <urandom> upgrading Cassandra to 4.1.1, restbase10[17,22-24,29,32]-{a,b,c} (eqiad/row B) — T339298

Change 948666 had a related patch set uploaded (by Eevans; author: Eevans):

[operations/puppet@production] restbase: upgrade to Cassandra 4.1.1, eqiad/row D (6 hosts)

https://gerrit.wikimedia.org/r/948666

Change 948666 merged by Eevans:

[operations/puppet@production] restbase: upgrade to Cassandra 4.1.1, eqiad/row D (6 hosts)

https://gerrit.wikimedia.org/r/948666

Mentioned in SAL (#wikimedia-operations) [2023-08-14T21:46:23Z] <urandom> upgrading Cassandra to 4.1.1, restbase10[18,25-27,30,33]-{a,b,c} (eqiad/row D) — T339298

Host rebooted by eevans@cumin1001 with reason: None

Change 949556 had a related patch set uploaded (by Eevans; author: Eevans):

[operations/puppet@production] restbase: move (temporary) per-host settings back to role

https://gerrit.wikimedia.org/r/949556

Change 949556 merged by Eevans:

[operations/puppet@production] restbase: move (temporary) per-host settings back to role

https://gerrit.wikimedia.org/r/949556

Change 949587 had a related patch set uploaded (by Eevans; author: Eevans):

[operations/puppet@production] restbase: set legacy ssl port & optional encryption to false

https://gerrit.wikimedia.org/r/949587

Change 949587 merged by Eevans:

[operations/puppet@production] restbase: set legacy ssl port & optional encryption to false

https://gerrit.wikimedia.org/r/949587

Mentioned in SAL (#wikimedia-operations) [2023-08-17T18:53:10Z] <urandom> Rolling Cassandra restart codfw/b — T339298

Change 950028 had a related patch set uploaded (by Eevans; author: Eevans):

[operations/puppet@production] restbase: Use port 7000 for ssl monitoring checks

https://gerrit.wikimedia.org/r/950028

Change 950028 merged by Eevans:

[operations/puppet@production] restbase: Use port 7000 for ssl monitoring checks

https://gerrit.wikimedia.org/r/950028

Mentioned in SAL (#wikimedia-operations) [2023-08-17T20:03:55Z] <urandom> Rolling Cassandra restart codfw/c (RESTBase cluster) — T339298

Mentioned in SAL (#wikimedia-operations) [2023-08-17T20:20:00Z] <urandom> Rolling Cassandra restart codfw/d (RESTBase cluster) — T339298

Mentioned in SAL (#wikimedia-operations) [2023-08-17T20:30:53Z] <urandom> Rolling Cassandra restart eqiad/a (RESTBase cluster) — T339298

Mentioned in SAL (#wikimedia-operations) [2023-08-17T20:49:16Z] <urandom> Rolling Cassandra restart eqiad/b (RESTBase cluster) — T339298

Mentioned in SAL (#wikimedia-operations) [2023-08-17T20:56:06Z] <urandom> Rolling Cassandra restart eqiad/d (RESTBase cluster) — T339298