These 7 servers are being added to the cluster as replacements for analytics10[58-69]
They have been racked and installed in task T293922
Once these new server have been installed, analytics10[58-69] should be decommissioned.
BTullis | |
Jun 23 2022, 9:21 AM |
F35521189: image.png | |
Sep 15 2022, 10:26 AM |
These 7 servers are being added to the cluster as replacements for analytics10[58-69]
They have been racked and installed in task T293922
Once these new server have been installed, analytics10[58-69] should be decommissioned.
Subject | Repo | Branch | Lines +/- | |
---|---|---|---|---|
Put the new hadoop nodes into service | operations/puppet | production | +1 -6 |
I have run the sre.init-hadoop-workers cookbook on all nodes.
There was a slight issue with an-worker1146 because one of the RAID 0 volumes appeared as a foreign configuration. but I rectified this manually.
I also created the journalnode volume on all seven new nodes (even thought it may not be used on them).
The location of the new hadoop nodes has been added here: https://gerrit.wikimedia.org/r/c/operations/puppet/+/831532
This will need ot be merged and applied to the namenodes with a restart.
Once that is done, I will be able to change the role of the new nodes to add them to the cluster.
I've also added the new keytabs for the nodes and added them to the private puppet repo.
Change 831841 had a related patch set uploaded (by Btullis; author: Btullis):
[operations/puppet@production] Put the new hadoop nodes into service
Change 831841 merged by Btullis:
[operations/puppet@production] Put the new hadoop nodes into service
These are all in service now and the autoamtic daily rebalance job is running.
We can now proceed to decommission analytics10[58-69]