⚓ T150936 Phase out scandium.eqiad.wmnet

Subject	Repo	Branch	Lines +/-
remove scandium, keep mgmt	operations/dns	master	+0 -2
CI: decom scandium	operations/puppet	production	+1 -18
Remove zuul-merger from scandium.eqiad.wmnet	operations/puppet	production	+1 -24
Add zuul-merger on contint1001 and contint2001	operations/puppet	production	+18 -8
zuul: add contint1001/2001 to zuul merger hosts for ferm	operations/puppet	production	+2 -0

hashar created this task.Nov 17 2016, 9:45 AM

Restricted Application added a subscriber: Aklapper. · View Herald TranscriptNov 17 2016, 9:45 AM

Paladox subscribed.Nov 17 2016, 11:03 AM

hashar triaged this task as Medium priority.Nov 18 2016, 2:56 PM

hashar moved this task from Untriaged to Backlog on the Continuous-Integration-Infrastructure board.

Change 336807 had a related patch set uploaded (by Hashar):
Add zuul-merger on contint1001 and contint2001

https://gerrit.wikimedia.org/r/336807

gerritbot added a project: Patch-For-Review.Feb 9 2017, 4:25 PM

Change 336807 merged by Dzahn:
Add zuul-merger on contint1001 and contint2001

https://gerrit.wikimedia.org/r/336807

after the merge above, puppet run on scandium is unchanged. no-op

now there is just this to check

https://icinga.wikimedia.org/cgi-bin/icinga/status.cgi?search_string=git_daemon

git_daemon check is added and CRIT on contint1001/2001

Change 336961 had a related patch set uploaded (by Dzahn):
zuul: add contint1001/2001 to zuul merger hosts for ferm

https://gerrit.wikimedia.org/r/336961

Change 336961 merged by Dzahn:
zuul: add contint1001/2001 to zuul merger hosts for ferm

https://gerrit.wikimedia.org/r/336961

hashar created subtask T157785: zuul-merger git-daemon process is not started properly by systemd ?.Feb 10 2017, 9:47 AM

Mentioned in SAL (#wikimedia-operations) [2017-02-10T09:51:53Z] <hashar> Reenabling puppet and zuul-merger on contint1001 and contint2001. The git-daemon is running now T140297 T150936. The 'systemctl status git-daemon' thought that the service was running when it was not (filled T157785 )

Stashbot mentioned this in T157785: zuul-merger git-daemon process is not started properly by systemd ?.Feb 10 2017, 9:51 AM

We now have a zuul-merger on each of contint1001 and contint2001. Assuming they are working properly we will be able to phase out scandium.eqiad.wmnet entirely.

Change 337023 had a related patch set uploaded (by Hashar):
Remove zuul-merger from scandium.eqiad.wmnet

https://gerrit.wikimedia.org/r/337023

@Dzahn @RobH we no more need scandium.eqiad.wmnet. It was solely running the zuul-merger service which is now running on contint1001 and contint2001.

We would want to first remove the role::zuul::merger from the host https://gerrit.wikimedia.org/r/337023 and once that change is merged make sure the daemon is stopped:

sudo systemctl stop zuul-merger
sudo dpkg --purge zuul

You will then want to ACK alarms in Icinga or force refresh its configuration.

Once done, can you move the server back to spares or decommission it? Thanks!

Change 337023 merged by Dzahn:
Remove zuul-merger from scandium.eqiad.wmnet

https://gerrit.wikimedia.org/r/337023

Mentioned in SAL (#wikimedia-operations) [2017-02-10T16:15:24Z] <mutante> scandium - stopping zuul-merger service (T150936)

hashar added a project: hardware-requests.Feb 10 2017, 4:19 PM

Change 337041 had a related patch set uploaded (by Dzahn):
CI: decom scandium

https://gerrit.wikimedia.org/r/337041

gerritbot added a project: Patch-For-Review.Feb 10 2017, 4:24 PM

RobH moved this task from Backlog to Reclaim (Spares/Decommission) on the hardware-requests board.Feb 10 2017, 4:47 PM

Change 337041 merged by Dzahn:
CI: decom scandium

https://gerrit.wikimedia.org/r/337041

Change 337434 had a related patch set uploaded (by Dzahn):
remove scandium, keep mgmt

https://gerrit.wikimedia.org/r/337434

10:07 < mutante> !log scandium - ex-zuul merger - removing from puppet, revoking puppet cert, salt key..

removed from Icinga

Mentioned in SAL (#wikimedia-operations) [2017-02-13T18:18:05Z] <mutante> scandium - shutdown -h now (T150936)

RobH updated the task description. (Show Details)Feb 13 2017, 6:32 PM

Change 337434 merged by Dzahn:
remove scandium, keep mgmt

https://gerrit.wikimedia.org/r/337434

Dzahn updated the task description. (Show Details)Feb 13 2017, 6:38 PM

Dzahn removed a project: Patch-For-Review.Feb 13 2017, 6:40 PM

hashar removed a subtask: T157785: zuul-merger git-daemon process is not started properly by systemd ?.Feb 13 2017, 8:10 PM

RobH assigned this task to • Cmjohnson.Feb 21 2017, 7:18 PM

RobH edited projects, added ops-eqiad; removed Continuous-Integration-Infrastructure, Release-Engineering-Team.

This server has been decom'd and removed from rack.

Phase out scandium.eqiad.wmnet
Closed, ResolvedPublic
Actions

Description

decommission steps

Details

Related Objects

Event Timeline

Phase out scandium.eqiad.wmnetClosed, ResolvedPublicActions