Page MenuHomePhabricator

deploy francium for html/zim dumps
Closed, DeclinedPublic

Description

This is the hardware deployment tracking task for server francium as a html/zim dumps host.

  • - setup system mgmt dns entries & test (handled by @RobH) (already existed)
  • - install new 4 * 3TB disks (handled by @Cmjohnson via T93114)
  • - setup system production dns entries (handled by @RobH)
  • - update install module with server info (handled by @RobH)
  • - setup network port/vlan (handled by @RobH)
  • - install os debian/jessie (handled by @RobH) - raid10 disks with typical install
  • - sign puppet/salt keys
  • - implement service (handled by @GWicke)

Event Timeline

RobH claimed this task.
RobH raised the priority of this task from to Needs Triage.
RobH updated the task description. (Show Details)
RobH added a project: acl*sre-team.
RobH added subscribers: RobH, Cmjohnson, GWicke.
RobH set Security to None.
RobH updated the task description. (Show Details)
RobH triaged this task as Medium priority.Mar 19 2015, 10:20 PM

Change 199434 had a related patch set uploaded (by RobH):
setting francium install params raid10 partman recipe

https://gerrit.wikimedia.org/r/199434

Change 199434 merged by RobH:
setting francium install params raid10 partman recipe

https://gerrit.wikimedia.org/r/199434

Change 199642 had a related patch set uploaded (by RobH):
raid10-gpt partman recipe corrections

https://gerrit.wikimedia.org/r/199642

Change 199642 merged by RobH:
raid10-gpt partman recipe corrections

https://gerrit.wikimedia.org/r/199642

Change 199645 had a related patch set uploaded (by RobH):
testing a partman recipe

https://gerrit.wikimedia.org/r/199645

Change 199645 merged by RobH:
testing a partman recipe

https://gerrit.wikimedia.org/r/199645

Change 199647 had a related patch set uploaded (by RobH):
raid10-gpt.cfg partman fixed

https://gerrit.wikimedia.org/r/199647

All the ops side of things on this are done except for whatever site.pp and operations repo changes are needed for service implementation.

I would imagine that @GWicke would handle those and have the oncall duty opsen for the week to merge. Assigning to him.

RobH added a subscriber: ArielGlenn.

I could be wrong, and then it may be @ArielGlenn who handles this, not certain... (I just know I discussed this system request with them both.)

@RobH, thanks! The next thing we'll need is some level of access to this box, ideally sudo. It looks like I ran into the same trap of not explicitly spelling out 'and we need access too' again. Will we have to wait until next Monday?

All sudo requests have to be their own ticket in ops-access-requests and then have them approved in the monday meeting, so yep. The other way to not wait is to have ops mgmt directly override.

Change 199647 merged by RobH:
raid10-gpt.cfg partman fixed

https://gerrit.wikimedia.org/r/199647

So this is testing and not production ready? (Not sure why sudo would be needed otherwise.)

@RobH, created T94093 for the access. Sudo is not strictly needed if somebody from ops takes on the task of setting up nginx (to serve the first dumps until there is more storage on the download arrays) and nodejs.

It seems that this service could have been puppetized in labs, but wasn't.

So now services either needs to puppetize this for their use, request ops do so, or have sudo on the box.

I elect the first option, but that is up to services/ops discussion. If the puppetization properly sets things up, sudo shouldn't be required. Or if it is required, a very limited non root implementation.

Change 204617 had a related patch set uploaded (by Dzahn):
site.pp: add node francium.eqiad.wmnet

https://gerrit.wikimedia.org/r/204617

Change 204617 merged by Dzahn:
site.pp: add node francium.eqiad.wmnet

https://gerrit.wikimedia.org/r/204617

Change 204618 had a related patch set uploaded (by Dzahn):
dumps::zim: add role and firewall to francium

https://gerrit.wikimedia.org/r/204618

Change 204618 merged by Dzahn:
dumps::zim: add role and firewall to francium

https://gerrit.wikimedia.org/r/204618

Resolving on our end, as @ArielGlenn is now working on setting up dumps. See T133547 for current work.