
codfw: 1 misc node for the Kerberos KDC service
Open, Normal, Public, 0 Story Points

Description

Please note there are two requests currently open to add a single-CPU misc host for the Kerberos KDC and Kadmin daemons: T227425 in codfw and T227288 in eqiad. The two #hw-requests are identical in all respects apart from the site.

Site/Location: codfw
Number of systems: 1
Service: kerberos
Networking Requirements: internal IP, no specific network subnet (it will not need to be in the analytics vlan).
Spec: current one for misc nodes

This is one of the two nodes that will be hosting the Kerberos KDC and Kadmin daemons (one host will act as master and the other one as standby, one in eqiad and the other one in codfw). We don't have any special requirements; one node following the current misc specs will be more than enough. It would be really great, if possible, to purchase and rack the eqiad and codfw nodes in Q1.

Event Timeline

MoritzMuehlenhoff triaged this task as Normal priority. Jul 8 2019, 10:51 AM
Milimetric moved this task from Incoming to Radar on the Analytics board. Jul 8 2019, 3:58 PM
elukey moved this task from Backlog to Kerberos on the User-Elukey board. Jul 15 2019, 10:10 AM
RobH moved this task from Backlog to Stalled on the hardware-requests board. Jul 16 2019, 5:06 PM
RobH assigned this task to elukey. Jul 16 2019, 6:11 PM
RobH updated the task description.
RobH moved this task from Stalled to In Discussion / Review on the hardware-requests board.
RobH added a subscriber: RobH.

Please note there are two requests currently open to add a single-CPU misc host for the Kerberos KDC and Kadmin daemons: T227425 in codfw and T227288 in eqiad. The two #hw-requests are identical in all respects apart from the site.

Unfortunately, our spare pool systems in the two sites are NOT identical. We have not had to order spare pool systems in codfw as recently as in eqiad, so their specifications do not exactly match. We need sign-off from @elukey that these slight differences won't matter. @RobH doesn't think they will, since the request is just for a minimum-specification host in each site, but wants to ensure this is OK with Analytics.

A list of every spare pool system is viewable via this netbox url: https://netbox.wikimedia.org/dcim/devices/?q=&role=server&status=5&mac_address=&has_primary_ip=&console_ports=&console_server_ports=&power_ports=&power_outlets=&interfaces=&pass_through_ports=&cf_owner=&cf_purchase_date=&cf_support_contract=&cf_support_until=&cf_ticket=

CODFW Host

We do not have any single-CPU systems available within warranty for this use in codfw. However, it may be cheaper to allocate an existing dual-CPU system (whose warranty clock has already started) than to purchase a new single-CPU system. As such, I'll list the info for the dual-CPU system, knowing it may be overkill, but it has already been purchased and is in the rack.

  • PowerEdge R430 - WMF6577 (1 of 3 of these hosts available) - purchased on T166265
    • Dual Intel Xeon E5-2623 v4 2.6GHz/4Cores
    • 64GB RAM
    • 1Gb NIC
    • (4) 4TB LFF SATA disks (software raid only)

EQIAD host

  • Dell PowerEdge R440 1U system - WMF5173 (1 of 3 of these hosts available) - purchased on T216269
    • Single Intel Xeon Silver 4110 2.1G, 8C/16T (comes out to the same number of cores as the older dual-CPU system in codfw recommended above)
    • 32 GB RAM
    • 1Gb NIC
    • (2) 480GB SSD (software raid only)
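
For reference, the core-count comparison noted in the EQIAD bullet can be checked with a small sketch. This is only illustrative: the `HostSpec` class and its field names are hypothetical, and the figures are simply copied from the two spec lists above.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class HostSpec:
    """Hypothetical container for the spare-pool specs quoted in this task."""
    name: str
    sockets: int
    cores_per_socket: int
    ram_gb: int

    @property
    def total_cores(self) -> int:
        # Physical cores only; hyper-threading is not counted.
        return self.sockets * self.cores_per_socket


# Figures taken from the CODFW and EQIAD host lists in this task.
codfw = HostSpec("PowerEdge R430 (WMF6577)", sockets=2, cores_per_socket=4, ram_gb=64)
eqiad = HostSpec("PowerEdge R440 (WMF5173)", sockets=1, cores_per_socket=8, ram_gb=32)

for host in (codfw, eqiad):
    print(f"{host.name}: {host.total_cores} physical cores, {host.ram_gb} GB RAM")
```

Both hosts come out to the same physical core count; the codfw host has twice the RAM.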

Before I create the private S4 procurement tasks for management approval of these allocations (which will include pricing), I wanted to check whether the hardware selection above meets the approval of @elukey / @MoritzMuehlenhoff / @Ottomata.

I'm assigning this to @elukey, who created the requests, for feedback on the above. Please comment and assign back to me for follow-up.

Thanks!

Asking @MoritzMuehlenhoff for confirmation since these hosts might be used by SRE too, but from my point of view the difference is OK (and the hardware is more than enough for this use case).

Ack, this looks good to me!

elukey reassigned this task from elukey to RobH. Jul 17 2019, 8:05 AM