We need to start designing this platform in order to acquire suitable hardware for an MVP in Q2.
The design I have in mind is similar to the one set out in Common Data Infrastructure (currently a restricted slide deck).
The key details are:
- A five-node Ceph cluster, located in rows E and F of eqiad
- Each node has two 25 Gbps network connections to the ToR switches, one of which is dedicated to intra-cluster replication
- Ceph monitor processes are co-located with the object storage daemons (OSDs)
- Ceph RADOS gateways (RGWs) are also co-located with the OSDs
- The five OSD nodes are 2U servers with 24 NVMe slots in the chassis
- An optional hot tier of storage can be provisioned using the remaining NVMe slots
- A cold tier of storage will be provisioned from HDDs housed in directly connected JBOD enclosures
- The BlueStore WAL and DB devices will also be located on NVMe drives
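
As a rough aid for the hardware spec, the layout above can be sketched as a small planning model. This is only a sketch under stated assumptions: the node count, NVMe slot count and NIC figures come from the list above, while the number of HDDs per node and the number of HDD OSDs sharing one WAL/DB NVMe drive are placeholder values that have not been decided. The names `NodeSpec` and `monitor_failures_tolerated` are hypothetical and exist only for this illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class NodeSpec:
    nvme_slots: int = 24        # from the task: 24 NVMe slots per 2U chassis
    nic_count: int = 2          # from the task: two network links per node
    nic_gbps: int = 25          # from the task: 25 Gbps per link
    hdd_count: int = 24         # ASSUMPTION: HDDs in the attached JBOD
    hdds_per_db_nvme: int = 6   # ASSUMPTION: HDD OSDs sharing one WAL/DB NVMe

    def db_nvme_needed(self) -> int:
        """NVMe drives needed to hold BlueStore WAL/DB for the HDD OSDs."""
        # Ceiling division without importing math.
        return -(-self.hdd_count // self.hdds_per_db_nvme)

    def hot_tier_slots(self) -> int:
        """NVMe slots left over for the optional hot tier."""
        return self.nvme_slots - self.db_nvme_needed()


def monitor_failures_tolerated(monitors: int) -> int:
    """Ceph monitors need a strict majority to keep quorum."""
    quorum = monitors // 2 + 1
    return monitors - quorum


if __name__ == "__main__":
    node = NodeSpec()
    nodes = 5  # from the task: five-node cluster, one monitor per node
    print("WAL/DB NVMe drives per node:", node.db_nvme_needed())
    print("NVMe slots free for hot tier per node:", node.hot_tier_slots())
    print("Monitor failures tolerated:", monitor_failures_tolerated(nodes))
```

With monitors co-located on all five nodes, the majority-quorum rule means the cluster can lose two monitors and still form quorum. The NVMe figures change as soon as real HDD counts and WAL/DB ratios are chosen, so they should be read as an example of the calculation and not as a proposed spec.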
I will begin defining a hardware spec and estimating the cost.