Page MenuHomePhabricator

Enable the cephfs services on the DPE Ceph cluster
Closed, ResolvedPublic

Description

In the original design document for the Ceph cluster that supports the Data-Platform group, only a passing reference was made to the Ceph File System and the Metadata Service (MDS) daemons that it requires.

Metadata servers (MDS) - these enable a shared Posix file system interface. This shared file system is an entirely optional component of the cluster, but could be useful.

Subsequently, when working on the technical details of the cluster, we concentrated on the Block Device and Object Gateway functionality. Both of these interfaces are now in active use.

However, we have now identified some use cases where the Ceph File System (cephfs) would be extremely useful, so we now plan to enable this additional interface and to integrate it with the dse-k8s cluster as an additional container storage interface (CSI) plugin, alongside the block device plugin.

One of the primary use cases would be to enable file systems to be provisioned to Kubernetes workloads with a ReadWriteMany access mode.
See: https://kubernetes.io/docs/concepts/storage/persistent-volumes/#access-modes for details of these access modes.

This would enable persistent volumes to be made available across the cluster, with specific components able to update the filesystem contents.
The first use cases for this functionality will be Airflow DAGs distribution (T368033) and Kerberos ticket caches (T375871).

This ticket will be an umbrella task to enable this functionality, with sub-tasks for the required elements.

We have decided to continue with the colocated design of the cluster, so the mds daemons will be installed to the five cephosd100[1-5] servers.

We will be following the guidelines for enabling cephfs from here:

Event Timeline

BTullis triaged this task as High priority.

Change #1081110 had a related patch set uploaded (by Btullis; author: Btullis):

[operations/deployment-charts@master] cephfs: Run the csi-cephfsplugin as uid 0

https://gerrit.wikimedia.org/r/1081110

Change #1081111 had a related patch set uploaded (by Btullis; author: Btullis):

[operations/deployment-charts@master] cephfs: bump the image of the ceph csi plugin image

https://gerrit.wikimedia.org/r/1081111

Change #1081112 had a related patch set uploaded (by Btullis; author: Btullis):

[operations/deployment-charts@master] ceph-rbd: Bump the ceph-csi plugin image

https://gerrit.wikimedia.org/r/1081112

Change #1081110 merged by jenkins-bot:

[operations/deployment-charts@master] cephfs: Run the csi-cephfsplugin as uid 0

https://gerrit.wikimedia.org/r/1081110

Change #1081111 merged by jenkins-bot:

[operations/deployment-charts@master] cephfs: bump the image of the ceph csi plugin image

https://gerrit.wikimedia.org/r/1081111