Page MenuHomePhabricator

Test Ceph for instance storage
Open, HighPublic

Description

Shared instance storage is a mixed bag. As long as it works, it reduces our vulnerability to the crash of a single virt host.

On the downside, if we rely on a single file system (as we did, historically, with Gluster), a single file-system bug can kill everything at once.

So, this is something we should look at but it needs to be tested extravagantly before we place any reliance on distributed instance storage again.

This task is now part of https://wikitech.wikimedia.org/wiki/Incident_documentation/20190213-cloudvps

Good technical overview of Ceph: https://www.suse.com/media/report/discover_cephfs.PDF

Event Timeline

Andrew created this task.Feb 22 2015, 12:29 AM
Andrew claimed this task.
Andrew raised the priority of this task from to Needs Triage.
Andrew updated the task description. (Show Details)
Andrew added a project: Cloud-Services.
Andrew added a subscriber: Andrew.
Restricted Application added a subscriber: Aklapper. · View Herald TranscriptFeb 22 2015, 12:29 AM
scfc added a subscriber: scfc.Feb 22 2015, 3:59 AM
Andrew triaged this task as Normal priority.May 13 2015, 3:17 PM
Andrew set Security to None.
Andrew removed Andrew as the assignee of this task.Nov 16 2015, 5:51 PM
aborrero raised the priority of this task from Normal to High.Feb 14 2019, 12:58 PM
aborrero moved this task from To Triage to Follow-up/Actionables on the Wikimedia-Incident board.
aborrero moved this task from Inbox to Important on the cloud-services-team (Kanban) board.
aborrero added subscribers: GTirloni, Bstorm, bd808.
aborrero updated the task description. (Show Details)Feb 14 2019, 1:03 PM
GTirloni removed a subscriber: GTirloni.Mar 21 2019, 9:06 PM
GTirloni updated the task description. (Show Details)Mar 23 2019, 8:39 PM
Cmjohnson closed subtask Unknown Object (Task) as Resolved.Jun 24 2019, 5:25 PM