Page MenuHomePhabricator

Provide a simple way to backup arbitrary files from instances
Closed, DuplicatePublic

Description

Such as mysql backups, files processed, etc.

Event Timeline

yuvipanda raised the priority of this task from to Needs Triage.
yuvipanda updated the task description. (Show Details)
yuvipanda added a project: Cloud-Services.
yuvipanda added subscribers: Aklapper, yuvipanda.
akosiaris subscribed.

Let's start by evaluating a central bacula server as we have in production that will be able to backup files from labs VMs. That way we will build on established knowledge and move fast. There are various technicalities to solve like production using exported resources and we can't in labs, shared secret population, usage or not of TLS and so on.

We have some decisions like retention policy (how long back in time will our backups go, how often we will be recreating them from scratch, and so on.

Later on, we will need some disk space estimations and procure some hardware.

So ideally the workflow I'd like people to have:

  1. Specify filesets in hiera (so can do it per host or per-project)
  2. Specify retention strategy (from a few options we give them)
  3. Restores would go through phabricator

This would allow me to remove NFS from a few proejcts, and feel better about a lot more projects :)

So ideally the workflow I'd like people to have:

  1. Specify filesets in hiera (so can do it per host or per-project)

Agreed. Where however this will be defined in is a good question. The per project hiera sounds good enough for the users to configure but it is not so easily fetchable from the director/storage daemon's POVs. In production we solve that by having a single administrative domain and use exported resources but we clearly can't use that in the multi-project environments labs is.

  1. Specify retention strategy (from a few options we give them)

Nope. Let's not. Same retention policy for everything/everyone. There are anyway multiple levels of retention policies and I am pretty sure all most people will be confused by how they work. For example, per client retention policies are NOT guarantees the data will stay that long around. They more like guarantees the data is indexed. And the per volume retention policies can not be set per client.

  1. Restores would go through phabricator

Agreed.

This would allow me to remove NFS from a few proejcts, and feel better about a lot more projects :)

Just +1ing this.

I'm struggling to implement a robust backup strategy for ORES and Wikilabels right now. Because I don't want the instability of NFS, I'm currently SCPing copies of my databases to my home server.

chasemp subscribed.

Just +1ing this.

I'm struggling to implement a robust backup strategy for ORES and Wikilabels right now. Because I don't want the instability of NFS, I'm currently SCPing copies of my databases to my home server.

how would you feel about us providing space for scp/rsync instead of NFS for you to back things up to?

simple quota'd persistent space

Seems like that would be a reasonable strategy to me.