Page MenuHomePhabricator

Find and clean up instances that are unreachable by ssh
Closed, DuplicatePublic

Description

P816 is a list of instances that aren't reachable by ssh with my root key with a 30s timeout. They should be investigated, and probably fall into two categories:

  • Shut off instances
  • Instances that haven't run puppet in *months*

Both should be cleaned up.

Related Objects

Event Timeline

yuvipanda raised the priority of this task from to Needs Triage.
yuvipanda updated the task description. (Show Details)
yuvipanda added a project: Cloud-Services.
yuvipanda subscribed.

Do you think we should fix them, or just delete?

SHUTOFF should just delete, the other ones I realistically think we have no choice but to just delete - but we should delete with notification, I think

Many instances without ssh can still be reached by salt -- so that's a way to double-check things.

Also note that some instances are SHUTOFF because the owners legitimately want them saved but just have them turned off in the meantime. I probably have some instances like that.

Do you think you can produce a list of SHUTOFF instances (I couldn't figure out how to quary it from the nova commandline) and so we can find the list of instances that are on but unreachable?

nova list --all-tenants | grep SHUTOFF

interested in what's going on with this task because it's one of the things I watch when checking salt behavior on labs. still in progress?