The recent nova cloud-init/vendor data setup for /etc/hosts generates a file that doesn't work for grid engine, see T277653#6926509:
127.0.1.1 toolsbeta-sgegrid-master.toolsbeta.eqiad1.wikimedia.cloud toolsbeta-sgegrid-master 127.0.0.1 localhost
This could work:
#127.0.1.1 toolsbeta-sgegrid-master.toolsbeta.eqiad1.wikimedia.cloud toolsbeta-sgegrid-master 127.0.0.1 localhost
This also works:
172.16.0.126 toolsbeta-sgegrid-master.toolsbeta.eqiad1.wikimedia.cloud toolsbeta-sgegrid-master 127.0.0.1 localhost
This does not work:
127.0.0.1 localhost toolsbeta-sgegrid-master.toolsbeta.eqiad1.wikimedia.cloud toolsbeta-sgegrid-master
This does not work either:
127.0.0.1 toolsbeta-sgegrid-master.toolsbeta.eqiad1.wikimedia.cloud toolsbeta-sgegrid-master 127.0.0.1 localhost
As of today, the /etc/hosts file is generated by cloud-init from the template /etc/cloud/templates/hosts.debian.tmpl:
{# The value '{{hostname}}' will be replaced with the local-hostname -#} 127.0.1.1 {{fqdn}} {{hostname}} 127.0.0.1 localhost
But we don't have that template in our puppet repo (or I couldn't find it) so figure out what to do, in order to make the grid happier.