Starting point: https://hadoop.apache.org/docs/r2.6.0/hadoop-yarn/hadoop-yarn-site/NodeManagerRestart.html
The goal is to be able to restart the Node Manager daemons on the Hadoop worker nodes without impacting running containers.
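For reference, the linked Hadoop documentation names three yarn-site.xml properties for Node Manager recovery: `yarn.nodemanager.recovery.enabled`, `yarn.nodemanager.recovery.dir`, and a `yarn.nodemanager.address` with a fixed (non-ephemeral) port. Below is a minimal sketch of a sanity check for those settings. The recovery directory and port in the sample are illustrative assumptions, not the values used in the actual change, and the helper names are hypothetical.

```python
import xml.etree.ElementTree as ET

# Hypothetical yarn-site.xml fragment. The directory and the port are
# illustrative assumptions, not values taken from the actual change.
YARN_SITE = """<configuration>
  <property>
    <name>yarn.nodemanager.recovery.enabled</name>
    <value>true</value>
  </property>
  <property>
    <name>yarn.nodemanager.recovery.dir</name>
    <value>/var/lib/hadoop-yarn/yarn-nm-recovery</value>
  </property>
  <property>
    <name>yarn.nodemanager.address</name>
    <value>0.0.0.0:8041</value>
  </property>
</configuration>"""


def load_properties(xml_text):
    """Parse Hadoop-style <property> entries into a name -> value dict."""
    root = ET.fromstring(xml_text)
    props = {}
    for prop in root.findall("property"):
        name = prop.findtext("name")
        if name:
            props[name.strip()] = (prop.findtext("value") or "").strip()
    return props


def recovery_problems(props):
    """Return a list of human-readable problems; empty means the settings look sane."""
    problems = []
    if props.get("yarn.nodemanager.recovery.enabled", "false").lower() != "true":
        problems.append("yarn.nodemanager.recovery.enabled is not true")
    if not props.get("yarn.nodemanager.recovery.dir"):
        problems.append("yarn.nodemanager.recovery.dir is not set")
    # An ephemeral port (0) would change across restarts, so clients could not
    # reconnect to the restarted Node Manager.
    port = props.get("yarn.nodemanager.address", "").rpartition(":")[2]
    if not port.isdigit() or int(port) == 0:
        problems.append("yarn.nodemanager.address must use a fixed, non-zero port")
    return problems


if __name__ == "__main__":
    for problem in recovery_problems(load_properties(YARN_SITE)):
        print(problem)
```

Running the same check against the yarn-site.xml deployed on a worker node before restarting its Node Manager would flag a missing recovery directory or an ephemeral port early.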
Change 336203 had a related patch set uploaded (by Elukey):
Enable Yarn's Node Manager recovery to allow graceful restarts
Mentioned in SAL (#wikimedia-operations) [2017-02-06T13:30:05Z] <elukey> applied https://gerrit.wikimedia.org/r/#/c/336203/ manually to analytics1028 (hadoop worker node) as live test - T156932
Change 336203 merged by Elukey:
Enable Yarn's Node Manager recovery to allow graceful restarts
Change 336380 had a related patch set uploaded (by Elukey):
Update the cdh's module sha
Mentioned in SAL (#wikimedia-operations) [2017-02-07T14:19:50Z] <elukey> restarting all the Yarn Node Managers on the Hadoop worker nodes to pick up the new config - T156932