Page MenuHomePhabricator

cdh::hadoop::directory (and other hdfs puppet command?) should quickly check if namenode is active before executing
Closed, ResolvedPublic

Description

If there is no active NameNode, many execs within the cdh puppet module will fail after a long timeout. They run the hdfs command, which attempts to talk to an active NameNode. We should add a dependency for all of these execs that somehow determines if a NameNode is active, perhaps via some http curl command.

This usually is only a problem when setting up a new Hadoop cluster, e.g. in labs.

Event Timeline

elukey subscribed.

ping @Ottomata is this still needed? if so, how urgently?

This is not urgent at all, is just nice to have if/when we set up new hadoop clusters in labs for testing.

Change 337866 had a related patch set uploaded (by Ottomata):
Set timeouts on various hdfs puppet execs

https://gerrit.wikimedia.org/r/337866

Change 337866 merged by Ottomata:
Set timeouts on various hdfs puppet execs

https://gerrit.wikimedia.org/r/337866

Milimetric claimed this task.

Andrew made a good enough patch for now.

Milimetric closed this task as Resolved.
Milimetric raised the priority of this task from Low to High.
Milimetric moved this task from Wikistats to Data Quality on the Analytics board.