Update big spark jobs conf with better settings
Closed, Resolved, Public

Event Timeline

Change 482661 had a related patch set uploaded (by Joal; owner: Joal):
[analytics/refinery/source@master] Update big spark jobs settings

https://gerrit.wikimedia.org/r/482661

fdans moved this task from Incoming to Operational Excellence on the Analytics board.

Change 482661 merged by jenkins-bot:
[analytics/refinery/source@master] Update big spark jobs settings

https://gerrit.wikimedia.org/r/482661

Reviewed the doc and fixed some spelling. I don't know what spill files are, but the rest made sense to me.

For the record, @Milimetric: spill files are the temporary files generated between steps when data doesn't fit in memory (they're called spill files because memory is filled first, and whatever doesn't fit spills out to disk). For big jobs, they account for a lot of data and I/O.
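
To make the spilling idea above concrete, here is a toy Python sketch. It is only an illustration of the fill-memory-then-spill pattern, not how Spark implements it; `SpillingBuffer`, `max_in_memory`, and `demo` are hypothetical names invented for this example.

```python
import pickle
import tempfile


class SpillingBuffer:
    """Toy buffer: keeps records in memory, spills them to disk when full.

    Hypothetical illustration only; this is not Spark's implementation.
    """

    def __init__(self, max_in_memory):
        self.max_in_memory = max_in_memory  # records allowed in memory at once
        self.in_memory = []                 # records currently held in memory
        self.spill_files = []               # temporary files holding spilled records

    def add(self, record):
        # Memory is filled first; once it is full, its contents spill to disk.
        if len(self.in_memory) >= self.max_in_memory:
            self._spill()
        self.in_memory.append(record)

    def _spill(self):
        # Write the in-memory batch to a temporary file and free the memory.
        spill_file = tempfile.TemporaryFile()
        pickle.dump(self.in_memory, spill_file)
        self.spill_files.append(spill_file)
        self.in_memory = []

    def read_all(self):
        # The next step has to read every spill file back: extra disk I/O.
        for spill_file in self.spill_files:
            spill_file.seek(0)
            yield from pickle.load(spill_file)
        yield from self.in_memory


def demo():
    buf = SpillingBuffer(max_in_memory=3)
    for i in range(10):
        buf.add(i)
    return len(buf.spill_files), list(buf.read_all())
```

Calling `demo()` pushes 10 records through a buffer that holds only 3, so most of the data ends up written to and read back from temporary files. In this sketch a larger `max_in_memory` means fewer spill files, which is the kind of trade-off that memory settings for big jobs are meant to address.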

Change 482661 merged by Fdans:
[analytics/refinery/source@master] Update big spark jobs settings

https://gerrit.wikimedia.org/r/482661