New anti-stackclash (4.9.25-1~bpo8+3 ) kernel super bad for NFS
Closed, ResolvedPublic
Actions

Assigned To

Authored By

	Andrew
	Jun 30 2017, 2:37 AM

Description

We upgraded labstore1004/1005 to 4.9.25-1~bpo8+3 and things got really bad. Downgraded back to the former kernel and things got better.

Attached graph shows this in terrifying color.

Related Objects
Search...

Status	Assigned	Task
Resolved	• Bstorm	T169289 Tool Labs 2017-06-29 Labstore100[45] kernel upgrade issues
Resolved	MoritzMuehlenhoff	T169290 New anti-stackclash (4.9.25-1~bpo8+3 ) kernel super bad for NFS
Resolved	• Bstorm	T203254 labstore1004 and labstore1005 high load issues following upgrades
Resolved	• Bstorm	T224582 Migrate labstore1004/labstore1005 to Stretch/Buster
Resolved	• Bstorm	T253353 Add cluster-awareness to nfs-exportd
Declined	None	T257945 NFS v4.1/2 as possible fix for elevated load and lock contention on our NFS servers
Resolved	aborrero	T277653 Toolforge: add Debian Buster to the grid and eliminate Debian Stretch
Resolved	aborrero	T277866 cloud-init: figure out how to change /etc/hosts from cloud-init/vendordata
Resolved	aborrero	T278232 Toolforge: figure out how to work with the new domain in the grid
Resolved	aborrero	T278748 Toolforge: introduce support for selecting grid queue release
Resolved	aborrero	T282972 "--release is not implemented for --backend=kubernetes" with latest tools-webservice from Git
Resolved	taavi	T288961 /usr/local/bin/webservice:109: DeprecationWarning: dist() and linux_distribution() functions are deprecated in Python 3.5
Resolved	aborrero	T300501 Toolforge grid: uwsgi in buster fails to load python3 venvs
Resolved	taavi	T280037 Toolforge: set up monitoring tooling for stretch deprecation
Duplicate	None	T280252 Toolforge Buster bastion no longer tab completes become command
Resolved	taavi	T284767 Toolforge: migrate cron servers to Debian Buster
Declined	None	T298089 Sort out Mono repositories for the buster grid
Resolved	aborrero	T298948 Toolforge grid deployment/management automation
Declined	None	T300032 spicerack: introduce GridEngine controller
Resolved	aborrero	T301665 Toolforge jobs framework: create documentation on wikitech
Resolved	taavi	T309525 Toolforge: Create a cookbook to decomission a SGE node
Resolved	taavi	T309732 New SGE nodes can't talk to the grid engine master
Resolved	• nskaggs	T309821 Buster webservice grid went BOOM!
Declined	taavi	T309902 Tiny swap on many grid nodes
Declined	None	T336034 Toolforge grid automation: consider creating a cookbook to heal the grid from D state procs

Event Timeline

Andrew created this task.Jun 30 2017, 2:37 AM

Restricted Application added a subscriber: Aklapper. · View Herald TranscriptJun 30 2017, 2:37 AM

Andrew renamed this task from New anti-stackclash (4.9.25-1~bpo8+3 ) kernal SUPER BAD for NFS to New anti-stackclash (4.9.25-1~bpo8+3 ) kernel super bad for NFS.Jun 30 2017, 2:37 AM

Andrew added a parent task: T169289: Tool Labs 2017-06-29 Labstore100[45] kernel upgrade issues.

• bd808 mentioned this in T169281: Labstore nfsd processes report "sent only x when sending y bytes - shutting down socket".Jun 30 2017, 3:01 AM

Paladox subscribed.Jul 1 2017, 6:55 PM

Nemo_bis added a project: Upstream.Jul 2 2017, 8:19 PM

"A total of 16,214 non-merge changesets were pulled into the mainline repository for the 4.9 development cycle, making this cycle the busiest in the kernel project's history." https://www.linux.com/news/linux-weather-forecast

Which NFS services/processes caused this?

In T169290#3399875, @MoritzMuehlenhoff wrote:

Which NFS services/processes caused this?

Summarizing from IRC for posterity :)

Load was proportional to what we would expect but way inflated (periods of high use were higher and periods of low use were lower). We generally see load of .5-3 during normal operations over the last 10 months or so and here it was averaging 20-50 and we were seeing 80-110. Client side we saw load climb, and we observed a rotating cast of nfsd procs in D wait state server side. When nfs-kernel-server was stopped load dropped until it was started again. Other than performance being way off normal T169281 was the only real clue that things were wrong.

• chasemp mentioned this in T185101: Labstore1006/7 profile for meltdown kernel.Jan 17 2018, 1:34 PM

• chasemp mentioned this in T181121: Kernels errors on ganeti1005- ganeti1008 under high I/O.Mar 6 2018, 8:15 PM

I talked to someone in #drbd (lge a dev I think) who said they have no reason to think there would be an issue with 4.4 or 4.9 kernel variants with the module version 8.4.5 but they suggested we grab https://github.com/LINBIT/drbd-8.4 and build at 8.4.10 their 'out of tree' bug fix and up-to-date tag as that's the next step to really demonstrating for upstream. Suggested double checking IO scheduler doesn't change since that could have drastic effects.

This is resolved, the jessie-based labstore servers are running 4.9 since a few weeks.

• Bstorm added a subtask: T203254: labstore1004 and labstore1005 high load issues following upgrades.Mar 21 2019, 5:47 PM

• Bstorm mentioned this in T217086: Investigate why the new Son of Grid Engine grid landed in a worse state when NFS was filled than the old Sun Grid Engine grid did.Mar 21 2019, 5:50 PM

• Bstorm closed subtask T203254: labstore1004 and labstore1005 high load issues following upgrades as Resolved.Jun 11 2020, 11:59 PM

Restricted Application edited projects, added cloud-services-team (Kanban); removed cloud-services-team. · View Herald TranscriptJun 11 2020, 11:59 PM

New anti-stackclash (4.9.25-1~bpo8+3 ) kernel super bad for NFSClosed, ResolvedPublicActions

Description

Related ObjectsSearch...

Event Timeline

New anti-stackclash (4.9.25-1~bpo8+3 ) kernel super bad for NFS
Closed, ResolvedPublic
Actions

Related Objects
Search...