At around 15:14 UTC on July 31, this alert fired on toolschecker:
PROBLEM - toolschecker: showmount succeeds on a labs instance on checker.tools.wmflabs.org is CRITICAL: HTTP CRITICAL: HTTP/1.1 503 SERVICE UNAVAILABLE - string OK not found on http://checker.tools.wmflabs.org:80/nfs/secondary_cluster_showmount - 177 bytes in 0.023 second response time https://wikitech.wikimedia.org/wiki/Portal:Toolforge/Admin/Toolschecker
The issue is that showmount -e nfs-tools-project.svc.eqiad.wmnet returns an error:
rpc mount export: RPC: Remote system error
The same happens against localhost on labstore1004. Restarting NFS didn't help, but NFS doesn't restart very cleanly on this host without a reboot. The log messages from around 15:45 are from me restarting NFS.
The alert is ACKed, and perhaps we should schedule a reboot of the server. Doing it immediately seems needlessly disruptive.
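
For re-testing after a reboot, here is a rough sketch of the manual check described above. It is not the actual toolschecker code: the function name `showmount_ok`, the `runner` parameter and the 10 second timeout are assumptions made for illustration. It only wraps the same `showmount -e <host>` command and reports whether it exited cleanly.

```python
import subprocess


def showmount_ok(host, runner=subprocess.run, timeout=10):
    """Run `showmount -e host` and return (ok, detail).

    `runner` is a hypothetical hook so the command can be faked in tests;
    the timeout value is an arbitrary assumption.
    """
    try:
        result = runner(
            ["showmount", "-e", host],
            capture_output=True,
            text=True,
            timeout=timeout,
        )
    except (OSError, subprocess.TimeoutExpired) as exc:
        # showmount is missing, not executable, or it hung
        return False, str(exc)
    if result.returncode != 0:
        # e.g. "rpc mount export: RPC: Remote system error"
        return False, (result.stderr or result.stdout).strip()
    return True, result.stdout.strip()
```

Running `showmount_ok("nfs-tools-project.svc.eqiad.wmnet")` and `showmount_ok("localhost")` on labstore1004 should make it easy to tell whether a reboot cleared the error in both places.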