operations-puppet-tests-buster-docker times out after 5 minutes
Closed, Resolved, Public

Description

https://gerrit.wikimedia.org/r/#/c/operations/puppet/+/587531/ had:

Patch Set 2: Verified-1

Main test build failed.

> operations-puppet-tests-buster-docker ABORTED in 5m 07s

Event Timeline

hashar triaged this task as Medium priority.
hashar moved this task from Backlog to Repo setup on the Continuous-Integration-Config board.

Change 587539 had a related patch set uploaded (by Hashar; owner: Hashar):
[integration/config@master] Slightly bump operations-puppet-tests timeout

https://gerrit.wikimedia.org/r/587539

Change 587539 merged by jenkins-bot:
[integration/config@master] Slightly bump operations-puppet-tests timeout

https://gerrit.wikimedia.org/r/587539

I have raised the timeout. The agent is https://integration.wikimedia.org/ci/computer/integration-agent-puppet-docker-1001/

The instance runs on cloudvirt1005.eqiad.wmnet, which has poor performance (see T223971: Old cloudvirt (with Intel Xeon) are half the speed of newer ones (Intel Sky Lake), with the root cause most probably being T225713: CPU scaling governor audit).

So I guess we should have the instance migrated to another cloudvirt that is not affected.
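
To check the governor hypothesis on a given host, something like the following might help. It is only a minimal sketch, assuming the standard Linux cpufreq sysfs layout (`/sys/devices/system/cpu/cpuN/cpufreq/scaling_governor`); the function name `governor_counts` is made up for illustration and is not part of any Wikimedia tooling.

```python
from collections import Counter
from pathlib import Path


def governor_counts(sysfs_root="/sys/devices/system/cpu"):
    """Count CPUs per scaling governor under a cpufreq sysfs tree.

    sysfs_root is a parameter (hypothetical helper, not existing tooling)
    so the function can be pointed at a copy of the tree for testing.
    """
    counts = Counter()
    # Each cpuN directory exposes its active governor as a one-line text file.
    for path in Path(sysfs_root).glob("cpu[0-9]*/cpufreq/scaling_governor"):
        counts[path.read_text().strip()] += 1
    return dict(counts)


if __name__ == "__main__":
    # Print one line per governor found on this host.
    for governor, n in sorted(governor_counts().items()):
        print(f"{governor}: {n} CPU(s)")
```

If the old cloudvirt reports mostly `powersave` where a newer one reports `performance`, that would be consistent with T225713, though it would not prove the governor is the only cause.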

hashar changed the task status from Open to Stalled. Apr 8 2020, 3:26 PM

The timeout raise is a workaround. The fix is to move the instance to a better cloudvirt machine: T249727: Migrate integration-agent-puppet-docker-1001 to a different cloudvirt machine.

Jdforrester-WMF added a subscriber: Jdforrester-WMF.

All puppet patches are now run on integration-agent-puppet-docker-1002, which is on cloudvirt1015.