
Test role::analytics_cluster::coordinator on Debian Stretch
Closed, Resolved | Public | 5 Estimated Story Points

Description

In T198685 we are ordering a replacement for analytics1003, the host that currently runs role::analytics_cluster::coordinator. Since the role's packages have not been tested on Stretch, we should test them in labs before replacing the host in production.

Event Timeline

elukey triaged this task as Medium priority. Sep 11 2018, 2:14 PM
elukey created this task.

Change 459795 had a related patch set uploaded (by Elukey; owner: Elukey):
[operations/puppet@production] profile::analytics::refinery::job::camus: ease labs testing

https://gerrit.wikimedia.org/r/459795

Change 459795 merged by Elukey:
[operations/puppet@production] profile::analytics::refinery::job::camus: ease labs testing

https://gerrit.wikimedia.org/r/459795

elukey changed the task status from Open to Stalled. Sep 12 2018, 7:52 AM

Created a subtask to track the current issue with (managed) MariaDB on Stretch; waiting for its completion before re-testing.

The MariaDB issue seems resolved, but now the Puppet run fails with:

elukey@hadoop-coordinator-2:~$ sudo puppet agent -tv
Info: Using configured environment 'production'
Info: Retrieving pluginfacts
Info: Retrieving plugin
Info: Loading facts
Error: Could not retrieve catalog from remote server: Error 500 on SERVER: Server Error: Evaluation Error: Error while evaluating a Resource Statement, Evaluation Error: Error while evaluating a Function Call, has_key(): expects the first argument to be a hash, got "" which is of type String at /etc/puppet/modules/mediawiki/functions/state.pp:11:12 on node hadoop-coordinator-2.analytics.eqiad.wmflabs
Warning: Not using cache on failed catalog
Error: Could not retrieve catalog; skipping run

This was introduced recently by the use of mediawiki::state('primary_dc') in camus.pp: as the error shows, has_key() receives an empty string instead of a hash on this labs host.
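To illustrate the failure mode, here is a minimal Python sketch. It is not the actual Puppet code. It models a strict has_key() that rejects a non-hash first argument, and a guard flag that skips the state lookup entirely when monitoring is disabled. The names primary_dc, camus_resources and monitoring_enabled are hypothetical, and the idea that a "parameter to enable monitors" avoids the lookup in labs is an assumption based only on the patch titles in this task.

```python
def has_key(container, key):
    """Mimic the strict check from the error: the first argument must be a hash."""
    if not isinstance(container, dict):
        raise TypeError(
            "has_key(): expects the first argument to be a hash, "
            f"got {container!r} which is of type {type(container).__name__}"
        )
    return key in container


def primary_dc(state):
    """Hypothetical stand-in for mediawiki::state('primary_dc')."""
    if not has_key(state, "primary_dc"):
        raise KeyError("primary_dc")
    return state["primary_dc"]


def camus_resources(state, monitoring_enabled):
    """Build a toy resource list; the state lookup runs only if monitoring is on."""
    resources = ["camus_job"]
    if monitoring_enabled:
        # Assumption: only the monitoring resources need the primary datacenter.
        resources.append(f"monitor@{primary_dc(state)}")
    return resources


if __name__ == "__main__":
    # Labs-like case: the state is an empty string, monitoring disabled.
    print(camus_resources("", monitoring_enabled=False))
    # Production-like case: the state is a proper hash.
    print(camus_resources({"primary_dc": "eqiad"}, monitoring_enabled=True))
```

With the flag off, the empty-string state is never inspected, so the catalog-style failure above cannot occur; with the flag on and an empty-string state, the sketch raises a TypeError analogous to the Puppet evaluation error.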

Change 460365 had a related patch set uploaded (by Elukey; owner: Elukey):
[operations/puppet@production] profile::analytics::refinery::job::camus: add parameter to enable monitors

https://gerrit.wikimedia.org/r/460365

Change 460365 merged by Elukey:
[operations/puppet@production] profile::analytics::refinery::job::camus: add parameter to enable monitors

https://gerrit.wikimedia.org/r/460365

Change 460380 had a related patch set uploaded (by Elukey; owner: Elukey):
[operations/puppet@production] profile::analytics::database::meta: upgrade mariadb version for labs

https://gerrit.wikimedia.org/r/460380

Change 460380 merged by Elukey:
[operations/puppet@production] profile::analytics::database::meta: upgrade mariadb version for labs

https://gerrit.wikimedia.org/r/460380

Change 460387 had a related patch set uploaded (by Elukey; owner: Elukey):
[operations/puppet@production] profile::analytics::database::meta: update mariadb's basedir for stretch

https://gerrit.wikimedia.org/r/460387

Change 460387 merged by Elukey:
[operations/puppet@production] profile::analytics::database::meta: update mariadb's basedir for stretch

https://gerrit.wikimedia.org/r/460387

Change 460399 had a related patch set uploaded (by Elukey; owner: Elukey):
[operations/puppet@production] profile::analytics::database::meta: use the same prod config in labs

https://gerrit.wikimedia.org/r/460399

Change 460399 merged by Elukey:
[operations/puppet@production] profile::analytics::database::meta: use the same prod config in labs

https://gerrit.wikimedia.org/r/460399

Fixed during the offsite, everything works!

elukey set the point value for this task to 5. Sep 26 2018, 8:23 AM
elukey moved this task from In Progress to Done on the Analytics-Kanban board.