Kask: gocql: no hosts available in the pool errors
Closed, Resolved · Public

Assigned To

Authored By

	Eevans
	May 27 2021, 4:37 PM

Description

It would seem that if Kask loses connectivity to Cassandra (via the gocql driver), the host is permanently de-pooled (never to be re-pooled). This results in the following error message:

Error reading from storage (gocql: no hosts available in the pool)

Once this happens, the container running Kask must be restarted.

This seems to correlate with: gocql/gocql/issues/915

We should coordinate with upstream on a fix for this. In the meantime, it may be worth working around this in Kask by re-creating the session object when this error occurs.

Details

	Subject	Repo	Branch	Lines +/-
	Upgrade build environment & dependencies	mediawiki/services/kask	master	+18 -8
	Add component/gocql to bullseye	operations/puppet	production	+1 -0

Related Objects

Mentioned In: T327954: session storage: dissonant cluster status after reboot (was: 'cannot achieve consistency level' errors)
T327524: Kask: gocql pool errors after repeated Cassandra outages
rMSKS21cc94eb43a5: Upgrade build environment & dependencies
Mentioned Here: T253244: Upstream gocql bug effects Kask

Event Timeline

Eevans created this task. May 27 2021, 4:37 PM

Restricted Application added a subscriber: Aklapper. May 27 2021, 4:37 PM

BPirkle triaged this task as Medium priority. Jun 1 2021, 9:09 PM

BPirkle edited projects, added Platform Team Workboards (Clinic Duty Team); removed Platform Engineering.

BPirkle moved this task from Inbox to Later on the Platform Team Workboards (Clinic Duty Team) board.

Eevans raised the priority of this task from Medium to High. Oct 17 2022, 4:24 PM

Eevans added a project: Cassandra.

Eevans removed a project: Platform Team Workboards (Clinic Duty Team). Oct 18 2022, 1:18 AM

Eevans updated the task description.

Eevans moved this task from Backlog to Next on the Cassandra board. Oct 25 2022, 12:35 AM

Kask's dependencies are sourced entirely from Debian; the rationale is documented here. The most current version of the gocql driver in any version of Debian is 0.0~git20191102.0.9faa4c0-4 (the version we are already using). Continuing this practice will mean creating an updated package and adding it to a repository (preferably Debian, but possibly our own in the near term).

Updating to the latest gocql driver release (1.2.1 as of this writing) will roughly require:

Packaging golang-github-pierrec-lz4.v4-dev (not currently in any version of Debian, but package source already exists on Salsa), and uploading it to sid
Updating golang-github-gocql-gocql-dev, and uploading it to sid
Uploading the updated golang-github-pierrec-lz4.v4-dev & golang-github-gocql-gocql-dev packages to apt.wikimedia.org (as an interim solution)
(Eventually) uploading golang-github-pierrec-lz4.v4-dev & golang-github-gocql-gocql-dev to bullseye-backports
(Eventually) removing golang-github-pierrec-lz4.v4-dev & golang-github-gocql-gocql-dev from apt.wikimedia.org

The alternative would be to migrate Kask to Go modules, and henceforth source dependencies from their respective GitHub repos. I still wholeheartedly believe in the rationale for Debian-sourced dependencies, but feel compelled to present this option since the former will take some hours of work, and the latter...minutes.

In T283838#8358215, @Eevans wrote:

[ ... ]

To update to the latest gocql driver release (1.2.1 as of the time of this writing), will roughly require:

Packaging golang-github-pierrec-lz4.v4-dev (not currently in any version of Debian, but package source already exists on Salsa), and uploading it to sid

...