Page MenuHomePhabricator

Understand Termbox SSR timouts better
Open, Needs TriagePublic

Description

As pointed out by jayme on #wikimedia-serviceops, the timeouts in the Termbox SSR service on Kubernetes are constantly increasing.

Source: https://logstash-next.wikimedia.org/goto/05f9a05c7763f95efccc09aed9a32e9e

We should understand what causes these timeouts, specifically. That knowledge should be documented (probably at https://wikitech.wikimedia.org/wiki/WMDE/Wikidata/SSR_Service).

Further actions may or may not follow from understanding these timeouts.

What we already know:

  • these timeouts also happen on smallish entities, not only on large ones