Page MenuHomePhabricator

distributed tracing v0 [minimum viable]
Open, Needs TriagePublic

Description

Epic parent task for tracking a v0 minimal deployment of distributed tracing.

Summary of current thinking & discussions:

  • We'll be deploying Jaeger on our k8s, as it is essentially the mature FLOSS option
    • Will use OpenSearch as the backing store
  • But we'll be using OTel Collector (rather than jaeger-agent) as the local daemon for collecting and exporting trace data to Jaeger, as it gives us flexibility in the future
  • To begin with, it's likely that the only 'application' actually exporting trace data will be Envoy. For Mediawiki it serves as both the TLS terminator for incoming requests, and as a proxy for outgoing requests to services, so it's a single point that captures a lot of data simply by writing some configuration stanzas.
  • We'll prioritize modifying Mediawiki and service-template-node to propagate the various tracing metadata headers from incoming requests to outgoing requests, which Envoy will be able to see

Related Objects

StatusSubtypeAssignedTask
OpenNone
OpenNone
ResolvedClement_Goubert
ResolvedClement_Goubert
ResolvedClement_Goubert
ResolvedRLazarus
ResolvedRLazarus
ResolvedJoe
Resolvedfgiunchedi
Resolvedjhathaway
Resolvedjhathaway
Resolvedfgiunchedi
OpenNone
Resolvedpmiazga
Resolvedpmiazga
Resolvedpmiazga
ResolvedJdforrester-WMF
ResolvedEBernhardson
OpenNone
OpenNone
Resolvedpmiazga
Resolvedfgiunchedi
Resolvedfgiunchedi
Resolvedjhathaway
Resolvedjhathaway
Resolvedjhathaway
Resolvedjhathaway
ResolvedJMeybohm
ResolvedClement_Goubert
OpenNone
OpenNone
Resolvedfgiunchedi
ResolvedNone
OpenNone
OpenKrinkle
ResolvedSpikepmiazga
Resolved Aprum
Resolvedfgiunchedi
Resolvedakosiaris
Resolvedakosiaris
ResolvedCDanis
Resolvedfgiunchedi
OpenCDanis
ResolvedCDanis
OpenNone
Resolvedfgiunchedi
OpenCDanis
ResolvedCDanis
Resolvedfgiunchedi
OpenNone
ResolvedCDanis
ResolvedCDanis
ResolvedBTullis
ResolvedCDanis
ResolvedCDanis
Resolvedherron
OpenNone

Event Timeline

Change #1035829 had a related patch set uploaded (by CDanis; author: CDanis):

[operations/deployment-charts@master] jaeger: link to Mediawiki debug Logstash

https://gerrit.wikimedia.org/r/1035829

Change #1035829 merged by jenkins-bot:

[operations/deployment-charts@master] jaeger: link to Mediawiki debug Logstash

https://gerrit.wikimedia.org/r/1035829

Change #1072248 had a related patch set uploaded (by CDanis; author: CDanis):

[operations/deployment-charts@master] wikifunctions: enable tracing

https://gerrit.wikimedia.org/r/1072248

Change #1072248 merged by jenkins-bot:

[operations/deployment-charts@master] wikifunctions: enable tracing

https://gerrit.wikimedia.org/r/1072248

Change #1072251 had a related patch set uploaded (by CDanis; author: CDanis):

[operations/deployment-charts@master] mw-wikifunctions: tracing at 100%

https://gerrit.wikimedia.org/r/1072251

Change #1072251 merged by jenkins-bot:

[operations/deployment-charts@master] mw-wikifunctions: tracing at 100%

https://gerrit.wikimedia.org/r/1072251