Page MenuHomePhabricator

Request creation of wmgmc-monitoring VPS project
Closed, ResolvedPublic

Description

Project Name: wmgmc_monitoring

Developer account usernames of requestors: @XtexChooser @Yiming

Purpose: It will be used for observability infrastructures of WMGMC Technical Group.

Brief description:
As there are more and more services running by WMGMC Tech, it becomes important than we can monitor our services easily and get notified when some of them are broken.
After an internal tech RFC (https://issues.cnuser.wiki/T20), we selected Cloud VPS to host our observability infrastructures, including OTel Collector (opentelemetry-collector-contrib), Prometheus and Grafana OSS.
Previously we planned to run a simple uptime monitor like Gatus or Uptime Kuma but soon realized that we will miss lots of important metrics which may truly expose problems.

How soon you are hoping this can be fulfilled: this month before June.

Event Timeline

Hi @XtexChooser!

Would you mind if we name it wmgmc_observability, wmgmc_monitoring or similar?
I say because CloudVPS projects should not be "generic", but tied to a specific function or project. That seems to be what you want, but the name is too generic.

From https://wikitech.wikimedia.org/wiki/Help:Cloud_VPS_project#Reviews_of_Cloud_VPS_Project_requests:

The majority of project requests are approved, but if your request involves any of the following items, it may require further discussion:
...
"Umbrella" projects with a broad scope, such as all the work to be done by an engineering team or a large problem space.

"Umbrella" projects with broad scopes are difficult to track over time because of organizational changes and lack of continuity in ownership.

Thanks!
(+1 from me if the name is changed, if meant to be kept generic, might need some extra discussion)

Hi @dcaro, wmgmc_monitoring is okay. Thanks!

XtexChooser renamed this task from Request creation of wmgmc VPS project to Request creation of wmgmc_monitoring VPS project.Apr 18 2025, 10:00 AM
XtexChooser updated the task description. (Show Details)
fnegri changed the task status from Open to In Progress.Apr 18 2025, 10:13 AM
fnegri claimed this task.
fnegri moved this task from Inbox to Approved on the Cloud-VPS (Project-requests) board.

Mentioned in SAL (#wikimedia-cloud-feed) [2025-04-18T10:27:59Z] <fnegri@cloudcumin1001> END (PASS) - Cookbook wmcs.vps.create_project (exit_code=0) for project wmgmc-monitoring in eqiad1 (T391742)

fnegri renamed this task from Request creation of wmgmc_monitoring VPS project to Request creation of wmgmc-monitoring VPS project.Apr 18 2025, 10:28 AM
fnegri closed this task as Resolved.

Project created! I had to change the name again from wmgmc_monitoring to wmgmc-monitoring, because underscores can cause issues with DNS.