
Collect and display basic metrics for all tools (service groups)
Open, Medium, Public


Track basic metrics for each service group:

  • cpu hours used (I think we can get this from qacct)
  • disk space used (du -s)
  • database usage (number of rows in service group db)
  • number of raw hits to

Provide aggregate reports and reports per service group with daily granularity.
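As a rough illustration of the disk-space metric, here is a minimal sketch of a per-tool collector. It walks each service group's home directory and sums file sizes, which approximates `du -s` (the `/data/project` root, and walking instead of shelling out to `du`, are assumptions for the sketch; `du` counts allocated blocks, so the numbers will differ slightly):

```python
#!/usr/bin/env python3
"""Sketch: collect approximate disk usage per service group home dir."""
import os


def disk_usage_bytes(path):
    """Sum sizes of all regular files under `path` (roughly `du -s`)."""
    total = 0
    for root, dirs, files in os.walk(path, onerror=lambda e: None):
        for name in files:
            try:
                total += os.lstat(os.path.join(root, name)).st_size
            except OSError:
                pass  # file vanished or unreadable; skip it
    return total


def collect(tools_root="/data/project"):
    """Return {tool_name: bytes_used} for each directory under
    `tools_root` (the root path is an assumption)."""
    usage = {}
    for entry in sorted(os.listdir(tools_root)):
        home = os.path.join(tools_root, entry)
        if os.path.isdir(home):
            usage[entry] = disk_usage_bytes(home)
    return usage
```

A cron job could run `collect()` daily and append the results to whatever store backs the reports.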

Event Timeline

Restricted Application added a subscriber: Aklapper.
chasemp triaged this task as Medium priority. Apr 4 2016, 2:00 PM

One use of this would be proactively monitoring for large databases like the ones that are being looked at in T132431: labsdb1001 and labsdb1003 short on available space.

A big hammer method for checking user/tool database sizes:

  SELECT table_schema
  , sum( data_length ) as data_bytes
  , sum( index_length ) as index_bytes
  , sum( table_rows ) as row_count
  , count(1) as tables
  FROM information_schema.TABLES
  WHERE table_schema regexp '^[psu][0-9]'
  GROUP BY table_schema;

Something could run that once per day (or maybe even once a week) on each distinct database host for labs and log the data in a way that could be used to produce nice timeseries reports.
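The logging half of that could be as simple as appending date-stamped TSV lines, one per schema per day, which is easy to turn into a timeseries later. A minimal sketch (the column names follow the query's aliases; the file layout and function names are assumptions):

```python
#!/usr/bin/env python3
"""Sketch: append per-schema size snapshots as date-stamped TSV lines."""
import datetime

# Columns match the aliases in the information_schema query above.
COLUMNS = ("table_schema", "data_bytes", "index_bytes", "row_count", "tables")


def snapshot_lines(rows, day=None):
    """Format query result rows (tuples matching COLUMNS) as
    date-prefixed TSV lines suitable for appending to a log."""
    day = day or datetime.date.today().isoformat()
    return ["\t".join([day] + [str(v) for v in row]) for row in rows]


def append_snapshot(rows, path, day=None):
    """Append one snapshot's worth of lines to the log file at `path`."""
    with open(path, "a") as fh:
        for line in snapshot_lines(rows, day):
            fh.write(line + "\n")
```

With one file per database host, a plotting script can then group by `table_schema` and chart growth over time.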

On 2016-04-21 there were 223 distinct user/tool schemas on tools.labsdb. Other hosts have far fewer (c1=58, c3=35). now provides point-in-time database usage information.