toolsdb: evaluate storage usage by some tools
Open, Low, Public

Description

I just noticed this:

aborrero@clouddb1001:~$ sudo du -cks /srv/labsdb/data/* | sort -rn | head
2566525640	total
1151204360	/srv/labsdb/data/s51230__linkwatcher
151231476	/srv/labsdb/data/s51434__mixnmatch_p
129605716	/srv/labsdb/data/s53220__quickstatements_p
107163660	/srv/labsdb/data/ibdata1
100784876	/srv/labsdb/data/s53685__editgroups
63725868	/srv/labsdb/data/s51114__enwp10
60054664	/srv/labsdb/data/s53952__freebase_p
50662520	/srv/labsdb/data/s51499__wikiminiatlas
50048072	/srv/labsdb/data/s51698__yetkin

Apparently we have a few heavy users:

  • s51230__linkwatcher is using 1.1TB of data, which is about 32% of the total storage available on toolsdb (3.4TB as I write this).
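The arithmetic behind that percentage can be reproduced from the `du` output. This is a minimal sketch, not something from the original task: `du_share` is a hypothetical helper name, and it assumes tab-separated `du -ck` style input, where sizes are in 1024-byte blocks (KiB). On that basis 1151204360 KiB is roughly 1.07 TiB, or about 1.18 TB in decimal units. With no argument the helper measures each entry against the `total` line; an optional argument supplies a capacity in KiB instead.

```shell
#!/bin/sh
# Hypothetical helper: print each entry's share of a reference size.
# Input on stdin: `du -ck` style lines (size in KiB, a tab, then the path).
# Usage: du_share [capacity_kib]
# Without capacity_kib, the "total" line emitted by `du -c` is the reference.
du_share() {
    awk -F '\t' -v cap="${1:-0}" '
        $2 == "total" { total = $1; next }    # grand total from du -c
        { size[$2] = $1; order[++n] = $2 }    # keep entries in input order
        END {
            ref = (cap > 0) ? cap : total
            if (ref <= 0) { print "no reference size" > "/dev/stderr"; exit 1 }
            for (i = 1; i <= n; i++) {
                p = order[i]
                printf "%5.1f%%  %.2f TiB  %s\n", 100 * size[p] / ref, size[p] / (1024 ^ 3), p
            }
        }'
}
```

It could be fed directly from the command above, for example `sudo du -cks /srv/labsdb/data/* | sort -rn | head | du_share`.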

See also: https://tool-db-usage.toolforge.org/

image.png (496×741 px, 57 KB)

The figures don't match exactly, but the top few positions are still occupied by the same tools.

Event Timeline

fnegri triaged this task as Low priority. Jul 3 2024, 2:38 PM
fnegri subscribed.

DB storage is tracked in the subtask T291782: Migrate largest ToolsDB users to Trove

NFS storage does not seem to be an immediate issue, but we should probably check if there are tools/users using a disproportionate amount of space, and maybe create an alert for that.
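One possible shape for such a check, as a rough sketch only: flag every entry whose share of the total exceeds a threshold. The helper name `flag_heavy`, the default threshold of 10%, and the exit-status convention are all assumptions made for illustration; none of this describes how alerts are actually set up for Toolforge.

```shell
#!/bin/sh
# Hypothetical check: list entries using more than a given share of the total.
# Input on stdin: `du -ck` style lines (size in KiB, a tab, then the path).
# Usage: flag_heavy [threshold_percent]   (assumed default: 10)
# Exit status: 0 if nothing exceeds the threshold, 1 if at least one entry does.
flag_heavy() {
    awk -F '\t' -v pct="${1:-10}" '
        $2 == "total" { total = $1; next }    # grand total from du -c
        { size[$2] = $1; order[++n] = $2 }    # keep entries in input order
        END {
            for (i = 1; i <= n; i++) {
                p = order[i]
                if (total > 0 && 100 * size[p] / total > pct) { print p; found = 1 }
            }
            exit (found ? 1 : 0)
        }'
}
```

A non-zero exit status makes the function easy to wire into a cron job or monitoring probe, which would still need a sensible threshold chosen for the actual storage in question.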