Using paws-nfs-1 as a sample, it is my understanding this is a snapshot of paws nfs usage from March of 2022. There are a total of 4227 home directories. The following chart describes their usage:
data usage | > 1M | > 10M | > 100M | > 1G | > 10G |
# of users | 874 | 532 | 206 | 78 | 12 |
% of users | 20.7 | 12.6 | 4.9 | 1.8 | 0.3 |
% of data | 99.9 | 99.7 | 97.8 | 89.3 | 56.0 |
Of the largest directories, 1/3 do not appear to have been used in over a year prior to this snapshot (newest file was over a year older)
Considering that PAWS does not give a user very much compute it is questionable how much value is derived from being able to have a large amount of local data. Meanwhile the cost is increased usage and reduced portability of PAWS itself. Considering that only 1.8% of users have exceeded using 1G of storage and the large amount of storage that can be recovered (we wouldn't get 89.3 percent back as people could still use up to 1G, but we would probably get 80% back) it seems reasonable to investigate if we can quota in this regard and refer people who need more to toolforge.