To respect our data retention policy the files in this directory need to be pruned appropriately. They are all of the format:
NameOfThing.log-YYYYMMDD.gz
To respect our data retention policy the files in this directory need to be pruned appropriately. They are all of the format:
NameOfThing.log-YYYYMMDD.gz
Adding @Ottomata and a link to T84618 which is still pending with a number of open subtasks. IIRC some of these were being kept longer due to requests from legal but that was quite some time back. Where are we on that now?
Change 253601 had a related patch set uploaded (by Addshore):
Prune stat1002 /a/mw-log/archive after 30 days
I believe the medawiki logs were rsync'd over at @Ironholds request. They are not all the logs, just two cirrus specific logs. There should be no blockers to clearing out old ones, afaik
The oldest file i see is CirrusSearchRequests.log-20150726.gz. A 90 day retention should have deleted it on Oct 24th i believe, so it seems to be specified but is not working.
Change 253968 had a related patch set uploaded (by Ottomata):
Use mtime instead of ctime when considering file retention, fix retention for mw-logs on stat1002
Change 253968 merged by Ottomata:
Use mtime instead of ctime when considering file retention, fix retention for mw-logs on stat1002
Ok, yup, bug! the job that removed old files was using ctime instead of mtime, and apparently these files had something changed about them more recently than 90 days.
Along the way, I noticed that the retention jobs remove files in a given destination dir. Since all of these different logs were being rsynced into the same directory, the one with the shortest retention (api.log: 30 days) will win out. I've modified the rsyncs so that they copy into directories named after the log to fix this problem, e.g. /a/mw-log/CirrusSearchRequests/ etc.
It would be really good if it could have been explicitly called out (with a ping) that the directories were changing. Several data collection scripts silently broke on the 17th.
Aye ok. I should have been more explicit about this. (You were CCed on this ticket though! :) )