Adjust database retention settings
Closed, Resolved, Public

Description

Running 2 dumps per month will create about 2 x 60k tasks in the database, with associated XComs, logs, etc.

This means that we should expect about 1.5M tasks a year in our metadata store. Should we keep everything? We need to determine our database retention policy to make sure we're not clogging the database, while keeping appropriate backfill/investigation capabilities.
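
For reference, the yearly estimate above can be reproduced with a quick back-of-the-envelope calculation. This is only a sketch: the function and constant names are made up for illustration, and the inputs are the rough figures quoted in this description (2 dumps per month, about 60k tasks per dump), not measured values.

```python
# Rough inputs taken from the task description; these are estimates, not measurements.
DUMPS_PER_MONTH = 2
TASKS_PER_DUMP = 60_000
MONTHS_PER_YEAR = 12


def yearly_task_instances(dumps_per_month: int, tasks_per_dump: int) -> int:
    """Estimate how many task instances accumulate in the database per year."""
    return dumps_per_month * tasks_per_dump * MONTHS_PER_YEAR


total = yearly_task_instances(DUMPS_PER_MONTH, TASKS_PER_DUMP)
print(f"{total:,} task instances per year")  # prints "1,440,000 task instances per year"
```

The result, 1.44M, is what the description rounds to about 1.5M. It counts task instances only; the associated XComs and logs would add further rows on top of that.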

Event Timeline

brouberol triaged this task as Medium priority.
BTullis claimed this task.
BTullis subscribed.

I think we can close this, really.
After T392461 ("Batch dump jobs within a single airflow task to speed up DAG execution") was completed, we found that the number of task instances per dump is significantly smaller than first estimated.

So I'm OK with leaving the database retention at the default for now. If we start seeing database bloat, we can come back to it.