Increase retention of training data
Closed, Resolved, Public

Description

Some years ago we received approval to update our query_clicks data processing pipeline and extend its retention period to 13 months, but the change has not yet been implemented. Most of the work is adjusting the query_clicks hourly and daily Airflow DAGs to follow the requirements set out in the approval.

Related approval: https://phabricator.wikimedia.org/T235858
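
For illustration only, here is a minimal sketch of how a 13-month retention cutoff might be computed when deciding which dated partitions to drop. The function names (`months_before`, `is_expired`), the use of calendar months, and the choice to keep the partition that falls exactly on the cutoff are all assumptions; the actual requirements are in the approval task and are not restated here, and this is not the code used in the DAGs.

```python
import calendar
from datetime import date


def months_before(ref: date, months: int) -> date:
    """Return the date `months` calendar months before `ref`.

    Hypothetical helper: the day is clamped to the last day of the
    target month (e.g. Mar 31 minus 13 months gives Feb 28).
    """
    # Zero-based month index makes the year/month arithmetic a plain divmod.
    idx = ref.year * 12 + (ref.month - 1) - months
    year, month0 = divmod(idx, 12)
    month = month0 + 1
    last_day = calendar.monthrange(year, month)[1]
    return date(year, month, min(ref.day, last_day))


def is_expired(partition_date: date, today: date, retention_months: int = 13) -> bool:
    """True if a partition is older than the retention window.

    Assumption: a partition dated exactly on the cutoff is still kept.
    """
    return partition_date < months_before(today, retention_months)


if __name__ == "__main__":
    today = date(2024, 3, 25)
    print(months_before(today, 13))
    print(is_expired(date(2023, 2, 24), today))
```

Computing the cutoff in calendar months rather than a fixed number of days keeps the window aligned with how a "13 months" requirement is usually read, but whether the approval intends calendar months or a day count would need to be checked against the approval itself.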

Event Timeline

Gehel triaged this task as High priority. Mar 25 2024, 4:37 PM
Gehel moved this task from needs triage to ML & Data Pipeline on the Discovery-Search board.