
Create script that returns oozie time intervals every time a coordinator is started from a cron job
Closed, Resolved | Public | 5 Estimated Story Points

Description

The mediarequest per file metric needs to be backfilled from 2015. Loading all of this data in one continuous run would put a lot of strain on Cassandra. To avoid that, we need to launch coordinators that each load a fixed number of days of data per calendar day (we have agreed on 20). The coordinators will be started by a cron job that runs every day at 5 AM.

We need a small Python script to get the correct time range for each backfilling day.
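
A minimal sketch of what such a script could compute, not the implementation that was eventually merged. It assumes the cron job runs once per calendar day, that the date of the first cron run is known, and that the coordinator takes start and end times in the `yyyy-MM-ddTHH:mmZ` form used for Oozie coordinator properties. The function name `backfill_interval`, its parameters, and all dates in the example are hypothetical.

```python
from datetime import date, timedelta

# Timestamp layout assumed for the coordinator start/end properties (midnight UTC).
OOZIE_FORMAT = "%Y-%m-%dT00:00Z"


def backfill_interval(today, first_run, backfill_start, backfill_end, days_per_run=20):
    """Return the (start, end) strings for today's coordinator, or None.

    today          -- date on which the cron job is running
    first_run      -- date of the very first cron run (hypothetical anchor)
    backfill_start -- first day of data to backfill (inclusive)
    backfill_end   -- day at which the backfill stops (exclusive)
    days_per_run   -- number of data days loaded per cron run (20 was agreed)
    """
    # How many daily cron runs have happened before this one.
    run_index = (today - first_run).days
    if run_index < 0:
        return None  # the cron job has not started yet

    start = backfill_start + timedelta(days=run_index * days_per_run)
    if start >= backfill_end:
        return None  # everything has already been backfilled

    # The last chunk may be shorter than days_per_run.
    end = min(start + timedelta(days=days_per_run), backfill_end)
    return start.strftime(OOZIE_FORMAT), end.strftime(OOZIE_FORMAT)


if __name__ == "__main__":
    # Illustrative dates only: first cron run on 2020-01-01, backfilling 2015-01-01 onwards.
    interval = backfill_interval(
        today=date(2020, 1, 2),
        first_run=date(2020, 1, 1),
        backfill_start=date(2015, 1, 1),
        backfill_end=date(2015, 2, 15),
    )
    print(interval)
```

Deriving the interval from the number of days since the first run keeps the script stateless, so the cron job does not need to remember where the previous run stopped. The trade-off is that a skipped cron day leaves a gap that has to be rerun by hand.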