Page MenuHomePhabricator

Partition event-data daily instead of hourly (for sanitized data)
Closed, DuplicatePublic

Description

This would provide 2 benefits:

  • Smaller number of bigger files for events with low volumes (less pressure on HDFS)
  • Smaller number of partitions to maintain and work with (less pressure on metastore)

Event Timeline

Milimetric renamed this task from Partition event-data daily instead of hourly to Partition event-data daily instead of hourly (for sanitized data).Feb 28 2019, 5:33 PM
Milimetric triaged this task as Low priority.
Milimetric moved this task from Incoming to Event Platform on the Analytics board.
Milimetric moved this task from Event Platform to Data Quality on the Analytics board.