T217350
Partition event-data daily instead of hourly (for sanitized data)
Closed, Duplicate
Public
Assigned To: None
Authored By: JAllemandou
Feb 28 2019, 5:20 PM
2019-02-28 17:20:37 (UTC+0)
Tags: Analytics (Data Quality)
Referenced Files: None
Subscribers: Aklapper, JAllemandou, Tbayer
Description
Partitioning daily instead of hourly would provide two benefits:
- Fewer, larger files for low-volume events (less pressure on HDFS)
- Fewer partitions to maintain and query (less pressure on the metastore)
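As a rough illustration of the second point, the sketch below counts time partitions for one table under each scheme. The 90-day window and the helper name `partition_count` are hypothetical and chosen only for the arithmetic; the task itself does not state a retention period. The file-count benefit follows the same ratio only under the assumption that each partition holds at least one file.

```python
from datetime import date

HOURS_PER_DAY = 24


def partition_count(start: date, end: date, hourly: bool) -> int:
    """Count time partitions covering [start, end), one per hour or one per day."""
    days = (end - start).days
    return days * HOURS_PER_DAY if hourly else days


# Hypothetical 90-day window for a single event table (assumption, not from the task).
start, end = date(2019, 1, 1), date(2019, 4, 1)
hourly = partition_count(start, end, hourly=True)
daily = partition_count(start, end, hourly=False)

print(f"hourly partitions: {hourly}")   # 2160
print(f"daily partitions:  {daily}")    # 90
print(f"reduction factor:  {hourly // daily}x")  # 24x
```

The factor of 24 applies per table, so the total saving on the metastore scales with the number of event tables being kept.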
Event Timeline
JAllemandou created this task.
Feb 28 2019, 5:20 PM
2019-02-28 17:20:37 (UTC+0)
Restricted Application added a subscriber: Aklapper.
Feb 28 2019, 5:20 PM
2019-02-28 17:20:38 (UTC+0)
Milimetric renamed this task from "Partition event-data daily instead of hourly" to "Partition event-data daily instead of hourly (for sanitized data)".
Feb 28 2019, 5:33 PM
2019-02-28 17:33:16 (UTC+0)
Milimetric triaged this task as Low priority.
Milimetric moved this task from Incoming to Event Platform on the Analytics board.
Milimetric moved this task from Event Platform to Data Quality on the Analytics board.
Tbayer subscribed.
Mar 4 2019, 8:38 PM
2019-03-04 20:38:00 (UTC+0)
JAllemandou closed this task as a duplicate of T236794: Find a strategy to mitigate small-files handling for long-term kept events.
Oct 29 2019, 3:01 PM
2019-10-29 15:01:27 (UTC+0)