Storage for banner history data
Closed, Declined | Public

Description

Banner history data for most of the year-end fundraising campaign was extracted from Hive EventLogging tables and is currently stored in a temporary location.

The extracted data is anonymized and approved by legal for retention beyond the 90-day limit.

We'd like to transfer it somewhere that would allow querying... It seems the best option might be to load it back into Hive, if possible, in its own location? Some additional permissions might be needed for this...

We could also consider what mechanism to use to do this on an ongoing basis...

Thanks!!

Event Timeline

@AndyRussG: Can we look at the data to make sure it is safe to retain? The risk normally comes from cross-checking datasets, and we have to evaluate that risk before we can confirm the dataset is safe to retain under our privacy policy. Please include us in your communications with legal so we can keep an eye on data coming through the pipeline.

Can you document the contents of the original data and what has been removed or anonymized? We can then ping security and establish together whether the dataset is safe for long-term storage.

@Nuria I just added you to an email about the legal requirements. TLDR: we are allowed to keep the data for now. We need a place to park the data so we can aggregate it. That will allow us to keep it for longer.

FYI, we are planning on improving Hive EventLogging integration next quarter: T153328

We are checking some points with legal here: T161656

ggellerman moved this task from Triage to Sprint +1 on the Fundraising-Backlog board.
fdans subscribed.

@DStrine: closing this task since there are no new updates. Feel free to reopen and ping us if you get back to it.