Page MenuHomePhabricator

Investigation: Migrating historical SurveyMonkey data
Open, Needs TriagePublicSpike

Description

  • We'd like to explore Fileserver as a potential solution to storing old SurveyMonkey data - we'd need to be able to access and manipulate this data occasionally.

Event Timeline

Require ability for DR to review and remove data (e.g. for GDPR requests) globally for a supporter

Desired completion date August 2023

Hi @AKanji-WMF , I'm still on hold for completing survey data-deletions in the hope that this solution makes it much more efficient.
Would you have an estimate on when we might be able to explore it?
If it will be a long time, I should consider going back to the manual method to ensure GDPR compliance.

Hi @HNordeenWMF - I'll move this to Sprint +1 so the team talks about it next Tuesday - in an initial poll in IRC there seemed to be optimism that it was relatively straightforward.

thanks @AKanji-WMF ! The key would be having a way to search through the downloaded documents' contents. I did some testing, thinking I could just store all the downloaded files in the file server and use my computers indexing, but I'm not able to search the contents of files within the fileserver :(

Dwisehaupt changed the subtype of this task from "Task" to "Spike".Oct 16 2023, 8:12 PM
Dwisehaupt subscribed.

Moving the type to spike since it is an investigation task.

Hi @AKanji-WMF is this something I could meet with someone about in the next 2 weeks to discuss? (Or I could drop by office hours, let me know what's best to help move it forward)

Documenting our requirements if we are planning to retain the last 5 years of email address data:

  • Download and store response data for around 40 distinct surveys & save in secure location
  • Upon receiving a data deletion request from DR, search across ALL of the saved response datasets for their email address
  • If a match is found for that email address, easily delete their response.

Awaiting Google Drive implementation - would support requirements listed above.