May 7 2019
If this is successfully running elsewhere now, I don't need the data in my home directory anymore nor my account. I've backed up the various scripts, so I'm good to be removed.
Apr 12 2019
Just to close this out, our developer put together something that could work as a good starting place for integrating this work into the production stack: https://github.com/berkmancenter/mw-anomaly-detection. I don't see us doing more of this work right now, but if productionizing this becomes a priority for either us or WMF, we're not starting from scratch.
Apr 3 2019
My memory of setting these perms is long gone, but if I did mark it this way, it was probably because there is private data from the stats servers sitting in there. It's WMF's data, so as far as I'm concerned, do whatever you need to with the perms.
Feb 7 2019
I only setup the reports to be dumped to disk. I didn't know they were getting emailed out, so that must have been handled somewhere else.
Jun 26 2018
Update on this: We've finally secured some developer time and are in the process of signing a contract. Can we extend three more months, please?
Apr 2 2018
OK. I do think we will be able to collaborate on this - the timing just hasn't worked out yet. Can we extend for a few more months?
Mar 30 2018
I haven't been able to secure the developer time yet for this. It's unlikely I'll have anything to show for at least two months, during which time I won't need access.
Jan 3 2018
@Nuria Sounds good, I'll dig in. I'm familiar with the various RPCA implementations (I wrote the go one), so that part shouldn't be too much trouble.
@Nuria OK, I understand those issues. Thank you for taking the time to look through all that. There are a lot of groups in our research community (and the WMF community/staff, I believe) that would still find this data useful, and we have engineering time at our disposal, so productionizing this is not out of the question for us. The guts of this whole thing is really just a parameterized Hive query and some linear algebra - the remaining 90% was just flexibility for experimentation.
Dec 20 2017
Stephen LaPorte. Sorry, I don't know his username.
The other concern is that the output from these reports is supposed to be made publicly available by WMF. That's been agreed to in principle, but the process has not been worked out yet. As that's on WMF's side, I don't know how long figuring that out is going to take. If possible, I'd prefer to not lose access to the reports until after they've been made public, so I made a conservative estimate of April.
I have a home directory on stat1005, so whatever accounts are necessary to access that.
Dec 19 2017
Jul 17 2017
I've been doing research with Zhou Zhou at WMF Legal (who I believe is zhousqaured on the list above). I'm external, so I should have an expiration date. Our research is almost wrapped, but if possible, I'd prefer to expire at the end of the year just to make sure any loose ends have time to get resolved.
Jan 15 2016
I've already signed the L3 form - sorry I forgot to mention it.