Fri, Sep 8
@bmansurov following up on our email back-and-forth:
@Capt_Swing just sent you an invitation for tomorrow. Given that you may want to put this as part of your Q2 goals, let's talk before I leave so you have enough time to think about it and explore it in quiet. ;)
Wed, Sep 6
@Capt_Swing I think now is a good time to start thinking about this objective, given that the article expansion recommendation is in a more mature place (we are very close to being able to provide section recommendations, for example). Do you want to work on this objective and consider defining goals for Q2 for it? If yes, how can I help? :)
We started with a high level design of a framework described here. The next steps are to dive deeper in every step and design an end-to-end system from the time the user arrives (registers an account) all the way to when they start doing their edits after being teamed up. We expect such a design and implementation to require a few months, likely spanning to Q3 (January-March 2018) with the goal of the experimentation (T171248) to happen in early Q4 (April 2018).
@DarTar based on the literature review (T175215) individual differences in aversion to conflict and self-confidence are the two main items that can cause the reduction in diversity in Wikipedia's editor pool when it comes to the imbalance in gender diversity. As explained in the the documentation on meta, TP12 will focus on the issue of self-confidence and how to boost that.
This task is completed now and the documentation is available at:
Tue, Sep 5
@DarTar should we assign this Epic task to you?
Fri, Sep 1
working on this with Michele, partly based on Tiziano's PhD Qualification report.
@DarTar I'll need some engineering resources for this task and I know it's a hard one, hence my question. How do you recommend I proceed? I need someone to start working on this task very soon, so we have the system ready for the last week of September or the first week of October. I can start pitching it to a few people outside of the team with the hope for mercy. ;)
Thu, Aug 31
Michele Catasta built this tool and we tested in in Wikimania. The tool lived at http://18.104.22.168/ I'm checking with Michele to see if we can keep it up, or get some screenshots of it, at the very least. (Michele is not on phab, so I assigned this task to myself, which is not correct.:)
We did skip submitting the full paper as we needed more time to tighten the research and paper. Next target is The Web Conference 2018, formerly known as WWW2018.
@ArielGlenn thanks for the update. I marked the task as Stalled. We can turn it back to Open whenever the status changes. Please do feel free to change the status whenever.
This task really never happened this way: the MOU expired eventually. Declining this task.
update: I'm waiting for instructions on what's needed from me. This task is going to stay in blocked until the instructions are in. :)
Assigning this to DarTar as the PoC. :) Feel free to reassign as you see fit.
This is not relevant now. Declining.
This unfortunately never happened and it's too late now. Declining it.
@Halfak I'll move this to backlog. Please feel free to move it around as needed, or decline it if you see you won't be doing it. :)
Wed, Aug 30
@jmatazzoni I've updated this task based on the discussions we had today. I added some more details in the description. By September 8, I need to propose the goals for this objective that need to get done in Q2 on our end and for that, I need your input. :)
Tue, Aug 29
Mon, Aug 28
I see that the Arabic text in the banner is broken. I'm looking at this page. This should not go like this to Arabic speakers. :) Can someone look into it? (I don't know Arabic fluently, but I know enough to be able to explain to whoever picks this up what's wrong in the text.)
Aug 16 2017
re QuickSurveyResponses: is event_surveySessionToken the exact same value as in QuickSurveyInitiation? If yes, it's fine to drop userAgent from QuickSurveyResponses (note that in my previous note I suggest we keep hashed userAgent in Initiation table). event_pageid and event_pageTitle are both key for the analysis (it's important to know on which page the survey was shown). I don't mind losing this data for the 2016 survey, but it's too early to purge it for the 2017 survey. For the 2017 survey, I feel much more comfortable dropping the data in December 2017, after we get to do full analysis and hear extensive feedback. Again, I'm open to brainstorming to find other solutions.
re QuickSurveyInitiation: dropping clientIp (which is hashed) and userAgent for the survey we ran towards the end of June 2017 is too risky from the analysis pov and at this point. We will need at least another month of analysis before being able to clear this data for purging, to make sure everything is correctly linked. I'm assuming this is not a problem given that the data will still be kept under 90 days.
Aug 14 2017
@DarTar Uploading before the talk didn't work, but I uploaded my slides at https://meta.wikimedia.org/wiki/Research:Characterizing_Wikipedia_Reader_Behaviour/Robustness_across_languages#Presentations . video will be added as soon as it's uploaded.
Aug 9 2017
Aug 8 2017
"I am already working with Tilen, a visiting PhD student (just like I once myself was :)), on an algorithm to identify bad users in any platform, including Wikipedia. Initial experiments show that the algorithm performs well, also on a Wikipedia vandal identification dataset. The idea is to use it to find any type of bad user, including sockpuppets. I will send you some slides tomorrow so that you get a high level overview.
The plan is get the basic framework of the algorithm done before Tilen leaves, which is in late Sept, and then tune it specifically for Wikipedia after that."
@mforns I'm looking into the detail of this data and will get back to you soon. (I need to check with Florian who works with this data to make sure we don't remove something that he needs, and he's on vacation for three weeks. I'll try to reach out to him but this may be delayed. if this can't wait for a few weeks, please let me know and I'll make a decision without Florian's input.)
Summary of results from the meeting: there's generally support for this research. Next steps:
Aug 7 2017
per IRC discussion, assigned Host status to Jonathan. ;)
great. thanks @Capt_Swing .
We submitted the abstract, we will probably skip submitting the actual paper on 2017-08-12 as we have found a couple of new directions to expand the work for The Web Conference (WWW) instead.
@mobrovac thanks for looking into this with us. Your recommendation looks good to me. And I'm fine with replacing "create" with "creation" per GWicke's last comment. On our end, and as far as I can see, that doesn't create a limitation, and if it makes things more consistent on your end, let's go with it. I'd wait for @schana and @DarTar to chime in as well, just to make sure everyone is on board. Thanks.