
Sabya (Sabya)
Volunteer Developer at Wikimedia Research

User Details

User Since
Feb 24 2016, 2:46 AM (291 w, 4 d)
Availability
Available
IRC Nick
sabya
LDAP User
Sabya
MediaWiki User
Unknown

Recent Activity

Nov 5 2016

Sabya added a comment to T143123: [Spike] Semi-supervised machine learning.

@Halfak which classifier algorithm should I use? The current production algorithms, or HashingVectorizer + GradientBoosting?

Nov 5 2016, 4:57 PM · artificial-intelligence, Spike, Machine-Learning-Team

Sep 15 2016

Sabya added a comment to T128087: [Spike] Investigate HashingVectorizer.

Also, regarding the ROC score difference between T128087#2600696 and the current result: the current one is correct. GridSearchCV is calculating it now; earlier I was calculating it myself using roc_auc_score.

Sep 15 2016, 6:36 PM · artificial-intelligence, Spike, revscoring, Machine-Learning-Team (Active Tasks)
Sabya added a comment to T128087: [Spike] Investigate HashingVectorizer.

Plots with and without sample weights:

Sep 15 2016, 6:20 PM · artificial-intelligence, Spike, revscoring, Machine-Learning-Team (Active Tasks)

Sep 14 2016

Sabya added a comment to T128087: [Spike] Investigate HashingVectorizer.

Agreed. I'm on it.

Sep 14 2016, 12:46 AM · artificial-intelligence, Spike, revscoring, Machine-Learning-Team (Active Tasks)

Sep 9 2016

Sabya added a comment to T128087: [Spike] Investigate HashingVectorizer.

Here is the plot.

Sep 9 2016, 4:56 PM · artificial-intelligence, Spike, revscoring, Machine-Learning-Team (Active Tasks)

Sep 8 2016

Sabya added a comment to T128087: [Spike] Investigate HashingVectorizer.

Makes sense. Will work on it.

Sep 8 2016, 1:27 AM · artificial-intelligence, Spike, revscoring, Machine-Learning-Team (Active Tasks)

Sep 7 2016

Sabya added a comment to T128087: [Spike] Investigate HashingVectorizer.

Here are the results from the grid search:

Best ROC AUC Score:
0.910445174634
Sep 7 2016, 4:40 AM · artificial-intelligence, Spike, revscoring, Machine-Learning-Team (Active Tasks)

Sep 2 2016

Sabya added a comment to T128087: [Spike] Investigate HashingVectorizer.

Here is the link to compare the results against:

Sep 2 2016, 2:23 PM · artificial-intelligence, Spike, revscoring, Machine-Learning-Team (Active Tasks)
Sabya added a comment to T128087: [Spike] Investigate HashingVectorizer.

Will try it

Sep 2 2016, 1:11 PM · artificial-intelligence, Spike, revscoring, Machine-Learning-Team (Active Tasks)

Sep 1 2016

Sabya added a comment to T128087: [Spike] Investigate HashingVectorizer.

New results with below params:

Sep 1 2016, 4:35 AM · artificial-intelligence, Spike, revscoring, Machine-Learning-Team (Active Tasks)

Aug 27 2016

Sabya added a comment to T128087: [Spike] Investigate HashingVectorizer.

@Halfak: results are out. Here you go:

Aug 27 2016, 3:48 AM · artificial-intelligence, Spike, revscoring, Machine-Learning-Team (Active Tasks)

Aug 25 2016

Sabya added a comment to T128087: [Spike] Investigate HashingVectorizer.

I am also pretty close to evaluating the first model with combined features 😃.

Aug 25 2016, 12:50 AM · artificial-intelligence, Spike, revscoring, Machine-Learning-Team (Active Tasks)

Aug 19 2016

Sabya added a comment to T128087: [Spike] Investigate HashingVectorizer.

I think I got the threshold wrong in my comment above. Now I understand that the threshold keeps the features whose importance is >= threshold, right? 0.0005 yields 375 features. I updated the notebook to show these features.

Aug 19 2016, 4:08 AM · artificial-intelligence, Spike, revscoring, Machine-Learning-Team (Active Tasks)
Sabya added a comment to T128087: [Spike] Investigate HashingVectorizer.

Shouldn't it be (2^20)*0.0005? It is 524.

Aug 19 2016, 1:53 AM · artificial-intelligence, Spike, revscoring, Machine-Learning-Team (Active Tasks)
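
The 524 figure in the comment above follows from the expression itself, assuming the hashed feature space has 2^20 columns (an assumption read off the expression, not confirmed elsewhere in this feed). This only checks the arithmetic, not whether the threshold should be read as a fraction of the feature count:

```latex
2^{20} \times 0.0005 = 1\,048\,576 \times 0.0005 = 524.288 \approx 524
```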

Aug 18 2016

Sabya added a comment to T128087: [Spike] Investigate HashingVectorizer.

@Halfak I've plotted the histogram of the GBC (without the other 77 features). Here is the exported version of the IPython output. I am exporting it and attaching it here to capture a historical snapshot. Do you think there is an obvious step at 0.0005 (that is, three zeros before the five)?

Aug 18 2016, 4:09 AM · artificial-intelligence, Spike, revscoring, Machine-Learning-Team (Active Tasks)

Aug 9 2016

Sabya reopened T128087: [Spike] Investigate HashingVectorizer as "Open".

Closed by mistake.

Aug 9 2016, 2:52 AM · artificial-intelligence, Spike, revscoring, Machine-Learning-Team (Active Tasks)

Jul 25 2016

Sabya closed T128087: [Spike] Investigate HashingVectorizer as Resolved.

For the record, the IRC log is here:

Jul 25 2016, 3:19 AM · artificial-intelligence, Spike, revscoring, Machine-Learning-Team (Active Tasks)

Jul 12 2016

Sabya moved T138699: Menu icon is not clickable on iOS from in-progress to Done on the GapFinder board.
Jul 12 2016, 2:30 AM · GapFinder

Jul 9 2016

Sabya added a comment to T138865: Issues with iOS.

Here is a pull request using the first approach:

Jul 9 2016, 6:06 AM · GapFinder
Sabya added a comment to T138699: Menu icon is not clickable on iOS.

The following answer solves the problem:

Jul 9 2016, 6:03 AM · GapFinder
Sabya claimed T138699: Menu icon is not clickable on iOS.
Jul 9 2016, 5:57 AM · GapFinder
Sabya moved T138699: Menu icon is not clickable on iOS from Backlog to in-progress on the GapFinder board.
Jul 9 2016, 5:56 AM · GapFinder
Sabya reopened T138699: Menu icon is not clickable on iOS, a subtask of T133202: Issues on mobile devices, as Open.
Jul 9 2016, 5:21 AM · Epic, GapFinder
Sabya reopened T138699: Menu icon is not clickable on iOS as "Open".
Jul 9 2016, 5:21 AM · GapFinder
Sabya added a comment to T138865: Issues with iOS.

I've found the reason behind the issues. The expression var browserLanguages = navigator.languages || [ navigator.language || navigator.userLanguage ]; assigns 'en-us' in iOS/Mac Safari, but in Chrome it assigns 'en'. 'en-us' cannot be found in sourceLanguages, so the value becomes undefined after the following filter:

Jul 9 2016, 4:53 AM · GapFinder
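
The mismatch described in the T138865 comment above (Safari reporting 'en-us' while the supported list only knows 'en') can be illustrated with a minimal sketch. This is not the filter or the fix from the pull request, neither of which is shown here. The `sourceLanguages` contents, the helper names `primaryLanguage` and `pickSourceLanguage`, and the fallback value are all hypothetical.

```javascript
// Hypothetical list of supported source languages; the real
// sourceLanguages list in GapFinder is not shown in the comment.
const sourceLanguages = ['en', 'fr', 'es'];

// Reduce a language tag such as 'en-US' or 'en-us' to its primary subtag 'en'.
function primaryLanguage(tag) {
  return String(tag).toLowerCase().split('-')[0];
}

// Return the first browser language that is supported, trying the exact
// (lowercased) tag first and then its primary subtag. If nothing matches,
// return the caller-supplied fallback instead of undefined.
function pickSourceLanguage(browserLanguages, supported, fallback) {
  for (const tag of browserLanguages) {
    if (!tag) continue; // navigator.language can be undefined in some browsers
    const exact = tag.toLowerCase();
    if (supported.includes(exact)) return exact;
    const primary = primaryLanguage(tag);
    if (supported.includes(primary)) return primary;
  }
  return fallback;
}

// In a browser, the input would come from the expression quoted in the comment:
//   navigator.languages || [navigator.language || navigator.userLanguage]
const safariLike = ['en-us']; // what the comment reports for iOS/Mac Safari
const chromeLike = ['en'];    // what the comment reports for Chrome

const fromSafari = pickSourceLanguage(safariLike, sourceLanguages, 'en');
const fromChrome = pickSourceLanguage(chromeLike, sourceLanguages, 'en');
```

With this approach both browser inputs resolve to the same supported language, and an unsupported or missing tag falls back to a default rather than leaving the value undefined.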
Sabya claimed T138865: Issues with iOS.
Jul 9 2016, 4:38 AM · GapFinder
Sabya moved T138865: Issues with iOS from Backlog to in-progress on the GapFinder board.
Jul 9 2016, 4:38 AM · GapFinder
Sabya updated the task description for T138865: Issues with iOS.
Jul 9 2016, 4:37 AM · GapFinder
Sabya added a parent task for T138865: Issues with iOS: T133202: Issues on mobile devices.
Jul 9 2016, 4:32 AM · GapFinder
Sabya added a subtask for T133202: Issues on mobile devices: T138865: Issues with iOS.
Jul 9 2016, 4:32 AM · Epic, GapFinder
Sabya merged T138699: Menu icon is not clickable on iOS into T138865: Issues with iOS.
Jul 9 2016, 4:31 AM · GapFinder

Jul 6 2016

Sabya added a comment to T134292: Feedback while searching.

@schana replied back

Jul 6 2016, 4:26 AM · GapFinder

Jul 4 2016

Sabya added a comment to T134292: Feedback while searching.

@schana: please look at the PR and share your thoughts there.

Jul 4 2016, 2:15 AM · GapFinder

Jul 3 2016

Sabya added a comment to T134292: Feedback while searching.

Here is a first working copy:

Jul 3 2016, 11:10 AM · GapFinder

Jul 1 2016

Sabya added a comment to T134292: Feedback while searching.

Question: what should the behavior be when I click on a search suggestion? Should it do anything other than fill the search text input with the suggestion's title?

Jul 1 2016, 1:54 AM · GapFinder

Jun 28 2016

Sabya updated the task description for T138865: Issues with iOS.
Jun 28 2016, 5:21 PM · GapFinder
Sabya added a comment to T138865: Issues with iOS.

IMG_0075.PNG (2×1 px, 97 KB)

Jun 28 2016, 5:18 PM · GapFinder
Sabya created T138865: Issues with iOS.
Jun 28 2016, 5:17 PM · GapFinder
Sabya added a comment to T138699: Menu icon is not clickable on iOS.

Yes. It still exists. Checked both in iOS Chrome and Safari.

Jun 28 2016, 4:53 PM · GapFinder

Jun 25 2016

Sabya added a comment to T134292: Feedback while searching.

@Jdrewniak there are dependencies on addEvent, mw, and getDevicePixelRatio. Could you please point me to the right sources for these?

Jun 25 2016, 2:14 AM · GapFinder

Jun 17 2016

Sabya claimed T134292: Feedback while searching.

@Jdrewniak Thanks. Let's go with the first one. An example of it being used would be very helpful.

Jun 17 2016, 2:18 AM · GapFinder

Jun 16 2016

Sabya moved T134295: Better placeholder image for articles with no image from in-progress to Done on the GapFinder board.
Jun 16 2016, 9:50 AM · GapFinder
Sabya added a comment to T134292: Feedback while searching.

Is the Wikipedia search suggestion box available as a widget, extension, or library that we can integrate, or do we have to build our own?

Jun 16 2016, 4:12 AM · GapFinder

Jun 10 2016

Sabya moved T134293: Improving the fitting of images in lists from in-progress to Done on the GapFinder board.
Jun 10 2016, 12:55 PM · GapFinder

Jun 9 2016

Sabya added a comment to T134293: Improving the fitting of images in lists.

@schana - I've updated the pull request to fetch bigger images (512px). Please LMK if it is good enough for the merge.

Jun 9 2016, 3:58 AM · GapFinder
Sabya added a comment to T134293: Improving the fitting of images in lists.

I can see that some articles do have images but the query API does not return any. What could be the reason? Example:

Jun 9 2016, 3:25 AM · GapFinder

Jun 8 2016

Sabya added a comment to T134293: Improving the fitting of images in lists.

In that case, I think 250px for the width is good enough, since a card is around 248px × 128px.

Jun 8 2016, 4:15 AM · GapFinder
Sabya added a comment to T135863: Ensure api requests in app are using https before 2016-06-12.

I did a quick check. If I open https://recommend.wmflabs.org/, all requests (including XHR, static) are made to https://. So, are we safe?

Jun 8 2016, 2:50 AM · GapFinder

Jun 7 2016

Sabya added a comment to T134293: Improving the fitting of images in lists.

Pginer-WMF - Also, would like to hear your opinion on images getting pixelated.

Jun 7 2016, 5:13 PM · GapFinder
Sabya added a comment to T134293: Improving the fitting of images in lists.

I like it. What about others? Do we need approval from the design team?

Jun 7 2016, 3:59 AM · GapFinder

May 30 2016

Sabya added a member for Machine-Learning-Team: Sabya.
May 30 2016, 11:22 AM

May 21 2016

Sabya added a comment to T131186: Articles with '/' in title break preview.

I think it will help if we can have a predictable way to recreate the situation. Where exactly can we hardcode and always return articles with '/'?

May 21 2016, 10:42 AM · GapFinder
Sabya added a comment to T134412: Use icons instead of unicode characters.

I can confirm that in Chrome on OS X it is red.

May 21 2016, 10:31 AM · GapFinder
Sabya added a comment to T134295: Better placeholder image for articles with no image.

Here is a pull request:

May 21 2016, 10:18 AM · GapFinder
Sabya added a comment to T134295: Better placeholder image for articles with no image.

Can we have an image like this: http://www.clker.com/clipart-15263.html

May 21 2016, 4:51 AM · GapFinder
Sabya moved T134295: Better placeholder image for articles with no image from Next Up to in-progress on the GapFinder board.
May 21 2016, 4:50 AM · GapFinder
Sabya moved T134293: Improving the fitting of images in lists from Next Up to in-progress on the GapFinder board.
May 21 2016, 4:37 AM · GapFinder
Sabya added a comment to T134293: Improving the fitting of images in lists.

Here is the pull request:

May 21 2016, 4:36 AM · GapFinder
Sabya claimed T134293: Improving the fitting of images in lists.

I tried this.

May 21 2016, 4:09 AM · GapFinder

May 20 2016

Sabya moved T134296: Highlight to affect the whole card from in-progress to Done on the GapFinder board.
May 20 2016, 5:06 PM · GapFinder
Sabya added a member for Machine-Learning-Team (Active Tasks): Sabya.
May 20 2016, 3:34 AM

May 18 2016

Sabya added a comment to T132581: [Spike] Proof of concept damage detection with hash vectors.

@JustinOrmont, @Halfak: I have a few questions:

May 18 2016, 3:10 AM · Spike, editquality-modeling, Machine-Learning-Team
Sabya moved T134296: Highlight to affect the whole card from Next Up to in-progress on the GapFinder board.
May 18 2016, 2:53 AM · GapFinder
Sabya added a comment to T134296: Highlight to affect the whole card.

I have created a pull request:

May 18 2016, 2:11 AM · GapFinder

May 15 2016

Sabya added a comment to T134296: Highlight to affect the whole card.

I could think of two ways to achieve this.

May 15 2016, 10:58 AM · GapFinder

May 13 2016

Sabya added a comment to T132581: [Spike] Proof of concept damage detection with hash vectors.

@JustinOrmont Thank you for the detailed input, much appreciated :)

May 13 2016, 4:30 AM · Spike, editquality-modeling, Machine-Learning-Team
Sabya created T135208: Retain last chosen source and target language preference.
May 13 2016, 4:07 AM · GapFinder

May 11 2016

Sabya added a member for GapFinder: Sabya.
May 11 2016, 3:24 AM
Sabya added a comment to T132581: [Spike] Proof of concept damage detection with hash vectors.

Here are the new scores with n_estimators=200 and learning_rate = 0.05

May 11 2016, 1:19 AM · Spike, editquality-modeling, Machine-Learning-Team

May 10 2016

Sabya committed rTESTREVSCORINGAGAIN496a90fffef9: Reuse pickled and saved features, labels (authored by Sabya).
May 10 2016, 5:59 PM
Sabya added a comment to T132581: [Spike] Proof of concept damage detection with hash vectors.

How are you parameterizing the GradientBoostingClassifier model when constructing it? I think that it would be worthwhile to try large values for n_estimators and small values for learning_rate. I'm not sure what kind of effect this will have so it will certainly be an experiment.

May 10 2016, 4:09 AM · Spike, editquality-modeling, Machine-Learning-Team

May 9 2016

Sabya claimed T132581: [Spike] Proof of concept damage detection with hash vectors.
May 9 2016, 4:19 AM · Spike, editquality-modeling, Machine-Learning-Team
Sabya added a comment to T132581: [Spike] Proof of concept damage detection with hash vectors.

There were a few discussions over email. I'm pasting them here for future reference, in reverse chronological order.

May 9 2016, 4:17 AM · Spike, editquality-modeling, Machine-Learning-Team
Sabya committed rTESTREVSCORINGAGAIN8015d77a6638: Fixed score_model_iterative to include other_features (authored by Sabya).
May 9 2016, 3:38 AM

May 8 2016

Sabya committed rTESTREVSCORINGAGAINa9bd0e27a6b0: Now consider other existing extracted features too (authored by Sabya).
May 8 2016, 2:25 AM

May 5 2016

Sabya committed rTESTREVSCORINGAGAIN3c1b74a9824d: Added support for calculating PR AUC and ROC AUC (authored by Sabya).
May 5 2016, 4:10 AM
Sabya committed rTESTREVSCORINGAGAIN56c06dc99b48: Added support to store positive score. (authored by Sabya).
May 5 2016, 1:21 AM

May 3 2016

Sabya committed rTESTREVSCORINGAGAINf8ca3e2c77ed: Fix weight calculation was slightly off (authored by Sabya).
May 3 2016, 4:17 AM

May 2 2016

Sabya committed rTESTREVSCORINGAGAIN6f0f16aabe3b: Fix selecting only is_damaging=True for scoring (authored by Sabya).
May 2 2016, 4:29 AM
Sabya committed rTESTREVSCORINGAGAINf8e637465e8e: Merge branch 'master' of github.com:wiki-ai/revscoring into poc_hashing_vector (authored by Sabya).
May 2 2016, 4:22 AM
Sabya committed rTESTREVSCORINGAGAIN59a6d0dca69e: Added support for calculating iterative score. (authored by Sabya).
May 2 2016, 4:22 AM

Apr 30 2016

Sabya committed rTESTREVSCORINGAGAIN8876c6b32114: Merge branch 'master' of github.com:wiki-ai/revscoring into poc_hashing_vector (authored by Sabya).
Apr 30 2016, 8:40 AM
Sabya committed rTESTREVSCORINGAGAIN40268fa22780: Support building model, testing with examples. (authored by Sabya).
Apr 30 2016, 8:36 AM

Apr 26 2016

Sabya committed rTESTREVSCORINGAGAINe6ed3e44d8a7: Rough notes on designing the proof of concept (authored by Sabya).
Apr 26 2016, 4:54 AM
Sabya committed rTESTREVSCORINGAGAIN649690492ca6: Added support for working from sqlite (authored by Sabya).
Apr 26 2016, 4:54 AM

Apr 21 2016

Sabya committed rTESTREVSCORINGAGAINdbbea897f646: Initial commit (authored by Sabya).
Apr 21 2016, 4:50 PM

Apr 9 2016

Sabya placed T106638: Deploy precaching daemon for ORES up for grabs.

@schana interested in this task? It seems like setting this up in puppet makes sense. Here's the command we run to start the sync server:
ores precached stream.wikimedia.org https://ores.wmflabs.org --config ores-wikimedia-config/config/00-main.yaml

Apr 9 2016, 10:06 AM · User-Ladsgroup, ORES, Machine-Learning-Team (Active Tasks)

Mar 23 2016

Sabya moved T128087: [Spike] Investigate HashingVectorizer from Non-Project Work to Backlog on the Machine-Learning-Team (Active Tasks) board.
Mar 23 2016, 3:43 AM · artificial-intelligence, Spike, revscoring, Machine-Learning-Team (Active Tasks)

Mar 22 2016

Sabya claimed T128087: [Spike] Investigate HashingVectorizer.
Mar 22 2016, 5:46 PM · artificial-intelligence, Spike, revscoring, Machine-Learning-Team (Active Tasks)
Sabya added a comment to T106638: Deploy precaching daemon for ORES.

@Halfak Please review when you get a chance. Should I be assigning this card to you (the reviewer)? Not sure what workflow we follow here.

Mar 22 2016, 3:51 AM · User-Ladsgroup, ORES, Machine-Learning-Team (Active Tasks)

Mar 20 2016

Sabya moved T106638: Deploy precaching daemon for ORES from Backlog to Review on the Machine-Learning-Team (Active Tasks) board.
Mar 20 2016, 2:15 PM · User-Ladsgroup, ORES, Machine-Learning-Team (Active Tasks)
Sabya added a comment to T106638: Deploy precaching daemon for ORES.

Here is the commit for the change:

Mar 20 2016, 12:20 PM · User-Ladsgroup, ORES, Machine-Learning-Team (Active Tasks)

Mar 18 2016

Sabya added a comment to T106638: Deploy precaching daemon for ORES.

I have coded the part that installs the systemd service. I have created a new class using wikitech "manage puppets group". It is called role::labs::ores::precached.

Mar 18 2016, 5:49 PM · User-Ladsgroup, ORES, Machine-Learning-Team (Active Tasks)

Mar 12 2016

Sabya added a comment to T106638: Deploy precaching daemon for ORES.

Note to self from a chat with yuvi:

Mar 12 2016, 2:26 AM · User-Ladsgroup, ORES, Machine-Learning-Team (Active Tasks)

Mar 10 2016

Sabya added a comment to T106638: Deploy precaching daemon for ORES.

Is anyone working on this? If not, I'll take it up.

Mar 10 2016, 3:43 AM · User-Ladsgroup, ORES, Machine-Learning-Team (Active Tasks)

Feb 24 2016

Sabya added a member for ORES: Sabya.
Feb 24 2016, 3:27 AM
Sabya renamed T122676: Implement sentinel for ORES production Redis from Implement tewmproxy for ORES in production to Implement twemproxy for ORES in production.
Feb 24 2016, 2:48 AM · Machine-Learning-Team, ORES