Sumit (Sumit)
User

Projects (7)

Today

  • Clear sailing ahead.

Tomorrow

  • Clear sailing ahead.

Tuesday

  • Clear sailing ahead.

User Details

User Since
Dec 16 2014, 4:23 PM (135 w, 4 d)
Availability
Available
IRC Nick
codezee
LDAP User
Sumit
MediaWiki User
Sumit.iitp

Recent Activity

Thu, Jul 20

GitHub <noreply@github.com> committed rOEQ957951e01838: Merge 3c11d1a779feff020a6142b9545ce816de6d91f1 into… (authored by Sumit).
Merge 3c11d1a779feff020a6142b9545ce816de6d91f1 into…
Thu, Jul 20, 3:45 PM
Sumit committed rOEQ3c11d1a779fe: Add label param for enwiki goodfaith in Makefile (authored by Sumit).
Add label param for enwiki goodfaith in Makefile
Thu, Jul 20, 3:45 PM

Mon, Jul 17

Sumit closed T170069: Add ORES technical documentation as Resolved.
Mon, Jul 17, 3:20 PM · Documentation, Scoring-platform-team
Sumit closed T170069: Add ORES technical documentation, a subtask of T148974: [Epic] Clean up ORES service documentation, as Resolved.
Mon, Jul 17, 3:20 PM · Scoring-platform-team, Epic, Documentation, ORES
Sumit moved T170069: Add ORES technical documentation from Active to Epics on the Scoring-platform-team board.
Mon, Jul 17, 3:20 PM · Documentation, Scoring-platform-team
Sumit added a comment to T170069: Add ORES technical documentation.

The above page is complete in documentation of technical details and is linked from https://www.mediawiki.org/wiki/ORES hence closing.

Mon, Jul 17, 3:20 PM · Documentation, Scoring-platform-team

Sun, Jul 16

Sumit closed T163009: Train/test damaging & goodfaith models for Albanian Wikipedia as Resolved.
Sun, Jul 16, 5:10 AM · Scoring-platform-team, artificial-intelligence, editquality-modeling
Sumit closed T163009: Train/test damaging & goodfaith models for Albanian Wikipedia, a subtask of T130213: [Epic] Edit quality models (damaging/goodfaith), as Resolved.
Sun, Jul 16, 5:10 AM · artificial-intelligence, Epic, editquality-modeling, Scoring-platform-team
Sumit moved T163009: Train/test damaging & goodfaith models for Albanian Wikipedia from Active to Done on the Scoring-platform-team board.
Sun, Jul 16, 5:10 AM · Scoring-platform-team, artificial-intelligence, editquality-modeling

Fri, Jul 14

GitHub <noreply@github.com> committed rOEQ0e41603a5bdd: Merge 0247b751f0e47ba3bfb62ed65999783fd4bb2f86 into… (authored by Sumit).
Merge 0247b751f0e47ba3bfb62ed65999783fd4bb2f86 into…
Fri, Jul 14, 7:53 PM

Thu, Jul 13

Sumit added a project to T170177: Test draftquality sentiment feature on Editquality: draftquality-modeling.
Thu, Jul 13, 3:38 PM · draftquality-modeling, artificial-intelligence, Scoring-platform-team

Wed, Jul 12

Sumit moved T170205: Add test to ensure timeout of functions taking too long from Review to Done on the Scoring-platform-team board.
Wed, Jul 12, 5:18 PM · Scoring-platform-team

Mon, Jul 10

GitHub <noreply@github.com> committed rORESc482c430b979: Merge 6f79568e9f2cf50df4bc73c35f29e2b658bc00fb into… (authored by Sumit).
Merge 6f79568e9f2cf50df4bc73c35f29e2b658bc00fb into…
Mon, Jul 10, 9:23 PM
Sumit committed rORES6f79568e9f2c: Remove flake8 errors (authored by Sumit).
Remove flake8 errors
Mon, Jul 10, 9:23 PM
GitHub <noreply@github.com> committed rORESbe70a99d44eb: Merge a7550e723751c2319a0c5d4a3b9ae78a01b5afbb into… (authored by Sumit).
Merge a7550e723751c2319a0c5d4a3b9ae78a01b5afbb into…
Mon, Jul 10, 9:11 PM
Sumit committed rORESa7550e723751: Add Timeout test for a function taking a long time (authored by Sumit).
Add Timeout test for a function taking a long time
Mon, Jul 10, 9:11 PM
Sumit added a subtask for T168965: Why don't timeouts work during long regular expression matching?: T170205: Add test to ensure timeout of functions taking too long.
Mon, Jul 10, 9:11 PM · revscoring, ORES, Scoring-platform-team, artificial-intelligence
Sumit added a parent task for T170205: Add test to ensure timeout of functions taking too long: T168965: Why don't timeouts work during long regular expression matching?.
Mon, Jul 10, 9:11 PM · Scoring-platform-team
Sumit added a comment to T170205: Add test to ensure timeout of functions taking too long.

https://github.com/wiki-ai/ores/pull/219

Mon, Jul 10, 9:10 PM · Scoring-platform-team
Sumit created T170205: Add test to ensure timeout of functions taking too long.
Mon, Jul 10, 9:10 PM · Scoring-platform-team
Sumit added a comment to T168369: Add language support for Albanian.

https://github.com/wiki-ai/revscoring/pull/335

Mon, Jul 10, 7:18 PM · Scoring-platform-team, Bad-Words-Detection-System, revscoring, artificial-intelligence
Sumit added a comment to T168369: Add language support for Albanian.

Hi, we're done, please see here https://www.mediawiki.org/wiki/Research:Revision_scoring_as_a_service/Word_lists/sq

Let us know if you seen any problems.

Thanks!

Mon, Jul 10, 7:18 PM · Scoring-platform-team, Bad-Words-Detection-System, revscoring, artificial-intelligence
Sumit created T170177: Test draftquality sentiment feature on Editquality.
Mon, Jul 10, 5:38 PM · draftquality-modeling, artificial-intelligence, Scoring-platform-team

Sun, Jul 9

Sumit created T170069: Add ORES technical documentation.
Sun, Jul 9, 6:48 AM · Documentation, Scoring-platform-team

Sat, Jul 8

GitHub <noreply@github.com> committed rODQf9dfb7665ca5: Merge 2e24b1e39f51207ba9c930131460dceb31b5e592 into… (authored by Sumit).
Merge 2e24b1e39f51207ba9c930131460dceb31b5e592 into…
Sat, Jul 8, 8:50 AM
Sumit committed rODQ3c47adb653e8: Take most common word sense for polarity score (authored by Sumit).
Take most common word sense for polarity score
Sat, Jul 8, 8:50 AM
Sumit committed rODQd568fe6b85bb: (WIP) Add feature for polarity using SentiWordnet (authored by Sumit).
(WIP) Add feature for polarity using SentiWordnet
Sat, Jul 8, 8:50 AM
Sumit committed rODQ2e24b1e39f51: ADD SentiWordnet requirement to README (authored by Sumit).
ADD SentiWordnet requirement to README
Sat, Jul 8, 8:50 AM
Sumit added a comment to T167305: Experiment with Sentiment score feature for draftquality.

New PR - https://github.com/wiki-ai/draftquality/pull/9

Sat, Jul 8, 8:34 AM · draftquality-modeling, artificial-intelligence, Scoring-platform-team

Fri, Jul 7

GitHub <noreply@github.com> committed rOEQb0d94fc6c5ec: Merge 0247b751f0e47ba3bfb62ed65999783fd4bb2f86 into… (authored by Sumit).
Merge 0247b751f0e47ba3bfb62ed65999783fd4bb2f86 into…
Fri, Jul 7, 7:35 PM
Sumit moved T156503: Build damaging/goodfaith models for Romanian Wikipedia from Review to Done on the Scoring-platform-team board.
Fri, Jul 7, 7:27 PM · artificial-intelligence, Scoring-platform-team, revscoring, editquality-modeling

Mon, Jul 3

GitHub <noreply@github.com> committed rODQ3086fdda8d50: Merge 5f8b47e72814e1deb54710c124b1e4c913dc1b46 into… (authored by Sumit).
Merge 5f8b47e72814e1deb54710c124b1e4c913dc1b46 into…
Mon, Jul 3, 6:04 PM
Sumit moved T167305: Experiment with Sentiment score feature for draftquality from Review to Active on the Scoring-platform-team board.
Mon, Jul 3, 3:52 PM · draftquality-modeling, artificial-intelligence, Scoring-platform-team

Sat, Jul 1

Sumit added a comment to T168369: Add language support for Albanian.

Left the following note on their talk pages:

Hi
Can you goto https://phabricator.wikimedia.org/T168369 and see if you can help in segregating a list of about 250 words in Albanian into badwords and informal words. We need these lists to help build damaging and goodfaith models for Albanian Wikipedia. A good way to do that would be to edit the https://meta.wikimedia.org/wiki/Research:Revision_scoring_as_a_service/Word_lists/sq and simply copy the generated list to badwords and informal words and remove the words that do not fall in the respective category. Your help is much appreciated! Let me know or leave a comment on the task itself in case of any issue.-Thanks!

Sat, Jul 1, 2:05 PM · Scoring-platform-team, Bad-Words-Detection-System, revscoring, artificial-intelligence
Sumit added a comment to T168369: Add language support for Albanian.

Hi @Margott @Liridon @Arianit Can you please goto https://meta.wikimedia.org/wiki/Research:Revision_scoring_as_a_service/Word_lists/sq and segregate the generated list words into badwords and informal words. Refer to task description for badwords and informal words definition.

Sat, Jul 1, 1:52 PM · Scoring-platform-team, Bad-Words-Detection-System, revscoring, artificial-intelligence

Thu, Jun 29

GitHub <noreply@github.com> committed rODQ94090c738a39: Merge 5f8b47e72814e1deb54710c124b1e4c913dc1b46 into… (authored by Sumit).
Merge 5f8b47e72814e1deb54710c124b1e4c913dc1b46 into…
Thu, Jun 29, 12:26 AM

Wed, Jun 28

GitHub <noreply@github.com> committed rODQe1414c7f1d22: Merge 5f8b47e72814e1deb54710c124b1e4c913dc1b46 into… (authored by Sumit).
Merge 5f8b47e72814e1deb54710c124b1e4c913dc1b46 into…
Wed, Jun 28, 5:53 PM

Tue, Jun 27

Sumit moved T156503: Build damaging/goodfaith models for Romanian Wikipedia from Active to Review on the Scoring-platform-team board.
Tue, Jun 27, 7:01 PM · artificial-intelligence, Scoring-platform-team, revscoring, editquality-modeling
GitHub <noreply@github.com> committed rOEQfb90656e903c: Merge 1cb2cf67d841a242358c5fc4ddc948b86e21f960 into… (authored by Sumit).
Merge 1cb2cf67d841a242358c5fc4ddc948b86e21f960 into…
Tue, Jun 27, 6:42 PM
Sumit committed rOEQ1cb2cf67d841: Add models and tuning reports (authored by Sumit).
Add models and tuning reports
Tue, Jun 27, 6:42 PM
Sumit committed rOEQf8fd6bc9b6b7: Add rowiki damaging, goodfaith models to Makefile (authored by Sumit).
Add rowiki damaging, goodfaith models to Makefile
Tue, Jun 27, 6:42 PM
Sumit committed rOEQ8b048f106562: Retain reverted autolabelled (authored by Sumit).
Retain reverted autolabelled
Tue, Jun 27, 6:42 PM
Sumit committed rOEQ94cfa18df50c: Fetch human labels (authored by Sumit).
Fetch human labels
Tue, Jun 27, 6:42 PM
Sumit added a comment to T156503: Build damaging/goodfaith models for Romanian Wikipedia.

https://github.com/wiki-ai/editquality/pull/78

Tue, Jun 27, 6:39 PM · artificial-intelligence, Scoring-platform-team, revscoring, editquality-modeling
Sumit added a comment to T156503: Build damaging/goodfaith models for Romanian Wikipedia.

need to retrain the models after the regex update, PR soon.

Tue, Jun 27, 2:54 PM · artificial-intelligence, Scoring-platform-team, revscoring, editquality-modeling
Sumit added a comment to T156503: Build damaging/goodfaith models for Romanian Wikipedia.
make models/rowiki.goodfaith.gradient_boosting.model                                     [97/1922]
cat datasets/rowiki.labeled_revisions.w_cache.20k_2016.json | \
        revscoring cv_train \
                revscoring.scorer_models.GradientBoosting \
                editquality.feature_lists.rowiki.goodfaith \
                goodfaith \
                --version=0.3.0 \
                -p 'max_depth=3' \
                -p 'learning_rate=0.1' \
                -p 'max_features="log2"' \
                -p 'n_estimators=300' \
                -s 'table' -s 'accuracy' -s 'precision' -s 'recall' -s 'pr' -s 'roc' -s 'recall_at_fpr(max_fpr=0.10)' -s 'filter_rate_at_recall(min_recall=0.9)' -s 'filt
er_rate_at_recall(min_recall=0.75)' -s 'recall_at_precision(min_precision=0.995)' -s 'recall_at_precision(min_precision=0.99)' -s 'recall_at_precision(min_precision=0.98
)' -s 'recall_at_precision(min_precision=0.90)' -s 'recall_at_precision(min_precision=0.75)' -s 'recall_at_precision(min_precision=0.60)' -s 'recall_at_precision(min_pre
cision=0.45)' -s 'recall_at_precision(min_precision=0.15)' \
                --balance-sample-weight \
                --center --scale > models/rowiki.goodfaith.gradient_boosting.model
2017-06-27 13:11:03,053 INFO:revscoring.utilities.cv_train -- Cross-validating model statistics for 10 folds...
2017-06-27 13:11:03,907 INFO:revscoring.scorer_models.sklearn_classifier -- Performing cross-validation 1...
2017-06-27 13:13:54,482 INFO:revscoring.scorer_models.sklearn_classifier -- Performing cross-validation 2...
2017-06-27 13:17:12,485 INFO:revscoring.scorer_models.sklearn_classifier -- Performing cross-validation 3...
2017-06-27 13:19:46,401 INFO:revscoring.scorer_models.sklearn_classifier -- Performing cross-validation 4...
2017-06-27 13:22:17,370 INFO:revscoring.scorer_models.sklearn_classifier -- Performing cross-validation 5...
2017-06-27 13:25:08,119 INFO:revscoring.scorer_models.sklearn_classifier -- Performing cross-validation 6...
2017-06-27 13:27:31,615 INFO:revscoring.scorer_models.sklearn_classifier -- Performing cross-validation 7...
2017-06-27 13:29:51,620 INFO:revscoring.scorer_models.sklearn_classifier -- Performing cross-validation 8...
2017-06-27 13:32:09,126 INFO:revscoring.scorer_models.sklearn_classifier -- Performing cross-validation 9...
2017-06-27 13:34:20,776 INFO:revscoring.scorer_models.sklearn_classifier -- Performing cross-validation 10...
2017-06-27 13:36:25,349 INFO:revscoring.utilities.cv_train -- Training model on all data...
ScikitLearnClassifier
 - type: GradientBoosting
 - params: max_features="log2", min_samples_leaf=1, min_weight_fraction_leaf=0.0, warm_start=false, balanced_sample=false, balanced_sample_weight=true, center=true, loss
="deviance", min_samples_split=2, max_leaf_nodes=null, verbose=0, max_depth=3, random_state=null, n_estimators=300, learning_rate=0.1, scale=true, subsample=1.0, init=nu
ll, presort="auto"
 - version: 0.3.0
 - trained: 2017-06-27T13:36:32.777290
Tue, Jun 27, 1:42 PM · artificial-intelligence, Scoring-platform-team, revscoring, editquality-modeling
Sumit added a comment to T156503: Build damaging/goodfaith models for Romanian Wikipedia.
make models/rowiki.damaging.gradient_boosting.model       
cat datasets/rowiki.labeled_revisions.w_cache.20k_2016.json | \
        revscoring cv_train \
                revscoring.scorer_models.GradientBoosting \
                editquality.feature_lists.rowiki.damaging \
                damaging \
                --version=0.3.0 \
                -p 'max_depth=5' \
                -p 'learning_rate=0.01' \
                -p 'max_features="log2"' \
                -p 'n_estimators=700' \
                -s 'table' -s 'accuracy' -s 'precision' -s 'recall' -s 'pr' -s 'roc' -s 'recall_at_fpr(max_fpr=0.10)' -s 'filter_rate_at_recall(min_recall=0.9)' -s 'filt
er_rate_at_recall(min_recall=0.75)' -s 'recall_at_precision(min_precision=0.995)' -s 'recall_at_precision(min_precision=0.99)' -s 'recall_at_precision(min_precision=0.98
)' -s 'recall_at_precision(min_precision=0.90)' -s 'recall_at_precision(min_precision=0.75)' -s 'recall_at_precision(min_precision=0.60)' -s 'recall_at_precision(min_pre
cision=0.45)' -s 'recall_at_precision(min_precision=0.15)' \
                --balance-sample-weight \
                --center --scale > models/rowiki.damaging.gradient_boosting.model
2017-06-27 08:00:43,699 INFO:revscoring.utilities.cv_train -- Cross-validating model statistics for 10 folds...
2017-06-27 08:00:44,352 INFO:revscoring.scorer_models.sklearn_classifier -- Performing cross-validation 1...
2017-06-27 08:03:13,756 INFO:revscoring.scorer_models.sklearn_classifier -- Performing cross-validation 2...
2017-06-27 08:05:56,730 INFO:revscoring.scorer_models.sklearn_classifier -- Performing cross-validation 3...
2017-06-27 08:08:40,903 INFO:revscoring.scorer_models.sklearn_classifier -- Performing cross-validation 4...
2017-06-27 08:11:27,209 INFO:revscoring.scorer_models.sklearn_classifier -- Performing cross-validation 5...
2017-06-27 08:14:17,733 INFO:revscoring.scorer_models.sklearn_classifier -- Performing cross-validation 6...
2017-06-27 08:17:35,238 INFO:revscoring.scorer_models.sklearn_classifier -- Performing cross-validation 7...
2017-06-27 08:20:17,584 INFO:revscoring.scorer_models.sklearn_classifier -- Performing cross-validation 8...
2017-06-27 08:23:33,992 INFO:revscoring.scorer_models.sklearn_classifier -- Performing cross-validation 9...
2017-06-27 08:26:46,826 INFO:revscoring.scorer_models.sklearn_classifier -- Performing cross-validation 10...
2017-06-27 08:29:07,741 INFO:revscoring.utilities.cv_train -- Training model on all data...
ScikitLearnClassifier
 - type: GradientBoosting
 - params: max_leaf_nodes=null, learning_rate=0.01, min_samples_split=2, verbose=0, center=true, warm_start=false, n_estimators=700, presort="auto", balanced_sample_weig
ht=true, loss="deviance", min_samples_leaf=1, balanced_sample=false, init=null, random_state=null, subsample=1.0, max_features="log2", scale=true, min_weight_fraction_le
af=0.0, max_depth=5
 - version: 0.3.0
 - trained: 2017-06-27T08:29:32.376824
Tue, Jun 27, 8:48 AM · artificial-intelligence, Scoring-platform-team, revscoring, editquality-modeling
Sumit added a comment to T165668: Weekly Reports for Outreachy Round-14 project: Allow Programs & Events Dashboard to make automatic edits on connected wikis.

Hi @Medhabansal a gentle reminder to keep your weekly reports updated!

Tue, Jun 27, 6:25 AM · Education-Program-Dashboard, Outreachy (Round-14)
Sumit added a comment to T164645: Weekly report of GSoC 2017 Project : Adding Data storage feature and upgrading Quiz extension.

Hi @Harjotsingh please keep your weekly reports updated.

Tue, Jun 27, 6:24 AM · MediaWiki-extensions-Quiz
Sumit added a comment to T164627: Weekly report for Automatic editing suggestions and feedbacks for articles in Wiki Ed Dashboard.

Hi @Keer25 , please keep your weekly reports updated.

Tue, Jun 27, 6:24 AM · Education-Program-Dashboard
Sumit added a comment to T164623: Weekly Reports : Add a "hierarchy" type to the Cargo extension [GSoC-2017].

Hi, a gentle reminder to update your weekly report.

Tue, Jun 27, 6:23 AM · MediaWiki-extensions-Cargo
Sumit added a comment to T164612: Weekly reports of Wiki Ed Foundation Project-"To provide enhanced usability for Wikimedia Programs & Events Dashboard".

Hi, a gentle reminder to keep weekly reports updated.

Tue, Jun 27, 6:23 AM · Education-Program-Dashboard
Sumit added a comment to T164531: Weekly reports for Implement Thanks support in Pywikibot.

Hi, a gentle reminder to update weekly report.

Tue, Jun 27, 6:22 AM · Pywikibot-core, Pywikibot-Thanks

Mon, Jun 26

GitHub <noreply@github.com> committed rOEQ72119239607a: Merge 0247b751f0e47ba3bfb62ed65999783fd4bb2f86 into… (authored by Sumit).
Merge 0247b751f0e47ba3bfb62ed65999783fd4bb2f86 into…
Mon, Jun 26, 10:21 PM
GitHub <noreply@github.com> committed rODQ134c5c32b528: Merge 5f8b47e72814e1deb54710c124b1e4c913dc1b46 into… (authored by Sumit).
Merge 5f8b47e72814e1deb54710c124b1e4c913dc1b46 into…
Mon, Jun 26, 7:31 PM
GitHub <noreply@github.com> committed rOEQ88439263938f: Merge 0247b751f0e47ba3bfb62ed65999783fd4bb2f86 into… (authored by Sumit).
Merge 0247b751f0e47ba3bfb62ed65999783fd4bb2f86 into…
Mon, Jun 26, 7:24 PM
Sumit added a comment to T167305: Experiment with Sentiment score feature for draftquality.

So I could setup a test with the library - https://github.com/kevincobain2000/sentiment_classifier/ that generates raw polarity scores for each document as an aggregate of positive and negative terms in the document. I used the https://github.com/wiki-ai/draftquality/blob/master/datasets/enwiki.draft_quality.75_not_OK_sample.censored.tsv and made the following observations:

Apologies if you've already covered this, but it might be helpful to also do a sentiment analysis of non-damaging edits, to determine our baseline?

Mon, Jun 26, 6:13 PM · draftquality-modeling, artificial-intelligence, Scoring-platform-team

Jun 22 2017

GitHub <noreply@github.com> committed rOEQ74afc8d07e3e: Merge 0247b751f0e47ba3bfb62ed65999783fd4bb2f86 into… (authored by Sumit).
Merge 0247b751f0e47ba3bfb62ed65999783fd4bb2f86 into…
Jun 22 2017, 10:15 AM
GitHub <noreply@github.com> committed rOEQfaa88074c14d: Merge 0247b751f0e47ba3bfb62ed65999783fd4bb2f86 into… (authored by Sumit).
Merge 0247b751f0e47ba3bfb62ed65999783fd4bb2f86 into…
Jun 22 2017, 6:24 AM
Sumit committed rOEQ0247b751f0e4: Take top 20000 labelled instances then shuffle (authored by Sumit).
Take top 20000 labelled instances then shuffle
Jun 22 2017, 6:24 AM
Sumit committed rOEQ0baef8bd6bc6: Remove shuf and take top 20000 labelled instances (authored by Sumit).
Remove shuf and take top 20000 labelled instances
Jun 22 2017, 6:22 AM
GitHub <noreply@github.com> committed rOEQ623c2fe05346: Merge 0baef8bd6bc69a95a915b8e06a78ac6f98608b63 into… (authored by Sumit).
Merge 0baef8bd6bc69a95a915b8e06a78ac6f98608b63 into…
Jun 22 2017, 6:22 AM
GitHub <noreply@github.com> committed rOEQf2f4aea23a66: Merge 4a796af285d34a4ac358bd75a4a749605423c33f into… (authored by Sumit).
Merge 4a796af285d34a4ac358bd75a4a749605423c33f into…
Jun 22 2017, 6:17 AM
Sumit committed rOEQ4a796af285d3: Remove shuf (authored by Sumit).
Remove shuf
Jun 22 2017, 6:17 AM

Jun 21 2017

Sumit updated subscribers of T168369: Add language support for Albanian.

@Halfak @Ladsgroup I'm not familiar with automatic Bad-words list creation and their review, any pointers where to look for?

Jun 21 2017, 12:38 PM · Scoring-platform-team, Bad-Words-Detection-System, revscoring, artificial-intelligence
Sumit added a project to T168369: Add language support for Albanian: Scoring-platform-team.
Jun 21 2017, 12:25 PM · Scoring-platform-team, Bad-Words-Detection-System, revscoring, artificial-intelligence
Sumit committed rODQ5f8b47e72814: Take most common word sense for polarity score (authored by Sumit).
Take most common word sense for polarity score
Jun 21 2017, 12:24 PM
GitHub <noreply@github.com> committed rODQ5c4e7b79b735: Merge 5f8b47e72814e1deb54710c124b1e4c913dc1b46 into… (authored by Sumit).
Merge 5f8b47e72814e1deb54710c124b1e4c913dc1b46 into…
Jun 21 2017, 12:24 PM
Sumit added a comment to T167305: Experiment with Sentiment score feature for draftquality.

Most dominant word sense achieves tremendous improvement in computation, now taking only milliseconds. Now only need to verify if this assumption preserves our hypothesis related to sentiment of draft.

Jun 21 2017, 12:24 PM · draftquality-modeling, artificial-intelligence, Scoring-platform-team
Sumit moved T167305: Experiment with Sentiment score feature for draftquality from Active to Review on the Scoring-platform-team board.
Jun 21 2017, 12:22 PM · draftquality-modeling, artificial-intelligence, Scoring-platform-team

Jun 20 2017

GitHub <noreply@github.com> committed rOEQ359b5198ebdf: Merge 65d9839eb1efe93e1ae59376262738258aedcc22 into… (authored by Sumit).
Merge 65d9839eb1efe93e1ae59376262738258aedcc22 into…
Jun 20 2017, 3:16 PM
Sumit committed rOEQ65d9839eb1ef: Add sqwiki features and rules to fetch labeled revision to Makefile (authored by Sumit).
Add sqwiki features and rules to fetch labeled revision to Makefile
Jun 20 2017, 3:16 PM
Sumit added a comment to T163009: Train/test damaging & goodfaith models for Albanian Wikipedia.

https://github.com/wiki-ai/editquality/pull/74

Jun 20 2017, 2:29 PM · Scoring-platform-team, artificial-intelligence, editquality-modeling
Sumit updated the task description for T168369: Add language support for Albanian.
Jun 20 2017, 9:23 AM · Scoring-platform-team, Bad-Words-Detection-System, revscoring, artificial-intelligence
Sumit added a subtask for T163009: Train/test damaging & goodfaith models for Albanian Wikipedia: T168369: Add language support for Albanian.
Jun 20 2017, 9:22 AM · Scoring-platform-team, artificial-intelligence, editquality-modeling
Sumit added a parent task for T168369: Add language support for Albanian: T163009: Train/test damaging & goodfaith models for Albanian Wikipedia.
Jun 20 2017, 9:22 AM · Scoring-platform-team, Bad-Words-Detection-System, revscoring, artificial-intelligence
Sumit updated the task description for T168369: Add language support for Albanian.
Jun 20 2017, 9:20 AM · Scoring-platform-team, Bad-Words-Detection-System, revscoring, artificial-intelligence
Sumit added a comment to T163009: Train/test damaging & goodfaith models for Albanian Wikipedia.

I've added makefile rules and the features file but looks like we don't yet have language assets for Albanian in revscoring.

Jun 20 2017, 9:20 AM · Scoring-platform-team, artificial-intelligence, editquality-modeling
Sumit created T168369: Add language support for Albanian.
Jun 20 2017, 9:18 AM · Scoring-platform-team, Bad-Words-Detection-System, revscoring, artificial-intelligence

Jun 19 2017

Sumit added a comment to T167305: Experiment with Sentiment score feature for draftquality.

Removing stop words or restricting sentence to [-1,+1] during disambiguation did not give significant improvement.

Jun 19 2017, 4:30 PM · draftquality-modeling, artificial-intelligence, Scoring-platform-team

Jun 12 2017

Sumit added a comment to T167305: Experiment with Sentiment score feature for draftquality.

Some dirty digging around performance, by profiling a script scoring long ( > 20) lines sentences:

Tue Jun 13 02:29:04 2017    stats
Jun 12 2017, 9:18 PM · draftquality-modeling, artificial-intelligence, Scoring-platform-team
Sumit removed a project from T167697: [Discuss] Hosting the draft quality dataset on labsDB: DBA.

@Aklapper I was not aware that DBA would require immediate action, this task still needs some discussion, removing currently.

Jun 12 2017, 8:29 PM · Scoring-platform-team-Backlog, draftquality-modeling, artificial-intelligence
Sumit added a project to T167697: [Discuss] Hosting the draft quality dataset on labsDB: draftquality-modeling.
Jun 12 2017, 4:58 PM · Scoring-platform-team-Backlog, draftquality-modeling, artificial-intelligence
Sumit added projects to T167305: Experiment with Sentiment score feature for draftquality: artificial-intelligence, draftquality-modeling.
Jun 12 2017, 4:58 PM · draftquality-modeling, artificial-intelligence, Scoring-platform-team
Sumit added projects to T167697: [Discuss] Hosting the draft quality dataset on labsDB: artificial-intelligence, DBA.
Jun 12 2017, 4:58 PM · Scoring-platform-team-Backlog, draftquality-modeling, artificial-intelligence
Sumit created T167697: [Discuss] Hosting the draft quality dataset on labsDB.
Jun 12 2017, 4:57 PM · Scoring-platform-team-Backlog, draftquality-modeling, artificial-intelligence
Sumit renamed T167305: Experiment with Sentiment score feature for draftquality from Experiment with WNAffect for draftquality to Experiment with Sentiment score feature for draftquality.
Jun 12 2017, 4:49 PM · draftquality-modeling, artificial-intelligence, Scoring-platform-team
GitHub <noreply@github.com> committed rODQ830be68ffc48: Merge c351f34fbe2d45b11cefe7765fbc877a9859a63c into… (authored by Sumit).
Merge c351f34fbe2d45b11cefe7765fbc877a9859a63c into…
Jun 12 2017, 2:11 PM
Sumit committed rODQc351f34fbe2d: (WIP) Add feature for polarity using SentiWordnet (authored by Sumit).
(WIP) Add feature for polarity using SentiWordnet
Jun 12 2017, 2:11 PM
Sumit moved T167305: Experiment with Sentiment score feature for draftquality from Active to Review on the Scoring-platform-team board.
Jun 12 2017, 1:07 PM · draftquality-modeling, artificial-intelligence, Scoring-platform-team
Sumit added a comment to T167305: Experiment with Sentiment score feature for draftquality.

Tracked in https://github.com/wiki-ai/draftquality/pull/3

Jun 12 2017, 1:07 PM · draftquality-modeling, artificial-intelligence, Scoring-platform-team
Sumit added a comment to T167305: Experiment with Sentiment score feature for draftquality.

So I could setup a test with the library - https://github.com/kevincobain2000/sentiment_classifier/ that generates raw polarity scores for each document as an aggregate of positive and negative terms in the document. I used the https://github.com/wiki-ai/draftquality/blob/master/datasets/enwiki.draft_quality.75_not_OK_sample.censored.tsv and made the following observations:

Jun 12 2017, 12:56 PM · draftquality-modeling, artificial-intelligence, Scoring-platform-team

Jun 7 2017

Sumit created T167305: Experiment with Sentiment score feature for draftquality.
Jun 7 2017, 2:49 PM · draftquality-modeling, artificial-intelligence, Scoring-platform-team

May 30 2017

GitHub <noreply@github.com> committed rODQc74b5758dd80: Merge b00030576f5b926c55321484041e1abd0cef077c into… (authored by Sumit).
Merge b00030576f5b926c55321484041e1abd0cef077c into…
May 30 2017, 8:06 PM
Sumit committed rODQb00030576f5b: Add json2tsv in requirements.txt (authored by Sumit).
Add json2tsv in requirements.txt
May 30 2017, 8:06 PM

May 19 2017

Sumit assigned T164612: Weekly reports of Wiki Ed Foundation Project-"To provide enhanced usability for Wikimedia Programs & Events Dashboard" to Sek2016.
May 19 2017, 9:53 PM · Education-Program-Dashboard

May 7 2017

Sumit placed T158678: Allow Programs & Events Dashboard to make automatic edits on connected wikis up for grabs.

removing assignee as this is the project task.

May 7 2017, 8:01 AM · Google-Summer-of-Code (2017), Outreach-Programs-Projects, Outreachy (Round-14), Possible-Tech-Projects, Education-Program-Dashboard
Sumit removed a project from T164531: Weekly reports for Implement Thanks support in Pywikibot: Google-Summer-of-Code (2017).
May 7 2017, 7:58 AM · Pywikibot-core, Pywikibot-Thanks
Sumit edited projects for T164581: Weekly reports: Improvements to ProofreadPage Extension and Wikisource, added: ProofreadPage; removed Google-Summer-of-Code (2017).
May 7 2017, 7:57 AM · ProofreadPage
Sumit added a comment to T164560: Build a similar to @NYPLEmoji bot for Commons images.

@harshcrop please mention names of your mentors and add T143593 as parent of this task.

May 7 2017, 7:57 AM · Google-Summer-of-Code (2017), Commons