Sumit (Sumit)
User

Projects (8)

Today

  • Clear sailing ahead.

Tomorrow

  • Clear sailing ahead.

Saturday

  • Clear sailing ahead.

User Details

User Since
Dec 16 2014, 4:23 PM (161 w, 2 d)
Availability
Available
IRC Nick
codezee
LDAP User
Sumit
MediaWiki User
Sumit.iitp

Recent Activity

Yesterday

Sumit added a comment to T185147: Host Google-News-word2vec.bin publicly.

The binary *was* on ores-misc-01 which is now nuked. I'll upload it to ores-staging-01 from my system again from where it can be put somewhere public.

Wed, Jan 17, 9:47 PM · Scoring-platform-team (Current)
Sumit created T185147: Host Google-News-word2vec.bin publicly.
Wed, Jan 17, 9:46 PM · Scoring-platform-team (Current)

Tue, Jan 16

Sumit added a comment to T184765: Back up ores-misc-01 to ores-staging-01.

I've taken backup of the tuning reports, and the GradientBoosting and RandomForest models.

Tue, Jan 16, 4:49 PM · ORES, Scoring-platform-team (Current)

Fri, Dec 22

Sumit edited projects for T183392: Drafttopic: Add utility to extract dependents, added: Scoring-platform-team; removed Scoring-platform-team (Current).
Fri, Dec 22, 6:24 PM · Scoring-platform-team
Sumit edited projects for T183355: Drafttopic: add article text fetching utility, added: Scoring-platform-team; removed Scoring-platform-team (Current).
Fri, Dec 22, 6:24 PM · Scoring-platform-team
Sumit moved T183580: class weights support for multilabel classification from Active to Done on the Scoring-platform-team (Current) board.
Fri, Dec 22, 6:23 PM · Scoring-platform-team (Current), artificial-intelligence, revscoring, drafttopic-modeling
Sumit edited projects for T183580: class weights support for multilabel classification, added: Scoring-platform-team (Current); removed Scoring-platform-team.
Fri, Dec 22, 6:23 PM · Scoring-platform-team (Current), artificial-intelligence, revscoring, drafttopic-modeling
Sumit added projects to T183580: class weights support for multilabel classification: drafttopic-modeling, revscoring.

https://github.com/wiki-ai/revscoring/pull/385

Fri, Dec 22, 6:22 PM · Scoring-platform-team (Current), artificial-intelligence, revscoring, drafttopic-modeling
Sumit created T183580: class weights support for multilabel classification.
Fri, Dec 22, 6:22 PM · Scoring-platform-team (Current), artificial-intelligence, revscoring, drafttopic-modeling

Wed, Dec 20

Sumit moved T183392: Drafttopic: Add utility to extract dependents from Active to Review on the Scoring-platform-team (Current) board.
Wed, Dec 20, 5:55 PM · Scoring-platform-team
Sumit added a comment to T183392: Drafttopic: Add utility to extract dependents.

https://github.com/wiki-ai/drafttopic/pull/15

Wed, Dec 20, 5:55 PM · Scoring-platform-team
Sumit created T183392: Drafttopic: Add utility to extract dependents.
Wed, Dec 20, 5:55 PM · Scoring-platform-team
Sumit moved T183355: Drafttopic: add article text fetching utility from Active to Review on the Scoring-platform-team (Current) board.
Wed, Dec 20, 12:53 PM · Scoring-platform-team
Sumit added a comment to T183355: Drafttopic: add article text fetching utility.

https://github.com/wiki-ai/drafttopic/pull/14

Wed, Dec 20, 12:52 PM · Scoring-platform-team
Sumit created T183355: Drafttopic: add article text fetching utility.
Wed, Dec 20, 12:52 PM · Scoring-platform-team

Dec 11 2017

Sumit added a comment to T181163: Revscoring tune does not recognize a set of labels as target.

https://github.com/wiki-ai/revscoring/pull/376

Dec 11 2017, 3:52 PM · Scoring-platform-team, drafttopic-modeling, Research Ideas, artificial-intelligence
Sumit added a comment to T181166: Revscoring: Statistic for multilabel classification.

https://github.com/wiki-ai/revscoring/pull/376

Dec 11 2017, 3:51 PM · drafttopic-modeling, Scoring-platform-team (Current), Research Ideas, artificial-intelligence

Nov 29 2017

Sumit committed rODQ9097be964f74: Take most common word sense for polarity score (authored by Sumit).
Take most common word sense for polarity score
Nov 29 2017, 11:22 PM
Sumit committed rODQ4434ab188ecf: ADD SentiWordnet requirement to README (authored by Sumit).
ADD SentiWordnet requirement to README
Nov 29 2017, 11:22 PM
Sumit committed rODQa7f323398241: Address review comments in https://github.com/wiki-ai/draftquality/pull/9 (authored by Sumit).
Address review comments in https://github.com/wiki-ai/draftquality/pull/9
Nov 29 2017, 11:22 PM
Sumit committed rODQfd0a6e184361: (WIP) Add feature for polarity using SentiWordnet Adds a library… (authored by Sumit).
(WIP) Add feature for polarity using SentiWordnet Adds a library…
Nov 29 2017, 11:22 PM
Sumit committed rODQ91cde1284bc4: Add json2tsv in requirements.txt (authored by Sumit).
Add json2tsv in requirements.txt
Nov 29 2017, 11:22 PM
Sumit committed rOEQ026c38534f69: Add label param for enwiki goodfaith in Makefile (authored by Sumit).
Add label param for enwiki goodfaith in Makefile
Nov 29 2017, 10:50 PM
Sumit committed rOEQ95a9a24c17fa: Take top 20000 labelled instances then shuffle (authored by Sumit).
Take top 20000 labelled instances then shuffle
Nov 29 2017, 10:50 PM
Sumit committed rOEQf885552f2091: Add sqwiki features and rules to fetch labeled revision to Makefile (authored by Sumit).
Add sqwiki features and rules to fetch labeled revision to Makefile
Nov 29 2017, 10:50 PM
Sumit committed rOEQ6619bdbf3d3c: Retain reverted autolabelled (authored by Sumit).
Retain reverted autolabelled
Nov 29 2017, 10:50 PM
Sumit committed rOEQ6ba178bf235c: Add models and tuning reports (authored by Sumit).
Add models and tuning reports
Nov 29 2017, 10:50 PM
Sumit committed rOEQd5f9c69677eb: Add rowiki damaging, goodfaith models to Makefile (authored by Sumit).
Add rowiki damaging, goodfaith models to Makefile
Nov 29 2017, 10:50 PM
Sumit committed rOEQee7839d5d36a: Fetch human labels (authored by Sumit).
Fetch human labels
Nov 29 2017, 10:50 PM

Nov 28 2017

Sumit moved T172321: Build mid-level WikiProject category training set from Active to Review on the Scoring-platform-team (Current) board.
Nov 28 2017, 5:58 PM · drafttopic-modeling, Scoring-platform-team (Current)
Sumit added a comment to T172321: Build mid-level WikiProject category training set.

We now have a dataset at figshare - https://doi.org/10.6084/m9.figshare.5640526.v1 \o/

Nov 28 2017, 5:57 PM · drafttopic-modeling, Scoring-platform-team (Current)
Sumit added a comment to T181522: Fix response processing logic in drafttopic.fetch_page_wikiprojects.

https://github.com/wiki-ai/drafttopic/pull/13

Nov 28 2017, 5:14 PM · drafttopic-modeling, Scoring-platform-team (Current)
Sumit added a comment to T179311: Generate mid-level WikiProject categories.

@Sumit, please move to the "done" column before closing tasks. We need this in order to consistently report what has been "done".

Nov 28 2017, 4:56 PM · drafttopic-modeling, Scoring-platform-team (Current)
Sumit added a project to T172321: Build mid-level WikiProject category training set: drafttopic-modeling.
Nov 28 2017, 4:27 PM · drafttopic-modeling, Scoring-platform-team (Current)
Sumit added a project to T172325: Efficient method for mapping a WikiProject template to the WikiProject Directory: drafttopic-modeling.
Nov 28 2017, 4:26 PM · drafttopic-modeling, Scoring-platform-team
Sumit added a project to T172326: Create machine-readable version of the WikiProject Directory: drafttopic-modeling.
Nov 28 2017, 4:26 PM · drafttopic-modeling, Research Ideas, Scoring-platform-team
Sumit edited projects for T175037: Publish Machine-Readable WikiProjects Dataset, added: drafttopic-modeling; removed Scoring-platform-team (Current).
Nov 28 2017, 4:26 PM · drafttopic-modeling, Research Ideas, Scoring-platform-team
Sumit added a project to T179311: Generate mid-level WikiProject categories: drafttopic-modeling.
Nov 28 2017, 4:25 PM · drafttopic-modeling, Scoring-platform-team (Current)
Sumit added a project to T181166: Revscoring: Statistic for multilabel classification: drafttopic-modeling.
Nov 28 2017, 4:25 PM · drafttopic-modeling, Scoring-platform-team (Current), Research Ideas, artificial-intelligence
Sumit added a project to T181163: Revscoring tune does not recognize a set of labels as target: drafttopic-modeling.
Nov 28 2017, 4:25 PM · Scoring-platform-team, drafttopic-modeling, Research Ideas, artificial-intelligence
Sumit added a project to T181522: Fix response processing logic in drafttopic.fetch_page_wikiprojects: drafttopic-modeling.
Nov 28 2017, 4:23 PM · drafttopic-modeling, Scoring-platform-team (Current)
Sumit created T181522: Fix response processing logic in drafttopic.fetch_page_wikiprojects.
Nov 28 2017, 4:22 PM · drafttopic-modeling, Scoring-platform-team (Current)
Sumit closed T179311: Generate mid-level WikiProject categories as Resolved.

Looks like we don't include the top level category names yet. @Sumit said he'd like to do that in a separate PR.

Nov 28 2017, 3:06 PM · drafttopic-modeling, Scoring-platform-team (Current)
Sumit closed T179311: Generate mid-level WikiProject categories, a subtask of T172321: Build mid-level WikiProject category training set, as Resolved.
Nov 28 2017, 3:06 PM · drafttopic-modeling, Scoring-platform-team (Current)

Nov 22 2017

Sumit edited projects for T181166: Revscoring: Statistic for multilabel classification, added: Scoring-platform-team (Current); removed Scoring-platform-team.
Nov 22 2017, 4:23 PM · drafttopic-modeling, Scoring-platform-team (Current), Research Ideas, artificial-intelligence
Sumit edited projects for T181163: Revscoring tune does not recognize a set of labels as target, added: Scoring-platform-team (Current); removed Scoring-platform-team.
Nov 22 2017, 4:23 PM · Scoring-platform-team, drafttopic-modeling, Research Ideas, artificial-intelligence
Sumit edited parent tasks for T181163: Revscoring tune does not recognize a set of labels as target, added: T181166: Revscoring: Statistic for multilabel classification; removed: T123327: Train/test draft topic model (new article routing AI).
Nov 22 2017, 4:21 PM · Scoring-platform-team, drafttopic-modeling, Research Ideas, artificial-intelligence
Sumit removed a subtask for T123327: Train/test draft topic model (new article routing AI): T181163: Revscoring tune does not recognize a set of labels as target.
Nov 22 2017, 4:20 PM · Research Ideas, artificial-intelligence, Scoring-platform-team
Sumit added a subtask for T181166: Revscoring: Statistic for multilabel classification: T181163: Revscoring tune does not recognize a set of labels as target.
Nov 22 2017, 4:20 PM · drafttopic-modeling, Scoring-platform-team (Current), Research Ideas, artificial-intelligence
Sumit created T181166: Revscoring: Statistic for multilabel classification.
Nov 22 2017, 4:20 PM · drafttopic-modeling, Scoring-platform-team (Current), Research Ideas, artificial-intelligence
Sumit updated the task description for T181163: Revscoring tune does not recognize a set of labels as target.
Nov 22 2017, 4:14 PM · Scoring-platform-team, drafttopic-modeling, Research Ideas, artificial-intelligence
Sumit created T181163: Revscoring tune does not recognize a set of labels as target.
Nov 22 2017, 4:13 PM · Scoring-platform-team, drafttopic-modeling, Research Ideas, artificial-intelligence

Nov 21 2017

Sumit created T181074: Refactor scripts fetching text and other metadata.
Nov 21 2017, 6:38 PM · Scoring-platform-team

Nov 6 2017

Sumit moved T172321: Build mid-level WikiProject category training set from Active to Review on the Scoring-platform-team (Current) board.
Nov 6 2017, 5:20 PM · drafttopic-modeling, Scoring-platform-team (Current)
Sumit added a comment to T172321: Build mid-level WikiProject category training set.

https://github.com/wiki-ai/drafttopic/pull/11

Nov 6 2017, 5:08 PM · drafttopic-modeling, Scoring-platform-team (Current)

Nov 4 2017

Sumit edited projects for T172321: Build mid-level WikiProject category training set, added: Scoring-platform-team (Current); removed Scoring-platform-team.
Nov 4 2017, 10:12 AM · drafttopic-modeling, Scoring-platform-team (Current)
Sumit moved T179311: Generate mid-level WikiProject categories from Review to Active on the Scoring-platform-team (Current) board.
Nov 4 2017, 10:11 AM · drafttopic-modeling, Scoring-platform-team (Current)

Nov 3 2017

Sumit added a comment to T179592: Could use some more disk space on ores-misc-01.ores-staging.eqiad.wmflabs:/srv.

Could free up 2.2G more...

Nov 3 2017, 4:33 PM · Scoring-platform-team
Sumit added a comment to T179592: Could use some more disk space on ores-misc-01.ores-staging.eqiad.wmflabs:/srv.

Removed 800MB of my stuff which included cached models and datasets.

Nov 3 2017, 4:26 PM · Scoring-platform-team

Oct 30 2017

Sumit moved T179311: Generate mid-level WikiProject categories from Active to Review on the Scoring-platform-team (Current) board.
Oct 30 2017, 4:56 PM · drafttopic-modeling, Scoring-platform-team (Current)
Sumit edited projects for T179311: Generate mid-level WikiProject categories, added: Scoring-platform-team (Current); removed Scoring-platform-team.
Oct 30 2017, 4:56 PM · drafttopic-modeling, Scoring-platform-team (Current)
Sumit added a comment to T179311: Generate mid-level WikiProject categories.

https://github.com/wiki-ai/drafttopic/pull/5

Oct 30 2017, 4:55 PM · drafttopic-modeling, Scoring-platform-team (Current)
Sumit created T179311: Generate mid-level WikiProject categories.
Oct 30 2017, 4:55 PM · drafttopic-modeling, Scoring-platform-team (Current)
Sumit added a comment to T172321: Build mid-level WikiProject category training set.
Oct 30 2017, 4:49 PM · drafttopic-modeling, Scoring-platform-team (Current)
Sumit closed T172325: Efficient method for mapping a WikiProject template to the WikiProject Directory as Resolved.
Oct 30 2017, 4:48 PM · drafttopic-modeling, Scoring-platform-team
Sumit closed T172325: Efficient method for mapping a WikiProject template to the WikiProject Directory, a subtask of T172321: Build mid-level WikiProject category training set, as Resolved.
Oct 30 2017, 4:48 PM · drafttopic-modeling, Scoring-platform-team (Current)

Oct 16 2017

Sumit closed T172326: Create machine-readable version of the WikiProject Directory as Resolved.
Oct 16 2017, 7:31 PM · drafttopic-modeling, Research Ideas, Scoring-platform-team
Sumit closed T172326: Create machine-readable version of the WikiProject Directory, a subtask of T172325: Efficient method for mapping a WikiProject template to the WikiProject Directory, as Resolved.
Oct 16 2017, 7:31 PM · drafttopic-modeling, Scoring-platform-team
Sumit closed T175037: Publish Machine-Readable WikiProjects Dataset as Resolved.
Oct 16 2017, 7:31 PM · drafttopic-modeling, Research Ideas, Scoring-platform-team
Sumit closed T175037: Publish Machine-Readable WikiProjects Dataset, a subtask of T172326: Create machine-readable version of the WikiProject Directory, as Resolved.
Oct 16 2017, 7:31 PM · drafttopic-modeling, Research Ideas, Scoring-platform-team
Sumit added a comment to T175037: Publish Machine-Readable WikiProjects Dataset.

Published on ORES account of figshare.

Oct 16 2017, 7:30 PM · drafttopic-modeling, Research Ideas, Scoring-platform-team

Oct 7 2017

Sumit added a comment to T172325: Efficient method for mapping a WikiProject template to the WikiProject Directory.

https://github.com/wiki-ai/drafttopic/pull/4

Oct 7 2017, 8:13 PM · drafttopic-modeling, Scoring-platform-team

Oct 6 2017

D3r1ck01 awarded T143490: Endterm evaluation for "Automated Testing and Integration of IFTTT support to Wikidata" a Mountain of Wealth token.
Oct 6 2017, 2:59 PM · Google-Summer-of-Code (2016), Wikidata

Sep 25 2017

Sumit updated the task description for T172321: Build mid-level WikiProject category training set.
Sep 25 2017, 2:59 PM · drafttopic-modeling, Scoring-platform-team (Current)
Sumit updated the task description for T172326: Create machine-readable version of the WikiProject Directory.
Sep 25 2017, 2:51 PM · drafttopic-modeling, Research Ideas, Scoring-platform-team

Sep 20 2017

Qgil awarded T164525: Weekly reports of GSoC17/Outreachy14 projects (tracking) a Yellow Medal token.
Sep 20 2017, 9:56 AM · Outreachy (Round-14), Google-Summer-of-Code (2017)

Sep 6 2017

Sumit updated the task description for T172326: Create machine-readable version of the WikiProject Directory.
Sep 6 2017, 8:05 PM · drafttopic-modeling, Research Ideas, Scoring-platform-team
Sumit added a comment to T172326: Create machine-readable version of the WikiProject Directory.

PR for tests - https://github.com/wiki-ai/drafttopic/pull/1

Sep 6 2017, 8:05 PM · drafttopic-modeling, Research Ideas, Scoring-platform-team
Sumit removed a project from T172326: Create machine-readable version of the WikiProject Directory: Scoring-platform-team (Current).
Sep 6 2017, 5:51 PM · drafttopic-modeling, Research Ideas, Scoring-platform-team
Sumit edited projects for T173107: New Page Patrol - Number of users, added: Scoring-platform-team; removed Scoring-platform-team (Current).
Sep 6 2017, 5:21 PM · Scoring-platform-team, English-Wikipedia-New-Pages-Patrol
Sumit edited projects for T173210: New Pages patrol - Number of re-reviews, added: Scoring-platform-team; removed Scoring-platform-team (Current).
Sep 6 2017, 5:21 PM · Scoring-platform-team, English-Wikipedia-New-Pages-Patrol

Sep 5 2017

Halfak awarded T172326: Create machine-readable version of the WikiProject Directory a Like token.
Sep 5 2017, 4:48 PM · drafttopic-modeling, Research Ideas, Scoring-platform-team
Sumit created T175037: Publish Machine-Readable WikiProjects Dataset.
Sep 5 2017, 4:06 PM · drafttopic-modeling, Research Ideas, Scoring-platform-team
Sumit added a comment to T172326: Create machine-readable version of the WikiProject Directory.
Sep 5 2017, 3:15 PM · drafttopic-modeling, Research Ideas, Scoring-platform-team

Aug 28 2017

Sumit added a project to T172326: Create machine-readable version of the WikiProject Directory: Scoring-platform-team (Current).
Aug 28 2017, 3:42 PM · drafttopic-modeling, Research Ideas, Scoring-platform-team
Sumit merged T172720: Data and Shell access related to Scoring Platform project on drafts and page reviews into T172719: Get Sumit access to deleted page data for quality modeling.
Aug 28 2017, 3:39 PM · draftquality-modeling, articlequality-modeling, artificial-intelligence, Scoring-platform-team (Current)
Sumit merged task T172720: Data and Shell access related to Scoring Platform project on drafts and page reviews into T172719: Get Sumit access to deleted page data for quality modeling.
Aug 28 2017, 3:39 PM · WMF-NDA-Requests, Scoring-platform-team (Current)

Aug 26 2017

Sumit claimed T172326: Create machine-readable version of the WikiProject Directory.
Aug 26 2017, 6:02 AM · drafttopic-modeling, Research Ideas, Scoring-platform-team

Aug 21 2017

Sumit moved T172726: Project around page reviewing and drafts from Active to Review on the Scoring-platform-team (Current) board.
Aug 21 2017, 3:09 PM · artificial-intelligence, draftquality-modeling, Scoring-platform-team (Current)

Aug 20 2017

Sumit added a comment to T172726: Project around page reviewing and drafts.

https://meta.wikimedia.org/wiki/Research:Automatic_new_article_topics_suggestion

Aug 20 2017, 4:40 PM · artificial-intelligence, draftquality-modeling, Scoring-platform-team (Current)

Aug 18 2017

Sumit added a comment to T172326: Create machine-readable version of the WikiProject Directory.

might be useful if we sync the machine readable format from here, probably using a cron script - https://en.wikipedia.org/wiki/Wikipedia:WikiProject_Directory/All ( Updated by the reports bot )

Aug 18 2017, 8:09 PM · drafttopic-modeling, Research Ideas, Scoring-platform-team
Sumit added a comment to T123327: Train/test draft topic model (new article routing AI).

Also from eranroz, a bot tagging new articles with wikiprojects or lists using a rule-based system - https://en.wikipedia.org/wiki/User:AlexNewArtBot

Aug 18 2017, 7:50 PM · Research Ideas, artificial-intelligence, Scoring-platform-team

Aug 12 2017

Sumit moved T173107: New Page Patrol - Number of users from Done to Review on the Scoring-platform-team (Current) board.
Aug 12 2017, 6:42 PM · Scoring-platform-team, English-Wikipedia-New-Pages-Patrol
Sumit moved T173210: New Pages patrol - Number of re-reviews from Done to Review on the Scoring-platform-team (Current) board.
Aug 12 2017, 6:42 PM · Scoring-platform-team, English-Wikipedia-New-Pages-Patrol
Sumit updated the task description for T173107: New Page Patrol - Number of users.
Aug 12 2017, 6:34 PM · Scoring-platform-team, English-Wikipedia-New-Pages-Patrol
Sumit updated the task description for T173210: New Pages patrol - Number of re-reviews.
Aug 12 2017, 6:34 PM · Scoring-platform-team, English-Wikipedia-New-Pages-Patrol
Sumit created T173210: New Pages patrol - Number of re-reviews.
Aug 12 2017, 6:31 PM · Scoring-platform-team, English-Wikipedia-New-Pages-Patrol

Aug 11 2017

Sumit created T173107: New Page Patrol - Number of users.
Aug 11 2017, 4:39 PM · Scoring-platform-team, English-Wikipedia-New-Pages-Patrol

Aug 7 2017

Sumit updated the task description for T172726: Project around page reviewing and drafts.
Aug 7 2017, 7:56 PM · artificial-intelligence, draftquality-modeling, Scoring-platform-team (Current)
Sumit updated the task description for T172720: Data and Shell access related to Scoring Platform project on drafts and page reviews.
Aug 7 2017, 7:14 PM · WMF-NDA-Requests, Scoring-platform-team (Current)