Thanks @Halfak for the invite to this task! I am interested in understanding how long standing editor practices can be best encoded in the editing interface that can help editors do better and allow us to build robust models from structured editing data that can automatically flag outstanding issues for editors to fix. One of the problems with building effective automated content flaw detection to help Wikipedians is the lack of precise information around historical edits (like what exactly was improved in this edit?)
- Queries
- All Stories
- Search
- Advanced Search
- Transactions
- Transaction Logs
Feed Advanced Search
Advanced Search
Advanced Search
Feb 24 2022
Feb 24 2022
Sumit added a comment to T265163: Create a system to encode best practices into editing experiences.
Sumit updated the task description for T265163: Create a system to encode best practices into editing experiences.
May 7 2020
May 7 2020
I'm curious to know what kind of features is the Wikidata topic models API using? Is it the same features as the original topic model developed for English Wikipedia or something different?
Mar 1 2020
Mar 1 2020
Please test run your solutions locally. If it runs and gives expected results, submit a PR and it can be reviewed, if it doesn't seek help regarding the error. Screenshot of a code doesn't give much context to comment on.
Feb 28 2020
Feb 28 2020
In T246438#5927946, @Chtnnh wrote:
Jan 29 2019
Jan 29 2019
Sumit committed rODQ9f82ce9394c9: Address review comments in https://github.com/wiki-ai/draftquality/pull/9.
Address review comments in https://github.com/wiki-ai/draftquality/pull/9
Take most common word sense for polarity score
ADD SentiWordnet requirement to README
Sumit committed rODQ82b05b8d1a25: (WIP) Add feature for polarity using SentiWordnet Adds a library….
(WIP) Add feature for polarity using SentiWordnet Adds a library…
Add json2tsv in requirements.txt
Jan 21 2019
Jan 21 2019
Add label param for enwiki goodfaith in Makefile
Take top 20000 labelled instances then shuffle
Sumit committed rOEQdeb1b19154fc: Add sqwiki features and rules to fetch labeled revision to Makefile.
Add sqwiki features and rules to fetch labeled revision to Makefile
Add models and tuning reports
Retain reverted autolabelled
Add rowiki damaging, goodfaith models to Makefile
Dec 17 2018
Dec 17 2018
Add label param for enwiki goodfaith in Makefile
Sumit committed rOEQb25d9ae86253: Add sqwiki features and rules to fetch labeled revision to Makefile.
Add sqwiki features and rules to fetch labeled revision to Makefile
Take top 20000 labelled instances then shuffle
Add rowiki damaging, goodfaith models to Makefile
Add models and tuning reports
Retain reverted autolabelled
Sep 6 2018
Sep 6 2018
Prepend 'talk_' to headers and fix flake8
Sumit committed rDRAFTTOPICe109cf5d6105: Commit paws script used for fetching pageids for wikiprojects.
Commit paws script used for fetching pageids for wikiprojects
Sumit committed rDRAFTTOPIC4f37293585b7: Add POC for multilabel classification with text data in sklearn.
Add POC for multilabel classification with text data in sklearn
fix documentation of script
Remove bag of words features
Sumit committed rDRAFTTOPIC6899f7a9b19f: Add hashed bow features, script to fetch dependents from text.
Add hashed bow features, script to fetch dependents from text
use older fetch text
Aug 20 2018
Aug 20 2018
Add label param for enwiki goodfaith in Makefile
Take top 20000 labelled instances then shuffle
Sumit committed rOEQa9753b6fc02f: Add sqwiki features and rules to fetch labeled revision to Makefile.
Add sqwiki features and rules to fetch labeled revision to Makefile
Add rowiki damaging, goodfaith models to Makefile
Retain reverted autolabelled
Add models and tuning reports
Add mediawiki-utilities to requirements
Add extract from text
Sumit committed rDRAFTTOPIC6af3f6b11d77: Rewrite fetch_text to process observations in chunks, add test.
Rewrite fetch_text to process observations in chunks, add test
Change page_title to talk_page_title
Add script to text for a list of page titles
remove rev_id and timestamp
Add drafttopic model with 150 estimators, edit Makefile
Add feature extraction rule using revscoring extract
Add gradient boosting config
Drafttopic add GB model, tuning_report and Makefile
Add a labels-processing script to drafttopic
Change word vector features to use a wrapper
Add word vectors to drafttopic
change path to filename
address review comments
fetch_page_wikiprojects: Fix page processing logic
fix flake8 yet again :/
Append meaningful prefixes to mid-level wikiprojects
Add talk_page_title to tests
Address review comments
Escape angular brackets
Fix doc, and remove buggy code in except
Add time profiling to the script
restructure logic for request processing
Add mid_level_wp to arguments
Use update with dict for python3.4 compatibility
Add script to label pages with all wikiprojects
Assert on sorted lists
Commit culture_parsed.json
Edit README and replace print with logging
Add untracked trim_wikiprojects file
Always print starting and end messages of parsing
Sumit committed rDRAFTTOPIC29f69ceb198e: Parser code for generating mapping of mid-level topics to wikiprojects.
Parser code for generating mapping of mid-level topics to wikiprojects
remove traces of mid-level from fetch_wikiprojects
Extract and generate list of mid-level wikiprojects
Cleanup: Remove and add date in output file
Make each file have its own logger
remove helper functions from test_fetch_wikiprojects
Add Wikiproject name to directory mapping
catch generic errors with traceback.format_exc
flake8 sanitizations
Exception handling for request failures
Fix broken regexes, move them before main
PEP8 compliance - Make variables snake case
Add requirements, minor cleanups
Add sanity tests for WikiProjects parser
Add parser tests directory
Fix version to 0.1.1
Sumit committed rDRAFTTOPIC4f236561c80c: Drafttopic: Add bootstrap code and WikiProjects parsing script.
Drafttopic: Add bootstrap code and WikiProjects parsing script
Sumit committed rODQ36d230ee0d8a: Address review comments in https://github.com/wiki-ai/draftquality/pull/9.
Address review comments in https://github.com/wiki-ai/draftquality/pull/9
ADD SentiWordnet requirement to README
Take most common word sense for polarity score
Content licensed under Creative Commons Attribution-ShareAlike (CC BY-SA) 4.0 unless otherwise noted; code licensed under GNU General Public License (GPL) 2.0 or later and other open source licenses. By using this site, you agree to the Terms of Use, Privacy Policy, and Code of Conduct. · Wikimedia Foundation · Privacy Policy · Code of Conduct · Terms of Use · Disclaimer · CC-BY-SA · GPL