- Find Hindi speakers to help us
- Run Bad-Words-Detection-System to get potential badword list
- Human review of BWDS list
- Integrate into revscoring
I left a note on the user talk page for @hindustanilanguage. The discussion is still at an early, introductory phase.
@Halfak Found a Hindi word list which is ready for review: https://meta.wikimedia.org/wiki/Research:Revision_scoring_as_a_service/Word_lists/hi
Here are some suggestions for what we're looking for in a review: https://www.mediawiki.org/wiki/ORES/BWDS_review
Admins are interested in opening the discussion, and would like to see a demo of what ORES can accomplish. https://hi.wikipedia.org/wiki/सदस्य_वार्ता:Hindustanilanguage#Reaching_out_for_help_with_ORES
This is a user's personal talk page, not a community discussion page.
Discussion moved to Community Village pump:
https://hi.wikipedia.org/wiki/विकिपीडिया:चौपाल#Reaching_out_for_help_with_ORES
Thanks @Igarg2001! Right now, we need someone to review the list generated at https://meta.wikimedia.org/wiki/Research:Revision_scoring_as_a_service/Word_lists/hi
We need someone to sort this list into bad words and informal words. It's common that many words in the list will be there by mistake, so we need to filter those out. Can you help?
FYI: I just reviewed the discussion on hiwiki linked above and it seems clear to me that there is general, positive support for ORES -- at least there was back in 2017 when we first asked for help with this task.
@Halfak I can help sort those words. But most of them are already in https://github.com/wikimedia/revscoring/blob/master/revscoring/languages/hindi.py, so do we still need to sort them?
Aha! It does look like this is set. Does the word list you see at https://github.com/wikimedia/revscoring/blob/master/revscoring/languages/hindi.py look mostly alright to you?
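For anyone wanting to check the overlap mentioned above, here is a minimal sketch of one way to see which candidate words are not yet covered by an existing pattern list. It is only an illustration: `uncovered_candidates`, the placeholder patterns, and the placeholder words are all hypothetical, and it uses plain Python `re` rather than anything from revscoring itself.

```python
import re


def uncovered_candidates(candidates, existing_patterns):
    """Return candidate words that no existing pattern matches in full.

    Hypothetical helper: `existing_patterns` stands in for the regex
    strings already listed in a language file; it is not read from
    revscoring here.
    """
    compiled = [re.compile(p, re.IGNORECASE) for p in existing_patterns]
    return [
        word for word in candidates
        if not any(c.fullmatch(word) for c in compiled)
    ]


# Placeholder data (assumption): real input would be the BWDS candidate
# list and the patterns already present in the language file.
existing_patterns = [r"foo+", r"bar"]
candidates = ["fooo", "bar", "baz"]

# Only the words left over here would still need human review.
remaining = uncovered_candidates(candidates, existing_patterns)
print(remaining)
```

Words returned by the helper are the ones a reviewer would still need to sort into bad words, informal words, or mistakes.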
The previous comments don't explain who or what (task?) exactly this task is stalled on ("If a report is waiting for further input (e.g. from its reporter or a third party) and can currently not be acted on"). Hence resetting task status, as tasks should not be stalled (and then potentially forgotten) for unclear reasons.
Removing task assignee due to inactivity, as this open task has been assigned for more than two years. See the email sent to the task assignee on February 06th 2022 (and T295729).
Please assign this task to yourself again if you still realistically [plan to] work on this task - it would be welcome.
If this task has been resolved in the meantime, or should not be worked on ("declined"), please update its task status via "Add Action… 🡒 Change Status".
Also see https://www.mediawiki.org/wiki/Bug_management/Assignee_cleanup for tips on how to best manage your individual work in Phabricator.
Looks like this is done as part of T252581: Train and test editquality models for Hindi Wikipedia