- generate TFiDF badword lists
- review and aggregation of badwords/informal words by native speaker
- implement revscoring.language.Language (Language utility)
Description
Description
Status | Subtype | Assigned | Task | ||
---|---|---|---|---|---|
Resolved | Halfak | T106836 Russian language utilities | |||
Resolved | Ladsgroup | T110964 TF-IDF to determine global stop words | |||
Resolved | Ladsgroup | T109844 Omit the interwikilinks from stop words |
Event Timeline
Comment Actions
@putnik just helped us make sure that this is done. It's now up to me to integrate the language assets into revscoring