Page MenuHomePhabricator

Update revscoring's requirements and support for new versions of python
Closed, ResolvedPublic

Description

Let's take this opportunity (lots of new features in revscoring that various models might use) to update the libraries we rely on and build up support for new versions of python.

New features:

  • Better handling of pt dictionaries (pt models)
  • Portuguese words to watch (pt models)
  • Improved tokenization (all models)

Extend support for python 3.7 and 3.8.

Use the lastest version of sklearn, numpy, and scipy.

Event Timeline

Restricted Application added a subscriber: Aklapper. · View Herald Transcript

This was way more difficult than usual due to some weird version issues with flake8, pyflakes, and python 3.8. But I've worked around them. It seems this is ready to go. https://github.com/wikimedia/revscoring/pull/489