Page MenuHomePhabricator

allow TextCat to use multiple language model directories
Closed, ResolvedPublic

Description

Allows us to use WikiText-based models and query-text-based models without having to put them in one directory (which requires duplication and confuses provenance). Generalize to any number of directories. Expected outcome is improved recall and possible boost to precision, by identifying some languages for which we have no query-text-based models, but for which we have or can easily generate wiki-text-based models.

Update Perl and PHP versions of TextCat.

Event Timeline

Restricted Application added a subscriber: Aklapper. · View Herald Transcript

Change 320852 had a related patch set uploaded (by Tjones):
Allow TextCat to use multiple language model directories

https://gerrit.wikimedia.org/r/320852

Change 320852 merged by jenkins-bot:
Allow TextCat to use multiple language model directories

https://gerrit.wikimedia.org/r/320852