It looks like we are extracting different counts of English words for the same revision.
In beta (https://ores-beta.wmflabs.org/scores/euwiki/?models=articlequality&revids=7239990), we get:
{ "euwiki": { "models": { "articlequality": { "version": "0.8.1" } }, "scores": { "7239990": { "articlequality": { "features": { "feature.euwiki.revision.category_links": 12.0, "feature.euwiki.revision.cn_templates": 0.0, "feature.euwiki.revision.image_links": 2.0, "feature.euwiki.revision.infobox_templates": 0.0, "feature.euwiki.revision.paragraphs_without_refs_total_length": 0.0, "feature.len(<datasource.basque.dictionary.revision.dict_words>)": 3220.0, "feature.len(<datasource.english.dictionary.revision.dict_words>)": 631.0, "feature.len(<datasource.spanish.dictionary.revision.dict_words>)": 904.0, "feature.len(<datasource.wikitext.revision.words>)": 3994.0, "feature.wikitext.revision.chars": 37639.0, "feature.wikitext.revision.content_chars": 19054.0, "feature.wikitext.revision.external_links": 39.0, "feature.wikitext.revision.headings_by_level(2)": 10.0, "feature.wikitext.revision.headings_by_level(3)": 5.0, "feature.wikitext.revision.ref_tags": 79.0, "feature.wikitext.revision.wikilinks": 132.0 }, "score": { "prediction": "GA", "probability": { "B": 0.3166585497835499, "C": 0.03317708333333332, "FA": 0.12386093073593067, "GA": 0.5158198051948053, "Start": 0.010483630952380953, "Stub": 0.0 } } } } } } }
In production (https://ores.wikimedia.org/scores/euwiki/?models=articlequality&revids=7239990), we sometimes get something different:
{ "euwiki": { "models": { "articlequality": { "version": "0.8.1" } }, "scores": { "7239990": { "articlequality": { "features": { "feature.euwiki.revision.category_links": 12.0, "feature.euwiki.revision.cn_templates": 0.0, "feature.euwiki.revision.image_links": 2.0, "feature.euwiki.revision.infobox_templates": 0.0, "feature.euwiki.revision.paragraphs_without_refs_total_length": 0.0, "feature.len(<datasource.basque.dictionary.revision.dict_words>)": 3220.0, "feature.len(<datasource.english.dictionary.revision.dict_words>)": 563.0, "feature.len(<datasource.spanish.dictionary.revision.dict_words>)": 904.0, "feature.len(<datasource.wikitext.revision.words>)": 3994.0, "feature.wikitext.revision.chars": 37639.0, "feature.wikitext.revision.content_chars": 19054.0, "feature.wikitext.revision.external_links": 39.0, "feature.wikitext.revision.headings_by_level(2)": 10.0, "feature.wikitext.revision.headings_by_level(3)": 5.0, "feature.wikitext.revision.ref_tags": 79.0, "feature.wikitext.revision.wikilinks": 132.0 }, "score": { "prediction": "GA", "probability": { "B": 0.3101853354978356, "C": 0.03265624999999999, "FA": 0.12594426406926404, "GA": 0.5207305194805196, "Start": 0.010483630952380953, "Stub": 0.0 } } } } } } }
Other times we get this:
{ "euwiki": { "models": { "articlequality": { "version": "0.8.1" } }, "scores": { "7239990": { "articlequality": { "features": { "feature.euwiki.revision.category_links": 12.0, "feature.euwiki.revision.cn_templates": 0.0, "feature.euwiki.revision.image_links": 2.0, "feature.euwiki.revision.infobox_templates": 0.0, "feature.euwiki.revision.paragraphs_without_refs_total_length": 0.0, "feature.len(<datasource.basque.dictionary.revision.dict_words>)": 3220.0, "feature.len(<datasource.english.dictionary.revision.dict_words>)": 712.0, "feature.len(<datasource.spanish.dictionary.revision.dict_words>)": 904.0, "feature.len(<datasource.wikitext.revision.words>)": 3994.0, "feature.wikitext.revision.chars": 37639.0, "feature.wikitext.revision.content_chars": 19054.0, "feature.wikitext.revision.external_links": 39.0, "feature.wikitext.revision.headings_by_level(2)": 10.0, "feature.wikitext.revision.headings_by_level(3)": 5.0, "feature.wikitext.revision.ref_tags": 79.0, "feature.wikitext.revision.wikilinks": 132.0 }, "score": { "prediction": "GA", "probability": { "B": 0.31277958152958163, "C": 0.03202380952380952, "FA": 0.12197104978354974, "GA": 0.5208669282106781, "Start": 0.012358630952380953, "Stub": 0.0 } } } } } } }
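But production also sometimes returns a response that is consistent with ores-beta. All the differences seem to be due to the count of English-language words: feature.len(<datasource.english.dictionary.revision.dict_words>) is 631.0 in beta but 563.0 or 712.0 in production, while every other feature value is identical across the three responses. The small shifts in the class probabilities follow from that one feature.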
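To check this kind of discrepancy without reading the JSON by eye, the feature maps of two responses can be diffed. The following is only a minimal sketch: the helper names (extract_features, diff_features, make_response) are made up for illustration, and the sample data is a trimmed copy of the responses above rather than a live API call.

```python
ENGLISH = "feature.len(<datasource.english.dictionary.revision.dict_words>)"
BASQUE = "feature.len(<datasource.basque.dictionary.revision.dict_words>)"
SPANISH = "feature.len(<datasource.spanish.dictionary.revision.dict_words>)"


def extract_features(response, wiki, rev_id, model):
    """Pull the feature map for one revision out of a scores response."""
    return response[wiki]["scores"][str(rev_id)][model]["features"]


def diff_features(left, right):
    """Return {feature_name: (left_value, right_value)} for values that differ."""
    names = sorted(set(left) | set(right))
    return {
        name: (left.get(name), right.get(name))
        for name in names
        if left.get(name) != right.get(name)
    }


def make_response(english_words):
    """Hypothetical helper: build a trimmed response with a given English count."""
    return {"euwiki": {"scores": {"7239990": {"articlequality": {"features": {
        BASQUE: 3220.0,
        ENGLISH: english_words,
        SPANISH: 904.0,
        "feature.wikitext.revision.chars": 37639.0,
    }}}}}}


# Values copied from the beta response and the first production response.
beta = make_response(631.0)
production = make_response(563.0)

changed = diff_features(
    extract_features(beta, "euwiki", 7239990, "articlequality"),
    extract_features(production, "euwiki", 7239990, "articlequality"),
)
for name, (beta_value, prod_value) in changed.items():
    print(f"{name}: beta={beta_value} production={prod_value}")
```

Running the same comparison against repeated production requests would show whether the English word count is the only feature that varies between calls.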