
Collect all data for draftquality model in enwiki
Closed, Resolved (Public)

Description

Event Timeline


This is an amazing bug. The table fails to hold more than two rows per revision and model because of this unique index:

CREATE UNIQUE INDEX /*i*/oresc_rev_predicted_model ON /*_*/ores_classification (oresc_rev, oresc_is_predicted, oresc_model);

When oresc_is_predicted = 1, the index is useful, because we should not have more than one predicted class per rev and model. But when oresc_is_predicted = 0, it is too restrictive: only one non-predicted class can be stored per rev and model. I checked and, as far as I can tell, the whole index is useless, so I dropped it completely.
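To make the failure mode concrete, here is a minimal sketch that reproduces it with Python's built-in sqlite3 module. SQLite is only a stand-in for MariaDB here, the column types are assumptions (the real schema is not shown in this task), and the MediaWiki `/*i*/` and `/*_*/` placeholders are omitted. The `insert_all` helper is hypothetical. The three rows are the class scores of a single revision for one three-class model.

```python
import sqlite3

# In-memory SQLite database as a stand-in for MariaDB (assumption).
conn = sqlite3.connect(":memory:")

# Column types are assumed; only the column names come from this task.
conn.execute("""
    CREATE TABLE ores_classification (
        oresc_id INTEGER PRIMARY KEY,
        oresc_rev INTEGER NOT NULL,
        oresc_model INTEGER NOT NULL,
        oresc_class INTEGER NOT NULL,
        oresc_probability REAL NOT NULL,
        oresc_is_predicted INTEGER NOT NULL
    )
""")

# The unique index from this task, without the MediaWiki placeholders.
conn.execute("""
    CREATE UNIQUE INDEX oresc_rev_predicted_model
    ON ores_classification (oresc_rev, oresc_is_predicted, oresc_model)
""")

# (rev, model, class, probability, is_predicted) for one revision.
rows = [
    (811688346, 26, 1, 0.727, 1),
    (811688346, 26, 2, 0.188, 0),
    (811688346, 26, 3, 0.069, 0),
]


def insert_all(conn, rows):
    """Insert rows one by one and return those rejected by a constraint."""
    rejected = []
    for row in rows:
        try:
            conn.execute(
                "INSERT INTO ores_classification "
                "(oresc_rev, oresc_model, oresc_class, "
                "oresc_probability, oresc_is_predicted) "
                "VALUES (?, ?, ?, ?, ?)",
                row,
            )
        except sqlite3.IntegrityError:
            rejected.append(row)
    return rejected


# With the index in place, the second non-predicted class collides on
# (oresc_rev, oresc_is_predicted, oresc_model) and is rejected.
rejected_with_index = insert_all(conn, rows)

# After dropping the index, the rejected row can be stored.
conn.execute("DROP INDEX oresc_rev_predicted_model")
rejected_without_index = insert_all(conn, rejected_with_index)

stored = conn.execute(
    "SELECT COUNT(*) FROM ores_classification"
).fetchone()[0]
```

With the index, only the predicted row and the first non-predicted row fit, so any model with more than two classes loses data. Keeping the one-predicted-class guarantee without this side effect would need a different constraint; the sketch only mirrors the fix described here, which is to drop the index.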

It's done now. \o/:

MariaDB [enwiki_p]> select * from ores_classification where oresc_model = 26 order by oresc_rev desc limit 50;
+-----------+-----------+-------------+-------------+-------------------+--------------------+
| oresc_id  | oresc_rev | oresc_model | oresc_class | oresc_probability | oresc_is_predicted |
+-----------+-----------+-------------+-------------+-------------------+--------------------+
| 173926332 | 811688346 |          26 |           3 |             0.069 |                  0 |
| 173926331 | 811688346 |          26 |           2 |             0.188 |                  0 |
| 173926330 | 811688346 |          26 |           1 |             0.727 |                  1 |