Page MenuHomePhabricator

Run threshold analysis for enwiki in stat8

Authored By
gkyziridis
Jul 28 2025, 2:41 PM
Size
1 KB
Referenced Files
None
Subscribers
None

Run threshold analysis for enwiki in stat8

============ - enwiki - ============
- Raw data shape: (0, 17)
- Duplicate rows found and removed: 0
- Clean data shape: (0, 17)
- Unique revision_ids: 0 | Data Shape: 0 | Same? : -> True
- Removing edits that are reverts from df | New Shape: (0, 17)
- Is any revert_risk_score NA? : False
- Is any user_edit_count NA? : False
- Is any time_to_revert NA? : False
Traceback (most recent call last):
File "/srv/home/gkyziridis/revert_risk_threshold_for_all_wikies.py", line 392, in <module>
rc = ROC(actual=actual, probs=predicted, plot=True, name=wiki_name)
File "/srv/home/gkyziridis/revert_risk_threshold_for_all_wikies.py", line 113, in ROC
fpr, tpr, thresholds = roc_curve(actual, probs)
File "/home/gkyziridis/.conda/envs/2025-05-08T10.39.42_gkyziridis/lib/python3.10/site-packages/sklearn/utils/_param_validation.py", line 216, in wrapper
return func(*args, **kwargs)
File "/home/gkyziridis/.conda/envs/2025-05-08T10.39.42_gkyziridis/lib/python3.10/site-packages/sklearn/metrics/_ranking.py", line 1150, in roc_curve
fps, tps, thresholds = _binary_clf_curve(
File "/home/gkyziridis/.conda/envs/2025-05-08T10.39.42_gkyziridis/lib/python3.10/site-packages/sklearn/metrics/_ranking.py", line 835, in _binary_clf_curve
pos_label = _check_pos_label_consistency(pos_label, y_true)
File "/home/gkyziridis/.conda/envs/2025-05-08T10.39.42_gkyziridis/lib/python3.10/site-packages/sklearn/utils/validation.py", line 2641, in _check_pos_label_consistency
raise ValueError(
ValueError: y_true takes value in {} and pos_label is not specified: either make y_true take value in {0, 1} or {-1, 1} or pass pos_label explicitly.

File Metadata

Mime Type
text/plain; charset=utf-8
Storage Engine
blob
Storage Format
Raw Data
Storage Handle
21615332
Default Alt Text
Run threshold analysis for enwiki in stat8 (1 KB)

Event Timeline