We have been receiving reports from users, particularly in the Machine-Learning-Team that downloading files using wget or curl from https://analytics.wikimedia.org has become very unreliable since around Feb 1st 2024.
Here is a paste with some test results.
P56346
A specific test case that is not working for me from my workstation is:
wget -4 --continue https://analytics.wikimedia.org/published/wmf-ml-models/revertrisk/multilingual/20230810110019/model.pkl
That request took four attemps to succeed, as shown here:
--2024-02-06 17:07:46-- https://analytics.wikimedia.org/published/wmf-ml-models/revertrisk/multilingual/20230810110019/model.pkl Resolving analytics.wikimedia.org (analytics.wikimedia.org)... 185.15.59.224 Connecting to analytics.wikimedia.org (analytics.wikimedia.org)|185.15.59.224|:443... connected. HTTP request sent, awaiting response... 200 OK Length: 2649622715 (2.5G) Saving to: ‘model.pkl’ model.pkl 31%[====================================> ] 803.00M 12.2MB/s in 68s 2024-02-06 17:08:55 (11.7 MB/s) - Connection closed at byte 842006528. Retrying. --2024-02-06 17:08:56-- (try: 2) https://analytics.wikimedia.org/published/wmf-ml-models/revertrisk/multilingual/20230810110019/model.pkl Connecting to analytics.wikimedia.org (analytics.wikimedia.org)|185.15.59.224|:443... connected. HTTP request sent, awaiting response... 206 Partial Content Length: 2649622715 (2.5G), 1807616187 (1.7G) remaining Saving to: ‘model.pkl’ model.pkl 59%[+++++++++++++++++++++++++++++++++++++===============================> ] 1.46G 12.1MB/s in 68s 2024-02-06 17:10:04 (10.2 MB/s) - Connection closed at byte 1566572544. Retrying. --2024-02-06 17:10:06-- (try: 3) https://analytics.wikimedia.org/published/wmf-ml-models/revertrisk/multilingual/20230810110019/model.pkl Connecting to analytics.wikimedia.org (analytics.wikimedia.org)|185.15.59.224|:443... connected. HTTP request sent, awaiting response... 206 Partial Content Length: 2649622715 (2.5G), 1083050171 (1.0G) remaining Saving to: ‘model.pkl’ model.pkl 83%[+++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++============================> ] 2.06G 11.5MB/s in 68s 2024-02-06 17:11:15 (9.12 MB/s) - Connection closed at byte 2216689664. Retrying. --2024-02-06 17:11:18-- (try: 4) https://analytics.wikimedia.org/published/wmf-ml-models/revertrisk/multilingual/20230810110019/model.pkl Connecting to analytics.wikimedia.org (analytics.wikimedia.org)|185.15.59.224|:443... connected. HTTP request sent, awaiting response... 206 Partial Content Length: 2649622715 (2.5G), 432933051 (413M) remaining Saving to: ‘model.pkl’ model.pkl 100%[++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++===================>] 2.47G 12.0MB/s in 57s 2024-02-06 17:12:15 (7.23 MB/s) - ‘model.pkl’ saved [2649622715/2649622715]
The first recorded occurrence was 2024-01-31T17:22:01+0100 and it has been happening frequently since then.
The web server behind analytics.wikimedia.org, that's an-web1001, was upgraded to bullseye on Feb 6th under ticket: T349398
However, this behaviour was observed both prior to and after the upgrade.
This is causing some inconvenience for the ML team, as they have to retry their downloads.
I have tried a wget from stat1004 to analytics.wikimedia.org and that also required two attempts.
btullis@stat1004:~$ wget https://analytics.wikimedia.org/published/wmf-ml-models/revertrisk/multilingual/20230810110019/model.pkl --2024-02-06 17:16:38-- https://analytics.wikimedia.org/published/wmf-ml-models/revertrisk/multilingual/20230810110019/model.pkl Resolving analytics.wikimedia.org (analytics.wikimedia.org)... 2620:0:861:ed1a::1, 208.80.154.224 Connecting to analytics.wikimedia.org (analytics.wikimedia.org)|2620:0:861:ed1a::1|:443... connected. HTTP request sent, awaiting response... 200 OK Length: 2649622715 (2.5G) Saving to: ‘model.pkl’ model.pkl 74%[=======================================================================================> ] 1.85G 28.3MB/s in 66s 2024-02-06 17:17:44 (28.6 MB/s) - Connection closed at byte 1987051520. Retrying. --2024-02-06 17:17:45-- (try: 2) https://analytics.wikimedia.org/published/wmf-ml-models/revertrisk/multilingual/20230810110019/model.pkl Connecting to analytics.wikimedia.org (analytics.wikimedia.org)|2620:0:861:ed1a::1|:443... connected. HTTP request sent, awaiting response... 206 Partial Content Length: 2649622715 (2.5G), 662571195 (632M) remaining Saving to: ‘model.pkl’ model.pkl 100%[++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++=============================>] 2.47G 28.6MB/s in 39s 2024-02-06 17:18:24 (16.2 MB/s) - ‘model.pkl’ saved [2649622715/2649622715]