
TimeoutError with PDF retrieval from some repositories
Open, Needs Triage | Public | BUG REPORT

Description

Reported at https://en.wikipedia.org/w/index.php?title=Wikipedia_talk:OABOT&diff=prev&oldid=1172192048

HTTPSConnectionPool(host='lirias.kuleuven.be', port=443): Max retries exceeded with url: /bitstream/123456789/619108/1/MM_JChromB_2017_1_to%20Lirias.docx (Caused by NewConnectionError(': Failed to establish a new connection: [Errno 110] Connection timed out'))
	
Traceback (most recent call last):
  File "/data/project/oabot/www/python/venv/lib/python3.9/site-packages/urllib3/connection.py", line 203, in _new_conn
    sock = connection.create_connection(
  File "/data/project/oabot/www/python/venv/lib/python3.9/site-packages/urllib3/util/connection.py", line 85, in create_connection
    raise err
  File "/data/project/oabot/www/python/venv/lib/python3.9/site-packages/urllib3/util/connection.py", line 73, in create_connection
    sock.connect(sa)
TimeoutError: [Errno 110] Connection timed out

Event Timeline

Some repositories are inevitably stricter than others and will block us; there is little we can do about that. However:

  1. We could reduce the occurrence by shifting more requests to the user's browser, as we used to do for all HTTPS URLs. (This could cause more previews to become downloads instead.)
  2. We must ensure that the main web tool page loads even if the preview fails.
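
For point 2, a rough sketch of what "the page still loads" could look like on the server side. This is not OAbot's actual code: fetch_preview, render_page and PREVIEW_TIMEOUT_SECONDS are hypothetical names, and it uses the standard library's urllib.request rather than the requests/urllib3 stack shown in the traceback, purely to keep the example self-contained. The idea is to give the fetch a bounded timeout and treat any network failure as "no preview" instead of letting the exception propagate.

```python
import urllib.request
from typing import Callable, Optional

# Hypothetical budget; the real tool may want a different value.
PREVIEW_TIMEOUT_SECONDS = 10.0


def fetch_preview(
    url: str,
    timeout: float = PREVIEW_TIMEOUT_SECONDS,
    opener: Callable = urllib.request.urlopen,
) -> Optional[bytes]:
    """Return the document bytes, or None if the repository cannot be reached."""
    try:
        with opener(url, timeout=timeout) as response:
            return response.read()
    except OSError:
        # URLError, HTTPError, TimeoutError (e.g. Errno 110) and
        # ConnectionError are all subclasses of OSError.
        return None


def render_page(url: str, opener: Callable = urllib.request.urlopen) -> str:
    """Build a (placeholder) page body that never fails because of the preview."""
    preview = fetch_preview(url, opener=opener)
    if preview is None:
        # Fall back to a plain link so the user can still open the file.
        return f"Preview unavailable. Open the file directly: {url}"
    return f"Preview loaded ({len(preview)} bytes)"
```

The opener parameter is only there so the failure path can be exercised without touching the network, for example by passing a function that raises TimeoutError.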