In this change https://gerrit.wikimedia.org/r/#/c/393100/ @Zoranzoki21 showed the problem of downloading a large archive to use reflinks.py.
It is therefore proposed to create a web server, for which reflinks.py could interact by API.
Description
Details
Event Timeline
In each instalation, to user can use reflinks script, have to have this file in main folder.. I support idea for creating web server, with which reflinks.py will interact by API. Than user(s) will not need to every time download this file.
https://github.com/wikimedia/pywikibot/blob/master/scripts/reflinks.py. Not 100% sure what it does though.
Change 395094 had a related patch set uploaded (by Zoranzoki21; owner: jenkins-bot):
[pywikibot/core@master] Disable needing text file for running reflinks.py script
Change 395094 abandoned by Zoranzoki21:
Disable needing text file for running reflinks.py script
Reason:
What happened?
@Zoranzoki21, @zhuyifei1999, @Cyberpower678: reflinks.py is a Python script, which goes through bare links in a wiki page (pages) and finds out more details (page title, mime type). This script uses a pre-generated text file containing 404 links (gathered from wiki pages) to avoid marking temporarily unaccessible links as dead links. The text file could be downloaded from script-author's webpage, but it is old and unmaintained there and contains only enwiki articles. It can also be newly created by yourself using Python script weblinkchecker.py, but it takes a week to create this list (in order to eliminate temporarily unaccessible links). Currently we are looking for a better solution.
Change 395095 had a related patch set uploaded (by Zoranzoki21; owner: Zoranzoki21):
[pywikibot/core@master] reflinks.py: Disable needing 404-links.txt for running script
I thinking on one solution. To I run weblinkchecker from toolforge. And then we will got clean updated file.
You know I have a container web script that can run checks on an array of links. I can set it up.
Change 395095 abandoned by Zoranzoki21:
reflinks.py: Disable needing 404-links.txt for running script