Process findings of Commons:Monuments database/Images without id
Open, Needs Triage, Public

Description

As summarized at Commons:Monuments database/Images without id, ErfgoedBot has found about 31,000 Commons file pages that are used in Wikipedia monument lists but lack a monument template with a monument ID. A large share of them (8,500) relate to Czech monuments, as listed here. The list has been updated periodically since October 2018; however, no tool or bot processes the findings.

As discussed here, a tool exists, but it had some problems (T206398). However, IMHO the Czech list is not affected by that problem. Following the discussion, I'm filing this task in Phabricator so that it is not forgotten.

P.S.: Commons category pages of listed monuments, lacking monument ID templates, should be processed in a similar way.

Event Timeline

So I spotted that on the 10th of February the list got emptied. @SJu did you do a massive tagging drive or did the job just fail?

If it was just a failed job, then the following is what we should run to try to add the templates:
jsub -once -j y -o /data/project/heritage/logs/cz_cs_image_templates.log -N cz_cs_image_templates /data/project/heritage/bin/run_erfgoedbot_script.sh erfgoedbot/images_of_monuments_without_id.py -countrycode:cz -langcode:cs -add_template >> /data/project/heritage/logs/cz_cs_image_templates2.log

Spotted that loads of these jobs crashed on the 10th so went ahead and ran the command. Not sure why @Stashbot didn't pick up on the SAL entry.

Stashbot seems to have hit a few problems because of an issue in the eqiad datacentre; ops are aware.

Templates seem to be getting added fine. I'm going to leave it running in the background for now; it should be done before the regular daily update job kicks in.

File:Štramberk, Horní Bašta 293.jpg seems to be a false positive in the list Images_of_cultural_heritage_monuments_in_Czech_Republic_without_id, version 2020-02-12, 05:15 UTC. The list proposed adding {{Cultural Heritage Czech Republic|13170/8-3390}} to this photo, although the file page has contained this tag since it was uploaded on 2014-09-03.

ErfgoedBot added a duplicate monument-ID template to the file page (2020-02-11, 23:31).
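
To illustrate the kind of guard that would avoid the duplicate described above, here is a minimal sketch in Python. It is not taken from ErfgoedBot; the function names has_monument_template and add_template_if_missing are hypothetical, and the sketch assumes the ID is passed as the first unnamed parameter. It does not handle template redirects, a "Template:" prefix, or an ID given as "1=...".

```python
import re


def has_monument_template(wikitext: str, template_name: str, monument_id: str) -> bool:
    """Return True if wikitext already holds {{template_name|monument_id}}.

    Hypothetical helper, not part of ErfgoedBot. Assumes the monument ID is
    the first unnamed template parameter.
    """
    # MediaWiki treats spaces and underscores in template names as equivalent
    # and ignores the case of the first letter, so allow both variations.
    parts = [p for p in re.split(r"[ _]+", template_name.strip()) if p]
    first_word = "(?i:" + re.escape(parts[0][0]) + ")" + re.escape(parts[0][1:])
    name_pattern = r"[ _]+".join([first_word] + [re.escape(p) for p in parts[1:]])

    # Require "|" or "}}" right after the ID so that a shorter ID cannot
    # match as a prefix of a longer one.
    pattern = (
        r"\{\{\s*" + name_pattern + r"\s*\|\s*"
        + re.escape(monument_id) + r"\s*(?:\||\}\})"
    )
    return re.search(pattern, wikitext) is not None


def add_template_if_missing(wikitext: str, template_name: str, monument_id: str) -> str:
    """Append the template only when the page does not already carry it."""
    if has_monument_template(wikitext, template_name, monument_id):
        return wikitext  # already tagged, leave the page untouched
    return wikitext.rstrip("\n") + "\n{{" + template_name + "|" + monument_id + "}}\n"
```

With a check like this, a page that already carries {{Cultural Heritage Czech Republic|13170/8-3390}} would be returned unchanged rather than tagged a second time. Whether the false positive came from the list generation or from the tagging step is not established in this thread, so this only addresses the symptom on the file page.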

Aklapper subscribed.

Removing task assignee due to inactivity as this open task has been assigned for more than two years. See the email sent to the task assignee on August 22nd, 2022.
Please assign this task to yourself again if you still realistically [plan to] work on this task - it would be welcome!
If this task has been resolved in the meantime, or should not be worked on ("declined"), please update its task status via "Add Action… 🡒 Change Status".
Also see https://www.mediawiki.org/wiki/Bug_management/Assignee_cleanup for tips how to best manage your individual work in Phabricator. Thanks!