Hi,
We started deploying ContentTranslation in January without adding interlanguage links to the created translated articles. Initially we were publishing pages as drafts, and we assumed that people will add interlanguage links after proper publishing to the main space.
Later we started publishing directly to the main space, so it made sense to add interlanguage links automatically. After some delays we have a patch to do this. See T87410 and https://gerrit.wikimedia.org/r/#/c/214119/ .
We have over 4000 published articles, and all of them are supposed to have interlanguage links, but some still don't because the people forgot to add them. I linked some of them manually, but with so many pages it's inefficient to do it manually.
There should be a bot that does the following:
- Goes over all the articles created using ContentTranslation. I guess that it's better to get the list from the central contenttranslation database, but maybe it's better to get them from the tags? @santhosh, @Nikerabbit, your opinion?
- If the article has a site link - all good, nothing more to do. (This will probably be the majority.)
- If the article doesn't have a site link, then link it with the article from which it was translated.
- Quite a lot of translated articles were moved. Some redirects were kept and some weren't. If a redirect was kept, link the target page with the article from which it was translated.
- If an article cannot be found, then it was either deleted, or moved without leaving a redirect. The bot should create a list of those to check manually. It would be nice to at least have a list of what to the article - just deleted or moved without leaving a redirect; it should be possible for a bot to do this.