List of steps to reproduce (step by step, including full links if applicable):
- Search in Commons for "Kota Surakarta", a city name in Indonesia: https://commons.wikimedia.org/w/index.php?title=Special:MediaSearch&search=kota+surakarta&fulltext=Cari&type=image
- Get many dozens of results that are not related to the city at all
- Cluttering search result and confusing viewer
- related to [[mediawiki:Topic:Wyh8f0obblz9wdd9]]
What happens?:
Some of the search results include:
- https://commons.wikimedia.org/wiki/File:Solar_Solo.jpg - a non-Indonesian bus, unused file
- https://commons.wikimedia.org/wiki/File:Solo-645.jpg - a chainsaw, again, non-Indonesian
- https://commons.wikimedia.org/wiki/File:Hope_Solo.jpg - an American footballer (and many more of her images)
- https://commons.wikimedia.org/wiki/File:Solo_T%C3%BCrk_1.jpg - a Turkish(?)/Polish(?) jet
- https://commons.wikimedia.org/wiki/File:Solo_Z%C3%BCnder_SOLO_match_factories_01.JPG - an Austrian match product
- https://commons.wikimedia.org/wiki/File:Bobby_Solo_2008.jpg - an Italian musician
- https://commons.wikimedia.org/wiki/File:Gateway_Solo_2200.JPG - a 1996 laptop
- https://commons.wikimedia.org/wiki/File:Solo_Solo_(11741234423).jpg - a British bus, unused file
None of them have anything to do with Surakarta City, and nowhere in the title, description, meta, matches any resemblance with "Surakarta" or "City" ("Kota", in Indonesian).
I happened to found this bug when I was creating a Wikistory in Indonesian Wikipedia:
https://www.mediawiki.org/wiki/Topic:Wyh8f0obblz9wdd9
I suspect this has to do that https://id.wikipedia.org/wiki/Surakarta is colloquially known as "Solo" for short. But where does this information seeped into the search result?
What should have happened instead?:
- Nothing unrelated to "Kota Surakarta" should be displayed
- Images with titles, description, category (and subcat), matching the search term should have greater weight, and displayed on the top, therefore, images with neither title, description, category (and subcat), matching the search term should be pushed to the very last of search result.
- Ideally, images should be sorted by usages in projects. Greater usages = better quality image (hopefully). Other consideration for extra weight would be: Picture of the Day status, title matching exactly the search term, how old is the file, multiple occurences of the search terms in the title and description and categories, whether the image is in the top category or way deep in the subcategories.
- Images in the category with Exact match as the search term should be displayed
Software version (if not a Wikimedia wiki), browser information, screenshots, other information, etc.: