Page MenuHomePhabricator

Add a Commons user search preference to exclude certain file types / mime types
Open, LowPublic

Assigned To
None
Authored By
Fae
Dec 11 2020, 1:05 PM
Referenced Files
None
Tokens
"100" token, awarded by Huntster."Yellow Medal" token, awarded by Fae."Mountain of Wealth" token, awarded by Pigsonthewing.

Description

There is no way for users to opt for a default search option to exclude documents. Adding "-filemime:pdf -filemime:djvu" verges on being incomprehensible for most non-tech users.

This could be usefully added as a site user preference, or made another field in the Commons search UI.

This is an issue made more significant recently, with the IA books project adding a million PDFs to the collections on Commons. Consequently even simple (non-document type) searches like "cats with flowers" are returning lots of uninteresting looking PDFs in the top search returns, unless you happen to be very interested in Seed Trade Catalogs.

Event Timeline

When looking at a category, we already have a gadget for displaying only a certain filetype. When searching in other ways, we have selectors for namespace. I wonder if we could have a search-selector for filetype?

Aklapper renamed this task from Add a Commons user search preference to exclude documents to Add a Commons user search preference to exclude certain file types / mime types.Dec 18 2020, 6:41 AM
CBogen removed a project: SDAW-MediaSearch.
CBogen subscribed.

Removing the SDAW-MediaSearch tag because the MediaSearch interface already has tabs that allow users to view results by file type, so presumably this is about the old/standard Commons search interface.

the MediaSearch interface already has tabs that allow users to view results by file type

If you mean the search currently used by en.Wikipedia, for example, it allows "search for one file type", it does not allow the user to search one, or a few, types and see results for all other types.

the MediaSearch interface already has tabs that allow users to view results by file type

If you mean the search currently used by en.Wikipedia, for example, it allows "search for one file type", it does not allow the user to search one, or a few, types and see results for all other types.

The SDAW-MediaSearch project refers to the interface at https://commons.wikimedia.org/wiki/Special:MediaSearch.

MPhamWMF subscribed.

Closing out low/est priority tasks over 6 months old with no activity within last 6 months in order to clean out the backlog of tickets we will not be addressing in the near term. Please feel free to reopen if you think a ticket is important, but bare in mind that given current priorities and resourcing, it is unlikely for the Search team to pick up these tasks for the indefinite future. We hope that the requested changes have either been addressed by or made irrelevant by work the team has done or is doing -- e.g. upgrading Elasticsearch to a newer version will solve various ES-related problems -- or will be subsumed by future work in a more generalized way.

RhinosF1 removed a project: Discovery-Search.
RhinosF1 subscribed.

Re-opening tasks and removing from team workboard per IRC feedback given yesterday and discussion with MPham.