Problem
As part of the work under WE5.4 to protect our infrastructure from abusive scraping, we want to be able to distinguish between requests for media files that come from authenticated and anonymous users.
This will allow us to exclude our logged in users, including community bots, from any filtering that we implement for access to media files, in particular for generating thumbnails not currently in cache.
Approach
Enable identification of logged in users requesting media files by supporting authentication cookies on upload.wikimedia.org. The CDN can then validate the JWT session to use the information as a signal in abuse filtering.
Acceptance criteria
- JWT session cookies included on requests to upload.wikimedia.org
- Other cookies are not required to be present, but may be included
- SRE can configure CDN to validate JWT sessions on media requests