Page MenuHomePhabricator

Google caches serious vandalism for a relatively long time.
Closed, ResolvedPublic

Description

Author: jeh

Description:
Screen shot of Jim Carrey's search listing

This week in two separate incidents high profile articles Wikipedia articles were vandalized and reverted within minutes. During the short window of vandalism, Googlebot cached and displayed slanderous material for a full day at the top of the search results. Due to the large number of articles in Wikipedia, and high search rankings, this is an increasing problem with the potential to damage the lives of article subjects and embarrass Wikipedia.

See attached screen shot, and these discussions:

One resolution strategy is to use an allowable form of cloaking, called "content delivery." We could apply the semi-protection criteria (not semi-protection itself) to article history to determine that last version that was saved by "good" user. This version can be accessed with an additional URL parameter, such as ?version=lastgood. When a search engine bot, such as Googlebot shows up and identifies itself (through the user-agent field in the http request header), you program a conditional redirect via .htaccess to append "?version=lastgood" to the URL, thus serving a slightly older, but more reliable copy of the page. This would avoid further embarrassment to Wikipedia, and help prevent harm to subjects of articles.

Over at the Wikipedia Administrators' Notice board, it was suggested to file a bug report. If you need further help with this, feel free to contact me. I am a professional SEO and web developer who can donate services.


Version: unspecified
Severity: major
URL: http://en.wikipedia.org

Attached:

jim-carrey.jpg (289×623 px, 193 KB)

Details

Reference
bz9983

Event Timeline

bzimport raised the priority of this task from to Medium.Nov 21 2014, 9:39 PM
bzimport added a project: MediaWiki-Search.
bzimport set Reference to bz9983.
bzimport added a subscriber: Unknown Object (MLST).

ayg wrote:

This will be fixed by stable versions, which appear to be moving ahead full steam. Unless someone has a really quick and easy fix, there's probably not much point putting in effort to fix this immediately when it should be fixed soon enough anyway.

jeh wrote:

Thanks. Where can I follow the news and status of stable versions? Bear with me. I am new here.

FlaggedRevs are now live on en: Whether the community wants to retain them, and if to retain, to which extent to use them, is a non-technical problem.