nostalgia.wikipedia.org possibly should be robots.txt'd out of search engines
Closed, ResolvedPublic
Actions

Assigned To

None

Authored By

	• brion
	Aug 21 2008, 12:05 AM

Description

Have received vague reports of nostalgia.wikipedia.org showing up unexpectedly in regular Google search results. (This holds a copy of Wikipedia's database from early 2002, displayed in the old-style 'Nostalgia' skin, and was put up for one of Wikipedia's anniversary celebrations a few years ago.)

The nostalgia site appears to be served out of the primary document root, so gets the regular robots.txt; we should possibly give it a custom docroot with a blanked Disallow robots.txt, which would phase it out of general web search indexes.

Version: unspecified
Severity: normal

Details

Reference: bz15253

Related Objects

Mentioned In: T326334: Maintenance scripts create pages on the Nostalgia Wikipedia

Event Timeline

• bzimport raised the priority of this task from to Medium.Nov 21 2014, 10:19 PM

• bzimport added projects: WMF-General-or-Unknown, Shell.

• bzimport set Reference to bz15253.

• bzimport added a subscriber: Unknown Object (MLST).

• brion created this task.Aug 21 2008, 12:05 AM

We should just redirect robots.txt to extract2.php and have them edited via the web.

Hmmm, sounds kind of scary but would probably work fine. :)

jeluf wrote:

Will robots follow redirects for robots.txt? I guess we should proxy the request to extract2.php.

jeluf wrote:

Done.

User-agent: *
Disallow: /

Nintendofan885 mentioned this in T326334: Maintenance scripts create pages on the Nostalgia Wikipedia.Jan 5 2023, 4:55 PM

nostalgia.wikipedia.org possibly should be robots.txt'd out of search enginesClosed, ResolvedPublicActions

Description

Details

Related Objects

Event Timeline

nostalgia.wikipedia.org possibly should be robots.txt'd out of search engines
Closed, ResolvedPublic
Actions