Page MenuHomePhabricator

Spanish wikipedia XML dump errors
Closed, ResolvedPublic

Description

Author: capellan2000

Description:
Recently, browsing the spanish wiki dump (pages-articles.xml)
from 2009 June 15, noticed that many articles shows
wrong contents and redirections. Thousands articles
displays these problems.

For example:

Raúl Brandão redirects to Aklan
Yo argentino redirects to Orquesta Sinfónica de la Universidad de Concepción
Anexo:Descubridores de Planetas Menores shows Categoría:Música clásica de Chile, etc...
Zea perennis redirects to Provincia de Quirino
Joseph-Hector Fiocco shows Raúl Brandão
Orquesta Sinfónica de Concepción redirects to Yo, argentino
Orquesta Sinfónica Universidad de Concepción shows Anexo:Descubridores de Planetas Menores
Categoría:Esculturas del Románico shows Zea perennis
Cuatro días de setiembre shows Joseph-Hector Fiocco
Halloween: Resurección redirects to Orquesta Sinfónica de la Universidad de Concepción
Derecho de representación redirects to Orquesta Sinfónica de la Universidad de Concepción
ONE (single) redirects to Cuatro días de septiembre
Data East Corporation redirects to Derecho de representación (España)
Caso Sanlúcar shows Rencor apasionado
etc...

This error is more frecuent in the last articles of the dump,
than in the beginning articles.

Notice that any website that uses this XML dump, will shows the
same errors, so for this reason i mark this problem as blocker.

If you need an extensive list of these pages, i could create it,
although will take me a few days to verify that every article
had content and redirection problems.


Version: unspecified
Severity: blocker

Details

Reference
bz19420

Event Timeline

bzimport raised the priority of this task from to Medium.Nov 21 2014, 10:37 PM
bzimport set Reference to bz19420.

Marking as duplicate. Thank you for providing a set of examples as I had been waiting for a set.

  • This bug has been marked as a duplicate of bug 18694 ***