Page MenuHomePhabricator

Wikipedia August dump MySQL syntax issue
Closed, DeclinedPublic

Description

Hi,

we were trying to run Mediawiki Dumper and we got that:

ERROR 1064 (42000) at line 127478: You have an error in your SQL syntax; check the manual that corresponds to your MySQL server version for the right syntax to use near ''{{Orphan|date=April 2014}}\n\nThe \'\'\'lung float test\'\'\', also called the ' at line 1

do you know how to get the issue fixed?

This is the command we ran:

java -server -classpath /data/servers/data_load/lib/commons-compress.jar:/data/servers/data_load/lib/mwdumper.jar org.mediawiki.dumper.Dumper --format=sql:1.5 /data/servers/data_load/en/20160801/enwiki-20160801-pages-articles.xml.bz2 | /usr/bin/mysql --max_allowed_packet=1G --default-character-set=utf8 --force -h 127.0.0.1 -uXXXXXXX -pXXXXXXX -P 3306 -D wikimirror_en

mysql version we are using is… 5.6.22

Event Timeline

Can you try to narrow it down some to a small number of pages?

So first ,which pages-articles subfile (1...27) causes the error, and then if at all possible grab a few pages before and after the problematic one and verify that you still get the error, and attach that much smaller test file to this task so we can play? That would cut down the test time a lot.

After that it would be great if you could attach the raw mwdumper output from that small (3-4 pages) test file to this task also and then we can see what's going on.

We really need more information to be able to proceed on this ticket. It's been a few months without a reply, so I'm hoping that means the problem is resolved. For that reason I'm closing it as 'declined'. If not and you want to follow up, just re-open this ticket and add more information to it. Thanks!