Page MenuHomePhabricator

DBQ-94 Find all SVG Images on commons
Closed, ResolvedPublic

Description

This issue was converted from https://jira.toolserver.org/browse/DBQ-94.
Summary: Find all SVG Images on commons
Issue type: Task - A task that needs to be done.
Priority: Major
Status: Done
Assignee: Guandalug <A.Meiske@nightstone.de>


From: Leo k <dieb.uche@gmail.com>

Date: Fri, 25 Jun 2010 10:48:03

Could someone provide me with a list of all SVGs uploaded to commons? I'm an admin at commons and this would be helpful in finding broken svgs, which can't be currently rendered.
I can't provide an exact query though, but I don't think it would be too hard to do.


Version: unspecified
Severity: major

Details

Reference
bz59346

Event Timeline

bzimport raised the priority of this task from to Needs Triage.Nov 22 2014, 2:24 AM
bzimport set Reference to bz59346.

From: Guandalug <A.Meiske@nightstone.de>

Date: Fri, 25 Jun 2010 23:35:39

Run query:

mysql -X -h commonswiki-p.db.toolserver.org -D commonswiki_p --default-character-set=UTF8 -n --delimiter=';' -e "select img_name,img_minor_mime from image where lower(img_name) like '%.svg';" | tee SVGList.xml

There's most likely a better way doing it, but this ought to work.


From: Guandalug <A.Meiske@nightstone.de>

Date: Fri, 25 Jun 2010 23:36:30

Compressed XML result file of the forementioned query


From: Guandalug <A.Meiske@nightstone.de>

Date: Fri, 25 Jun 2010 23:37:33

Query done


From: Leo k <dieb.uche@gmail.com>

Date: Sat, 26 Jun 2010 11:38:45

Did you maybe double-encode the data? I tried opening as UTF8, UTF16, Latin1 and a few others & I always get the nonstandard chars garbled


From: DaB. <dab@ts.wikimedia.org>

Date: Sat, 26 Jun 2010 12:00:14

@Guadalug "--default-character-set=UTF8" is wrong, the data is stored as latin1 (but it is in reality utf8). Remove that and check the charset of your terminal, if you get garbish.


From: Guandalug <A.Meiske@nightstone.de>

Date: Sat, 26 Jun 2010 13:52:00

New attempt. CSV this time, Characters seem to be intact as far as my UTF8-capable editor can be trusted.

This bug was imported as RESOLVED. The original assignee has therefore not been
set, and the original reporters/responders have not been added as CC, to
prevent bugspam.

If you re-open this bug, please consider adding these people to the CC list:
Original assignee: guandalug@nurfuerspam.de
CC list: guandalug@nurfuerspam.de, wikimedia-bugzilla@dabpunkt.eu