Page MenuHomePhabricator

Generate all description files and look at category statistics for SMVK-Mexico
Closed, ResolvedPublic

Description

Create a text file, for each media file, that contains the wikitext which need to be entered verbatim, on the Commons page that goes with the media file. Also create a list with two columns, one with the original file name and one with the filename you will use on Wikimedia Commons. This will be valuable if SMVK would like to keep track of their images, or if you mess up.

  • Write script to generate text files and make it publicly available: on github
  • Look at a few of the text files to ensure the structure and contents looks correct (do a sanity check).
  • Look over the list of generated file names to ensure they are descriptive (as specified when creating the mapping document).
  • Ensure category coverage (not including maintenance categories) is sufficiently high (for you to feel comfortable defending the decision to not do more work).

If any of these are unsatisfactory iterate over the mapping and repeat.

Event Timeline

Noticed that one new filename includes an exotic character: Från_utgrävningarna_vid_Thomas_Palmas_hus._Dr_Sigvald_Linné._-_SMVK_-_0307.a.0040.info -even though it has gone through the "format_filename"-method in BatchUploadTools. Here is a screen dump of the actual glyph. Is this a problem @Lokal_Profil ?

Selection_011.jpg (33×948 px, 4 KB)

Here's it's unicode data

Unicode character Oct Dec Hex HTML

\x{8F} ss3 single shift three 0217 143 0x8F 

Noticed that one new filename includes an exotic character: Från_utgrävningarna_vid_Thomas_Palmas_hus._Dr_Sigvald_Linné._-_SMVK_-_0307.a.0040.info -even though it has gone through the "format_filename"-method in BatchUploadTools. Here is a screen dump of the actual glyph. Is this a problem @Lokal_Profil ?

Selection_011.jpg (33×948 px, 4 KB)

Here's it's unicode data

Unicode character Oct Dec Hex HTML

\x{8F} ss3 single shift three 0217 143 0x8F 

Jupp. I'll try to update the filter to get rid of that one. Thanks for letting me know.

Noticed that one new filename includes an exotic character: Från_utgrävningarna_vid_Thomas_Palmas_hus._Dr_Sigvald_Linné._-_SMVK_-_0307.a.0040.info -even though it has gone through the "format_filename"-method in BatchUploadTools. Here is a screen dump of the actual glyph. Is this a problem @Lokal_Profil ?

Selection_011.jpg (33×948 px, 4 KB)

Here's it's unicode data

Unicode character Oct Dec Hex HTML

\x{8F} ss3 single shift three 0217 143 0x8F 

Jupp. I'll try to update the filter to get rid of that one. Thanks for letting me know.

The newest version of the py3compat branch should fix this. (Note that pywikibot is acting up so things might not work as desired on that side)

I noticed that if the photographer is Sigvald Linné, the field photographer wasn't filled i, only a category was added. That is fixed now.

Lokal_Profil renamed this task from Generate all description files and look at category statistics for Medelhavsmuseet to Generate all description files and look at category statistics for SMVK-Mexico.Dec 6 2016, 3:47 PM
Lokal_Profil updated the task description. (Show Details)

I've made two PRs to the repo:

OK. Could you do a sanity check on my solution to the canary bird-patch?

I've made a new PR to prevent it from creating double descriptions.

If you want me to take a look at things it's easiest to put them in a separate branch and then make a PR (that way I can compare the code changes over multiple commits). It also makes it easy to squash multiple small commits into one afterwards which is useful if you are trying out different solutions or making incremental changes.