Page MenuHomePhabricator

Generate all description files and look at category statistics for SMVK-Cypern-2017-01
Closed, ResolvedPublic10 Estimated Story Points

Description

Create a text file, for each media file, that contains the wikitext which need to be entered verbatim, on the Commons page that goes with the media file. Also create a list with two columns, one with the original file name and one with the filename you will use on Wikimedia Commons. This will be valuable if the GLAM would like to keep track of their images, or if you mess up.

  • Write script to generate text files and make it publicly available: create_infotexts.py
  • Look at a few of the text files to ensure the structure and contents looks correct (do a sanity check).
  • Look over the list of generated file names to ensure they are descriptive (as specified when creating the mapping document).
  • Ensure category coverage (not including maintenance categories) is sufficiently high (for you to feel comfortable defending the decision to not do more work).

If any of these are unsatisfactory iterate over the mapping and repeat.

Related Objects

StatusSubtypeAssignedTask
ResolvedLokal_Profil
Resolved Mattias_Ostmar-WMSE
ResolvedLokal_Profil
DeclinedNone
ResolvedJopparn
InvalidNone
Resolved Mattias_Ostmar-WMSE
Resolved Mattias_Ostmar-WMSE
Resolved Mattias_Ostmar-WMSE
Resolved Mattias_Ostmar-WMSE
InvalidNone
ResolvedLokal_Profil
InvalidNone
InvalidNone
InvalidNone
ResolvedLokal_Profil
InvalidJopparn
Invalid AxelPettersson_WMSE
Resolved Mattias_Ostmar-WMSE
Resolved Mattias_Ostmar-WMSE
Resolved Mattias_Ostmar-WMSE
Resolved Mattias_Ostmar-WMSE
Resolved Mattias_Ostmar-WMSE
InvalidNone
Resolved Mattias_Ostmar-WMSE
Resolved Mattias_Ostmar-WMSE
Resolved Mattias_Ostmar-WMSE
Declined Mattias_Ostmar-WMSE
Resolved Mattias_Ostmar-WMSE
Resolved Mattias_Ostmar-WMSE
Resolved Mattias_Ostmar-WMSE
Duplicate Mattias_Ostmar-WMSE
Resolved Mattias_Ostmar-WMSE
Resolved Mattias_Ostmar-WMSE
Invalid Mattias_Ostmar-WMSE
Resolved Mattias_Ostmar-WMSE
Resolved Mattias_Ostmar-WMSE
Resolved Mattias_Ostmar-WMSE
Resolved Mattias_Ostmar-WMSE
ResolvedLokal_Profil
Resolved Mattias_Ostmar-WMSE
Resolved Mattias_Ostmar-WMSE
Resolved Mattias_Ostmar-WMSE

Event Timeline

Mattias_Ostmar-WMSE changed the point value for this task from 8 to 12.Feb 13 2017, 8:34 AM
Jopparn changed the point value for this task from 12 to 8.Feb 13 2017, 10:34 AM
Jopparn subscribed.

Mattias worked on this 4 hours last week, but the time estimate was optimistic so still 8 hours are remaining.

Jopparn changed the point value for this task from 8 to 2.Feb 27 2017, 10:20 AM

This has been divided into subtasks that include the Story Points.

Mattias_Ostmar-WMSE changed the point value for this task from 2 to 6.Apr 3 2017, 7:32 AM
Mattias_Ostmar-WMSE changed the point value for this task from 6 to 10.Apr 18 2017, 7:46 AM

This task includes generation of category statistics, which will require some new coding and analysis.

The current mapping generates the following statistics for categories, i.e. about 86 per cent does are not categorized at all:
('cnt_0', 532)
('cnt_1', 59)
('cnt_2', 17)
('cnt_3', 3)
('cnt_4', 3)

With PR#30 merged (which it isn't at the moment) we get the following category statistics:
Content categories statistics
(No of cats, no of cases)
('cnt_0', 279)
('cnt_1', 285)
('cnt_2', 36)
('cnt_3', 9)
('cnt_4', 5)

After adding of categories to Commons and looking for places in <Beskrivning>:

[('Swedish Cyprus Expedition', 147),
('Galini, Cyprus', 103),
('Soloi', 47),
('Agia (Ayia) Irini (Eirini), Cyprus', 46),
('History_of_Nicosia', 39),
('Alfred Westholm', 38),
('Marion, Cyprus', 38),
('Lapithos', 37),
('Erik Sjöqvist', 31),
('Amathus', 26),
('Idalion', 25),
('John Lindros', 20),
('Einar Gjerstad', 19),
('Larnaca', 18),
('Mersinaki, Cyprus', 15),
('Kythrea, Cyprus', 13),
('Milia, Famagusta', 10),
('Nitovikla, Cyprus', 6),
('Findings_from_Enkomi_(Cyprus)', 6),
('Petra tou limniti', 5),
('Martin Gjerstad', 3),
('Ernst Kjellberg', 2),
('Kyrenia', 2),
('Asine, Cyprus', 1),
('Knut Thyberg', 1)]

Currently the category statistics look like this:

no of catsno of cases
0111
1357
2125
318
53

Currently the category statistics look like this:

no of catsno of cases
0111
1357
2125
318
53

I think this should be ok for us to go ahead with the upload. Especially if we get some eyes on the maintenance categories during the edit-a-thon (ping @AxelPettersson_WMSE)