The goal is to find for each year of WLM, and per country an estimate of what new contributor retention looks like. We can only use public data for this.
I have a list of submitted images to WLM with their uploader (on PAWS). For each uploader, I'd like to query:
- Did they get renamed
- When was their first contribution (i.e. were they a new contributor when they submitted)
- How many contributions did they make in the 4th month after their first contribution
- How many contributions did they make in the 4th + 5th + 6th month after their first contribution
- How many contributions in lifetime
- When was the most recent contribution
This across projects.
It would be great if we can then calculate the same for the complement to compare with:
- Find non-WLM uploaders from the same months that were new contributors
- Do the same as above