Project Name: wikicommunityhealth
Wikitech Usernames of requestors: CristianCantoro, marcmiquel, sdivad, elaragon
Purpose: this server will support the project Community Health Metrics: Understanding Editor Drop-off. We need to parse dumps and retrieve data from multiple Wikipedias.
Brief description: the project will retrieve, process, and analyze several datasets.
- the server will run a collection of Python scripts that use several packages to analyze data from MediaWiki history dumps, Wikipedia dumps, and other datasets. The code will be hosted on GitHub under the organization WikiCommunityHealth.
- the server will host a website to show the results.
According to the list of instance types, we estimate that we will need a g2.cores8.ram16.disk160 instance.
How soon you are hoping this can be fulfilled: this month would be great. For development, we are using machines provided by Eurecat - Centre Tecnològic de Catalunya and by prof. Montresor at the University of Trento.