Request creation of wikicommunityhealth VPS project (Community Health Metrics: Understanding Editor Drop-off)
Closed, Resolved (Public)

Description

Project Name: wikicommunityhealth

Wikitech Usernames of requestors: CristianCantoro, marcmiquel, sdivad, elaragon

Purpose: this server will be used for the project Community Health Metrics: Understanding Editor Drop-off. We need to parse dumps and retrieve data from multiple Wikipedias.

Brief description: the project will retrieve, process, and analyze several datasets.

  • the server will run a collection of Python scripts that use several packages to analyze data from MediaWiki history dumps, Wikipedia dumps, and other datasets. The code will be hosted under this organization on GitHub: WikiCommunityHealth
  • the server will host a website to show the results.

According to the list of instance types, we estimate that we will need a g2.cores8.ram16.disk160 instance.

How soon you are hoping this can be fulfilled: this month would be great. For development, we are using machines provided by Eurecat - Centre Tecnològic de Catalunya and by prof. Montresor at the University of Trento.

Event Timeline

Andrew subscribed.

Approved -- I'll set this up soon.

I've created the project and added CristianCantoro, marcmiquel, and elaragon.

I was not able to locate 'sdivad' as a user, but any existing project admins can add new members via Horizon.

I managed to add him (davids). Thanks.

Looking forward to seeing the instance.

I have created a VPS instance within this project; its internal IP address is 172.16.3.146.

To SSH into this server, I first have to go through the bastion with ssh -A cristiancantoro@bastion.wmflabs.org, and then SSH into the server with ssh 172.16.3.146.

I don't understand why this command is logging me in directly:

ssh -A -o "ProxyCommand=ssh -A cristiancantoro@bastion.wmflabs.org -W %h:%p" cristiancantoro@172.16.1.136

Any idea of why this is necessary?

Thanks for your help!

Any idea of why this is necessary?

I received an answer via IRC: use ProxyJump, as described here: https://wikitech.wikimedia.org/wiki/Help:Accessing_Cloud_VPS_instances#ProxyJump_(recommended)
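
For anyone landing on this task later, a ProxyJump setup might look roughly like the fragment below for ~/.ssh/config. This is a minimal sketch and not necessarily the exact configuration from the linked page: the user name and bastion host are taken from the commands earlier in this thread, while the Host pattern 172.16.* is an assumption chosen to match the internal addresses mentioned above. ProxyJump requires OpenSSH 7.3 or later.

```
# ~/.ssh/config (sketch; adjust the user name and host pattern to your own setup)

# Assumption: instances are reached by internal IP address in the 172.16.* range.
Host 172.16.*
    User cristiancantoro
    # Tunnel through the bastion in a single step instead of logging in twice.
    ProxyJump cristiancantoro@bastion.wmflabs.org
```

With a fragment like this in place, ssh 172.16.3.146 should go through the bastion automatically. The one-off equivalent on the command line is ssh -J cristiancantoro@bastion.wmflabs.org cristiancantoro@172.16.3.146.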