Project Name: Wikidata Concepts
Purpose: I work as a Data Analyst for WMDE. We are building a project that will track, and provide advanced analytics on, the usage of custom, pre-specified selections of concepts and categories from Wikidata across the Wikimedia projects.
Wikitech Username of requestor: GoranSMilovanovic
Brief description: I need a labs instance where I can install everything I need for analytics in R, and most probably the Anaconda ecosystem in the future. The instance will run a local PostgreSQL server to support application development, and it will access the MySQL replicas to track the usage of Wikidata across >900 projects. I will need to install R, RStudio Server, Shiny Server, PostgreSQL, and many Linux packages. For the Shiny dashboards developed there to be available to the team and community, the instance will probably need a public IP address with at least ports 8787 (RStudio Server default) and 3838 (Shiny Server default) open. In the near future, the instance will also have to connect from RStudio to Spark on our Hadoop cluster (production) using {sparklyr}.
How soon you are hoping this can be fulfilled: Within about one week.