Request creation of "dump-references-processor" VPS project
Closed, Resolved, Public

Description

Project Name: dump-references-processor

Wikitech Usernames of requestors: awight, lilients, thiemowmde, wmde-fisch (please give these users the project admin role)

Purpose: One-time, intensive multi-core processing of enterprise HTML dumps. (We'll also want to run again in 1-2 years, but that can be on a renewed project allocation.)

Brief description:
Requested hardware allocation:

  • 16 VCPUs
  • 16GB RAM
  • 10+GB attached volume storage

For the first task, we'll allocate all of this in one instance.

Planned configuration:

  • We will NFS-mount the Enterprise HTML dumps data, so we need to be in the appropriate access group for that.
  • We'll install Elixir, with the Apache-2.0 license. For simplicity and to get the latest improvements, we'll compile from source code. A major dependency is Erlang + OTP, also under Apache-2.0.
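
As a rough illustration of the multi-core workload this allocation is meant for, here is a minimal sketch in Python (not the Elixir script we plan to write). It assumes the dumps can be read as newline-delimited JSON with one article per line; the `name` and `html` field names, the `REF_MARKER` string, and all function names are hypothetical placeholders rather than the actual dump schema or our real processing logic.

```python
import json
from concurrent.futures import ProcessPoolExecutor
from functools import partial
from typing import Callable, Dict, Iterable, Tuple

# Hypothetical marker for a reference in the article HTML; the real
# processing script may detect references differently.
REF_MARKER = 'class="reference"'


def count_refs(line: str) -> Tuple[str, int]:
    """Parse one NDJSON record and count reference markers in its HTML.

    The "name" and "html" field names are assumptions for illustration.
    """
    record = json.loads(line)
    return record.get("name", ""), record.get("html", "").count(REF_MARKER)


def count_all(lines: Iterable[str], map_fn: Callable = map) -> Dict[str, int]:
    """Apply count_refs to every line using the given map implementation."""
    return dict(map_fn(count_refs, lines))


def count_all_parallel(path: str, workers: int = 16) -> Dict[str, int]:
    """Fan the lines of one NDJSON file out across worker processes."""
    with open(path, encoding="utf-8") as handle, \
            ProcessPoolExecutor(max_workers=workers) as pool:
        # chunksize batches lines to reduce inter-process overhead.
        return count_all(handle, partial(pool.map, chunksize=256))
```

The default of 16 workers simply mirrors the requested VCPU count. Keeping the per-record function pure means the same code runs serially with the built-in `map` for testing and in parallel with a process pool on the full dump.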

How soon you are hoping this can be fulfilled: Within a month would be ideal, as we expect to finish writing our processing script in this timeframe. Within a quarter is also acceptable, but that starts to impact our goal of collecting baseline metrics in advance of our feature work.

Event Timeline

Per https://wikitech.wikimedia.org/wiki/Help:Cloud_VPS_project#Reviews_of_Cloud_VPS_Project_requests:

The majority of project requests are approved, but there are some things which will cause further discussion:
...
"Umbrella" projects with a broad scope, such as all the work to be done by an engineering team or a large problem space.

"Umbrella" projects with broad scopes are difficult to track over time because of organizational changes and lack of continuity in ownership.

Thanks for pointing this out! We're happy to destroy the project when the processing task is completed, in that case. My expectation of continuing to reuse the allocation was based on real-world observation of other projects I'm involved in, but it makes sense that there's a trade-off between hardware resources and staff overhead...

I'll update the task to reflect this understanding.

Thank you for being understanding. With the scope of the project reduced would there be a more descriptive or specific name that we could give the Cloud VPS project? It sounds like the focus of work will be HTML dumps somehow?

awight renamed this task from Request creation of "wmde-technical-wishes" VPS project to Request creation of "dump-references-processor" VPS project. Mar 23 2023, 9:18 AM
awight updated the task description.

Hehe I was avoiding that because my team will have my head, but let's do it anyway. Updated!

+1 from me as a Cloud VPS admin to proceed with creating this project

aborrero changed the task status from Open to In Progress. Mar 29 2023, 3:41 PM
aborrero triaged this task as Medium priority.
aborrero moved this task from Inbox to Approved on the Cloud-VPS (Project-requests) board.
aborrero subscribed.

+1

Andrew claimed this task.
Andrew subscribed.

I've created this project with the requested quotas. Awight, I didn't add all of your suggested members but you should be able to do that via Horizon.

Update: now I added them!

Thanks, the project configuration and quotas are right, and I was able to create the requested instance!

Hi, folks! Is this project still in use?

Either way, please update https://wikitech.wikimedia.org/wiki/News/Cloud_VPS_2024_Purge accordingly. Thank you!