Page MenuHomePhabricator

Goal 2: People outside the ML team can ssh into an ml-lab machine, run a Jupyter Notebook, and run PyTorch powered by a GPU.
Open, Needs TriagePublic

Event Timeline

Update:

  • Waiting for ml-lab machines to be delivered to the eqiad data center.

Update:

  • machines are racked but not set up. Will set up one first to figure out disk layout and then the other one. Then will release to the research team

Update: Deciding version of ROCm (and thus, Tensorflow or Pytorch), then packaging them on WMF infra.

  • Working on bundling things on the lab machines - We should be working with ROCm versions >= 6.0
Miriam subscribed.

Hello! I just added this as child task to our hypothesis work, as suggested by the objective steering committee. I hope it works for you.

Gehel moved this task from Scratch to Incoming on the Data-Platform-SRE board.