Status | Subtype | Assigned | Task | ||
---|---|---|---|---|---|
Open | diego | T377159 [SDS 1.2.1 B] Test existing AI models for internal use-cases | |||
Open | None | T371396 Goal 2: People outside the ML team can ssh into an ml-lab machine, run a Jupyter Notebook, and run PyTorch powered by a GPU. | |||
Resolved | klausman | T375076 ml-lab: add ROCM 6.1 packages to WMF apt repo | |||
Resolved | klausman | T376380 ml-lab: create puppet role to install ROCm packages and make the machine accessible to people outside ML Team | |||
Resolved | isarantopoulos | T377574 [ml-lab] Use a (jupyter) notebook and load a LLM from huggingface | |||
Unknown Object (Task) | |||||
Open | None | T381394 Q2:install SSD (hot swap additions) to ml-lab100[12] | |||
Resolved | klausman | T376974 ml-lab should have documentation |
Event Timeline
Comment Actions
Update:
- machines are racked but not set up. Will set up one first to figure out disk layout and then the other one. Then will release to the research team
Comment Actions
Update: Deciding version of ROCm (and thus, Tensorflow or Pytorch), then packaging them on WMF infra.
Comment Actions
- Working on bundling things on the lab machines - We should be working with ROCm versions >= 6.0
Comment Actions
Hello! I just added this as child task to our hypothesis work, as suggested by the objective steering committee. I hope it works for you.