Please provide the following information.
- Provide a short summary of your proposed post for the Wikimedia Technical Blog. Blog readers will see this as the preview to your post:
What happens when you connect AMD GPUs to Hadoop cluster nodes and try to train distributed machine learning models on top? Over the past year, the Data Engineering, ML Platform, and Research teams have been testing an open-source environment for running machine learning tasks in a distributed fashion. In this blog post, we describe our journey towards scaling up our machine learning computing infrastructure using (almost) fully open-source tools.
- Which topic type does your blog post fall under? See:
https://www.mediawiki.org/wiki/Wikimedia_technical_blog_editorial_guidelines#Outlines_for_topics:
Big picture
An overview post about a consequential launch, migration, new feature, etc.
- Which audience or audiences do you think your post is appropriate for?:
ML engineers and practitioners, tech community members
- Will you need assistance with writing your blog post, or do you already have a draft? If you have a draft, please provide a link here:
We are working on a draft. The workflow will be similar to what is outlined in this presentation: https://docs.google.com/presentation/d/1Ykzf7SRNBKh5wluNKrKLojG1ubh4i64yCn7yyD-36hw/edit#slide=id.p
- Does your post need to be published by a certain date?
We are not in a rush, but we are doing a writing sprint as part of the DSE hackathon this week.
- Do you have an image in mind for the featured image? You can learn more here: https://www.mediawiki.org/wiki/Wikimedia_technical_blog_editorial_guidelines#Images_used_in_your_post
- Do you have any other questions or comments?
Once your request is received, a technical blog admin will review it and reach out to you through Phabricator.