Oct 27 2020
Thanks for all the help. Yes, the ticket can be closed! :)
Sep 17 2020
@elukey and @klausman thanks!
It works fine for me.
Just to clarify, I run /opt/rocm/bin/rocm-smi --showpids without sudo, and it works just fine. I am assuming this is how it is supposed to work, right?
Aug 3 2020
I tried both the regular and the large kernel, and the issue persisted. In fact, I did this debugging with @JAllemandou, and since the issue persisted, he asked me to raise a ticket!
Jul 24 2020
@elukey just curious if there are any updates on this?
Jul 22 2020
May 14 2020
May 7 2020
@elukey thanks much for your prompt response. This is what I get from 'nvidia-smi' on our EPFL machine. As you can see, it displays the number of processes currently running and their PIDs. However, I am not sure if there is a flag in rocm-smi that displays similar information.
@elukey thanks much for your response. However, none of these monitoring tools give information about the PIDs of the processes or the number of processes currently using the GPU. 'nvidia-smi' provides that, and thus I am assuming its equivalent, 'rocm-smi', should provide it as well.
Is there no reasonable workaround to install 'rocm-smi'?
May 5 2020
Feb 18 2020
Thank you so much for your prompt response and help. I am now able to use "hadoop" services seamlessly. We can safely close this task now.
I am reopening this request, as I am unable to use "hadoop" on the stat machines.
Jan 9 2020
I am currently traveling, and the internet access is not that great. Thus, I will try to log in once I am back in Lausanne. I will comment on this thread if I face any issues.
Dec 19 2019
@leila surely. Thanks once again for your prompt actions!