HPCKP'16 Conferences

 

Abstract:

As the number of CPU cores in the compute nodes of high-performance clusters increases, proper monitoring tools become essential for understanding the resource usage and behavior of the applications running in the cluster.

This work presents a new approach to near-real-time monitoring that uses the Slurm profiling plugin to display resource usage information for each process running in the cluster. This data improves understanding of the running applications and can help highlight application-related issues to the user.

Authors:

Speaker: Carlos Fenoy Garcia

Date: April 21st, 2016

Location: Faculty of Physics (UB), C/ Martí i Franqués 1, 08028 Barcelona, Spain.

SLIDES