LIKWID Monitoring Stack: A Flexible Framework Enabling Job Specific Performance monitoring for the masses

@article{Rhl2017LIKWIDMS,
  title={LIKWID Monitoring Stack: A Flexible Framework Enabling Job Specific Performance monitoring for the masses},
  author={Thomas R{\"o}hl and Jan Eitzinger and Georg Hager and Gerhard Wellein},
  journal={2017 IEEE International Conference on Cluster Computing (CLUSTER)},
  year={2017},
  pages={781-784}
}
  • Thomas Röhl, Jan Eitzinger, +1 author Gerhard Wellein
  • Published 2017
  • Computer Science
  • 2017 IEEE International Conference on Cluster Computing (CLUSTER)
  • System monitoring is an established tool to measure the utilization and health of HPC systems. Usually system monitoring infrastructures make no connection to job information and do not utilize hardware performance monitoring (HPM) data. To increase the efficient use of HPC systems automatic and continuous performance monitoring of jobs is an essential component. It can help to identify pathological cases, provides instant performance feedback to the users, offers initial data to judge on the… CONTINUE READING

    Figures and Topics from this paper.

    Explore key concepts

    Links to highly relevant papers for key concepts in this paper:

    Citations

    Publications citing this paper.
    SHOWING 1-9 OF 9 CITATIONS

    ClusterCockpit — A web application for job-specific performance monitoring

    VIEW 3 EXCERPTS
    CITES METHODS

    High Performance Cluster Monitoring System

    VIEW 1 EXCERPT
    CITES BACKGROUND

    Power model for resource-elastic applications

    VIEW 1 EXCERPT
    CITES METHODS

    References

    Publications referenced by this paper.
    SHOWING 1-10 OF 14 REFERENCES

    Comprehensive Resource Use Monitoring for HPC Systems with TACC Stats

    VIEW 1 EXCERPT

    NWPerf: a system wide performance monitoring tool for large Linux clusters

    VIEW 1 EXCERPT

    Caliper: Performance Introspection for HPC Software Stacks

    VIEW 1 EXCERPT

    Monitoring High Performance Computing Systems for the End User

    VIEW 1 EXCERPT

    The HOPSA Workflow and Tools

    VIEW 1 EXCERPT