Product Overview
UCit have packaged its HPC and machine learning expertise in a software tool which assists HPC system administrators to be even more effective. Analyze-IT provides an extensible platform that presents the state of your HPC infrastructure through simple and comprehensible dashboards.Whether you need high level KPIs to report the cluster usage, or low level information to track down the origin of an issue; Analyze-IT gives you the right level of details.
Analyze-IT provides hundreds of KPIs and supports all major job schedulers. Analyze-IT Standard Edition contains the following features: Job Status (Number of jobs and core-hours consumed per job status),Load (Allocated cores through time, and number of jobs allocated per node), Throughput (Submission frequency, slowdown, interarrival), Resources (Number of cores & core-hours, memory and nodes consumed by the jobs), Consumers (Grouping of jobs per Group, User, JobName, Queue/Partition, QoS, Parallel Environment. For each, details about number of cores & core-hours, execution & waiting time, slowdown), Concurrent users (Active users per period), Congestion/Contention (provides a day-to-day update of the cluster status (Optimal, Acceptable, Contention, Congestion) based on resources needs and delivered computing power, and jobs life cycle for each day. It helps to identify if the cluster is correctly sized and configured, or if upgrades should be performed or if additional/external resources could be beneficial)
Version
By
UCITVideo
Categories
Operating System
Linux/Unix, Amazon Linux Amazon Linux 2
Delivery Methods