HPC

Basic Slurm Usage for Linux Clusters

Bright Cluster Manager is a comprehensive cluster management solution for managing all types of HPC clusters and server farms, including CPU and GPU clusters, storage and database clusters, and big data Hadoop clusters. Slurm Workload Manager, which is integrated in Bright Cluster Manager, is an…

HPC

Using IPMItool to Manage and Monitor IPMI

Clusters can encounter many software and hardware health issues over extended periods of use. Since clusters are not static systems, there must be constant monitoring to reduce the possibility of wear and tear problems. Fortunately, there are ways to monitor your clusters remotely which makes…

HPC

Managing Drives in MDADM (CentOS, Ubuntu)

Software RAID is becoming very common with Linux based workstation and server environments. The following sections in this blog post will help provide guidance on viewing the current software RAID health (mdadm), as well as removing and re-adding drives for software RAID maintenance as needed…