Training Material

From CMU -- Language Technologies Institute -- HPC Wiki
Revision as of 13:57, 30 June 2023 by Yifengw2 (talk | contribs)
Jump to navigation Jump to search

Connecting to Cluster

Each team will be assigned with a temporary Andrew id. You will need the Andrew id to access Cluster.

Follow the Instructions to connect to Cluster

Instruction with SLURM

SLURM (Simple Linux Utility for Resource Management) is a job scheduler and resource management system commonly used in high-performance computing (HPC) environments. You will need to get familiar with basic usage of SLURM in order to interact with the Cluster

Refer to Beginner's guide to the SLURM workload manager for more details.

Monitoring

To monitor cluster activities such as jobs, compute resources, and disk usage, learn about essential techniques and tools for effective cluster monitoring here: Monitoring.

LLM Deployment Demo