badge icon

This article was automatically translated from the original Turkish version.

Article

High Performance Computing (HPC) systems play a critical role in fields such as scientific research simulations and data-intensive engineering applications. Efficient utilization of computational resources in these systems requires advanced job schedulers and resource managers. Slurm (Simple Linux Utility for Resource Management) is a leading open-source job scheduling and resource management system in this domain.

What is Slurm?

Slurm is an open-source resource management and job scheduling software originally developed by Hewlett-Packard and currently maintained by SchedMD. Slurm manages job execution on Linux-based cluster systems. It is used to fairly and efficiently queue schedule and allocate resources for users on supercomputers consisting of thousands of nodes.

Although designed for large-scale systems Slurm is also widely used in medium-sized research clusters.

Core Components

The Slurm architecture consists of the following main components:

  • slurmctld (controller): The central manager of the cluster that monitors job queues and nodes.
  • slurmd (daemon): Runs on each compute node and handles job assignment and monitoring.
  • slurmdbd: An optional database server that stores job history user statistics and accounting data.
  • scontrol sbatch squeue: Command-line tools that enable job submission queue monitoring and management.

Use Cases

  • Academic Research Centers: Computational jobs in universities for physics chemistry biology and engineering research are coordinated through Slurm.
  • Government Supercomputing Centers: Preferred for managing systems operating at teraflop and petaflop levels.
  • Industrial Simulations: Complex physical processes in the automotive aerospace and energy sectors are executed via Slurm.

Job Scheduling and Queuing

Slurm supports various scheduling policies to ensure fair resource sharing among jobs. For example jobs are ordered using strategies such as Priority Backfill and FairShare. Users can submit jobs using commands like sbatch srun and salloc and monitor job status with squeue and sacct.

Slurm supports multi-core and multi-node jobs and facilitates the coordinated execution of distributed workloads such as those using MPI (Message Passing Interface).

Customization and Extensibility

Slurm is extensible through plugins. For example:

  • User authentication plugins
  • Custom scheduling algorithms
  • Energy consumption monitoring modules

In addition hardware resources can be precisely defined through Slurm configuration files such as slurm.conf gres.conf and cgroup.conf.

Performance and Scalability

Slurm is successfully deployed on some of the world’s largest supercomputers including systems listed in the TOP500. It has the performance capability to manage tens of thousands of nodes and millions of jobs simultaneously.

Comparison of Slurm with Other Systems

Slurm is a powerful open-source and flexible platform used to manage job execution and resource planning in high performance computing systems. It is widely preferred in academic and institutional research environments and serves as the default system in the majority of supercomputing centers worldwide.

Bibliographies

National Energy Research Scientific Computing Center (NERSC). *NERSC Slurm User Guide.* DOE Office of Science. Accessed May 5, 2025. https://docs.nersc.gov/jobs/slurm/.

SchedMD LLC. *Slurm Workload Manager Documentation.* Accessed May 5, 2025. https://slurm.schedmd.com/documentation.html.

TOP500.org. "TOP500 List – Supercomputers Using Slurm." Accessed May 5, 2025. https://www.top500.org/statistics/list/.

Texas Advanced Computing Center (TACC). *Using Slurm on HPC Clusters.* Accessed May 5, 2025. https://portal.tacc.utexas.edu/user-guides/slurm.

Yoo, Andy B., Marty Jette, and Mark Grondona. “Slurm: Simple Linux Utility for Resource Management.” *Job Scheduling Strategies for Parallel Processing*, edited by Dror G. Feitelson, Larry Rudolph, and Uwe Schwiegelshohn, 44–60. Berlin: Springer, 2003. .

Author Information

Avatar
AuthorRamazan Cüneyt KüçükDecember 8, 2025 at 2:42 PM

Tags

Discussions

No Discussion Added Yet

Start discussion for "Slurm" article

View Discussions

Contents

  • What is Slurm?

  • Core Components

  • Use Cases

  • Job Scheduling and Queuing

  • Customization and Extensibility

  • Performance and Scalability

  • Comparison of Slurm with Other Systems

Ask to Küre