exec-slurm-compile

Community

Compile TensorRT-LLM remotely on SLURM.

Authoryo-steven
Version1.0.0
Installs0

System Documentation

What problem does it solve?

This Skill streamlines compiling TensorRT-LLM on a remote SLURM cluster by coordinating container preparation, job submission, live monitoring, and build verification so you don’t need to compile directly on your local machine.

Core Features & Use Cases

  • SLURM batch compilation workflow: Submits a containerized compilation job using sbatch and runs the build inside an enroot container.
  • enroot container pre-dumping: Optionally pre-imports the Docker image into a .sqsh image via a dedicated SLURM job to reduce compile-time download overhead.
  • Live monitoring and log-based verification: Actively polls SLURM status, tails the srun log, and validates success via SLURM exit code and build log messages.
  • Use Case: You need to build TensorRT-LLM for a specific GPU architecture on a cluster partition with the right TensorRT environment packaged in an NVIDIA container.

Quick Start

Use the enroot-import step to generate a .sqsh from the TensorRT-LLM Docker image, then submit the compilation with sbatch using that .sqsh, your TensorRT-LLM repo path, and your SLURM partition and account.

Dependency Matrix

Required Modules

None required

Components

scripts

💻 Claude Code Installation

Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.

Please help me install this Skill:
Name: exec-slurm-compile
Download link: https://github.com/yo-steven/skills-exploration-20260522/archive/main.zip#exec-slurm-compile

Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
View Source Repository

Agent Skills Search Helper

Install a tiny helper to your Agent, search and equip skill from 471,000+ vetted skills library on demand.