croq-dsl-triton

Official

Optimize Triton GPU kernels with targeted tuning strategies.

AuthorLancerLab
Version1.0.0
Installs0

System Documentation

What problem does it solve?

This Skill provides a domain-specific language (DSL) tuning contract for Triton kernels, enabling users to efficiently optimize GPU performance.

Core Features & Use Cases

  • Environment Validation: Check Triton version compatibility to ensure proper execution.
  • BUILD / RUN Templates: Generate scripts to compile and run Triton kernels with proper syntax and error handling.
  • Performance Profiling: Use NVIDIA NVCC tools to profile GPU kernels for performance bottlenecks.
  • Optimization Ideas: Offer specific strategies like memory and compute bounds adjustments to improve kernel efficiency.
  • Verification & Measurement: Define standardized verification and benchmarking procedures for kernel correctness and speed.

Quick Start

Provide a Triton GPU kernel source file, and run the build script to compile; then execute the run script to generate profile data and analyze GPU performance metrics.

Dependency Matrix

Required Modules

None required

Components

references

💻 Claude Code Installation

Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.

Please help me install this Skill:
Name: croq-dsl-triton
Download link: https://github.com/LancerLab/croqtile-tuner/archive/main.zip#croq-dsl-triton

Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
View Source Repository

Agent Skills Search Helper

Install a tiny helper to your Agent, search and equip skill from 471,000+ vetted skills library on demand.