croq-dsl-triton
Official
Optimize Triton GPU kernels with targeted tuning strategies.
Author: LancerLab
Version: 1.0.0
Installs: 0
System Documentation
What problem does it solve?
This Skill provides a tuning contract for the Triton domain-specific language (DSL), giving users a consistent procedure for optimizing the GPU performance of Triton kernels.
Core Features & Use Cases
- Environment Validation: Check that the installed Triton version is compatible before building or running kernels.
- BUILD / RUN Templates: Generate scripts that compile and run Triton kernels with correct syntax and error handling.
- Performance Profiling: Use NVIDIA profiling tools (for example, Nsight Compute) to find performance bottlenecks in GPU kernels.
- Optimization Ideas: Offer specific strategies for memory-bound and compute-bound kernels to improve efficiency.
- Verification & Measurement: Define standardized verification and benchmarking procedures for kernel correctness and speed.
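As an illustration of the Environment Validation step, here is a minimal Python sketch of a version check. It is not taken from the Skill itself: the function names `parse_version` and `check_triton` and the minimum version `2.1.0` are hypothetical, chosen only for the example. The only assumption about the environment is that Triton, if present, is installed as the `triton` package.

```python
from importlib import metadata


def parse_version(text):
    """Turn '3.1.0' or '2.1.0+cu121' into a tuple of ints, ignoring suffixes."""
    parts = []
    for piece in text.split("+")[0].split("."):
        digits = ""
        for ch in piece:
            if ch.isdigit():
                digits += ch
            else:
                break  # stop at the first non-digit, e.g. 'rc1'
        if not digits:
            break
        parts.append(int(digits))
    return tuple(parts)


def check_triton(min_version="2.1.0", installed=None):
    """Return (ok, message).

    `min_version` is a hypothetical threshold for this example only.
    `installed` can be passed explicitly, which makes the check testable
    on machines without Triton.
    """
    if installed is None:
        try:
            installed = metadata.version("triton")
        except metadata.PackageNotFoundError:
            return False, "triton is not installed"
    ok = parse_version(installed) >= parse_version(min_version)
    return ok, f"triton {installed} (required >= {min_version})"
```

Calling `check_triton()` with no arguments inspects the current environment; a build script could stop early when the first element of the result is `False`.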
Quick Start
Provide a Triton GPU kernel source file and run the build script to compile it. Then execute the run script to generate profile data, and analyze the resulting GPU performance metrics.
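The build-then-run flow can be sketched as a small Python driver. This is only an assumption about how such a driver might look: the Skill's actual script names and arguments are not given here, so the commands are passed in by the caller, and `run_step` and `build_then_run` are hypothetical helper names.

```python
import subprocess
import sys


def run_step(name, command):
    """Run one step and return its stdout; raise with stderr if it fails."""
    result = subprocess.run(command, capture_output=True, text=True)
    if result.returncode != 0:
        raise RuntimeError(
            f"{name} failed ({result.returncode}): {result.stderr.strip()}"
        )
    return result.stdout


def build_then_run(build_cmd, run_cmd):
    """Run the build command first; run the kernel only if the build succeeded."""
    build_out = run_step("build", build_cmd)
    run_out = run_step("run", run_cmd)
    return build_out, run_out


if __name__ == "__main__":
    # Placeholder commands standing in for the real build and run scripts.
    outputs = build_then_run(
        [sys.executable, "-c", "print('built')"],
        [sys.executable, "-c", "print('ran')"],
    )
    print(outputs)
```

Because a failing build raises before the run step starts, profile data is never collected from a stale or missing kernel binary.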
Dependency Matrix
Required Modules
None required
Components
references
💻 Claude Code Installation
Recommended: Let Claude install it automatically. Copy the text below and paste it into Claude Code.
Please help me install this Skill: Name: croq-dsl-triton Download link: https://github.com/LancerLab/croqtile-tuner/archive/main.zip#croq-dsl-triton Please download this .zip file, extract it, and install it in the .claude/skills/ directory.