choreo-kernel-examples
Official
Comprehensive Choreo GPU kernel pattern references.
Category: Content & Communication
Tags: cuda, gpu kernels, kernel optimization, performance patterns, warp specialization, pipeline staging
Author: LancerLab
Version: 1.0.0
Installs: 0
System Documentation
What problem does it solve?
It provides a curated collection of reference GPU kernel implementations in the Choreo DSL, enabling developers to learn and replicate best practices in kernel design.
Core Features & Use Cases
- Kernel Pattern Exploration: Study diverse GPU kernel structures, tiling strategies, warp specialization, and pipeline staging.
- Performance Tuning Reference: Access example implementations optimized for different architectures and problem sizes.
- Use Case: Engineers can quickly adapt kernel templates for their custom GPU compute tasks, improving development speed and correctness.
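As a rough illustration of what a "tiling strategy" means in the list above, here is a minimal CPU-side sketch in plain C++. It is not Choreo DSL and is not taken from the skill's reference files; the function name `matmul_tiled` and the tile size `kTile` are hypothetical choices made for this example. The idea it shows is the one GPU kernels rely on: work is split into small blocks so that each block's operands stay in fast memory while they are reused.

```cpp
#include <algorithm>
#include <cstddef>
#include <vector>

// Hypothetical tile edge length (an assumption for this sketch).
// Real kernels tune this per architecture and problem size.
constexpr std::size_t kTile = 4;

// Computes C = A * B for row-major matrices:
// A is m x k, B is k x n, and the returned C is m x n.
std::vector<float> matmul_tiled(const std::vector<float>& a,
                                const std::vector<float>& b,
                                std::size_t m, std::size_t k, std::size_t n) {
    std::vector<float> c(m * n, 0.0f);

    // Outer loops walk over tiles of the output (i0, j0)
    // and tiles of the reduction dimension (p0).
    for (std::size_t i0 = 0; i0 < m; i0 += kTile) {
        for (std::size_t j0 = 0; j0 < n; j0 += kTile) {
            for (std::size_t p0 = 0; p0 < k; p0 += kTile) {
                // Clamp tile bounds so sizes that are not
                // multiples of kTile are handled correctly.
                const std::size_t i1 = std::min(i0 + kTile, m);
                const std::size_t j1 = std::min(j0 + kTile, n);
                const std::size_t p1 = std::min(p0 + kTile, k);

                // Inner loops accumulate one tile's partial products.
                for (std::size_t i = i0; i < i1; ++i) {
                    for (std::size_t p = p0; p < p1; ++p) {
                        const float a_ip = a[i * k + p];
                        for (std::size_t j = j0; j < j1; ++j) {
                            c[i * n + j] += a_ip * b[p * n + j];
                        }
                    }
                }
            }
        }
    }
    return c;
}
```

On a GPU, the outer tile loops typically map to thread blocks and the inner tile is staged through shared memory; this sketch only shows the loop structure, not that mapping.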
Quick Start
To explore the example kernels, review the files in the skills/cursor-choreo-kernel-examples directory and run the build scripts to generate and analyze the CUDA code.
Dependency Matrix
Required Modules
None required
Components
references
💻 Claude Code Installation
Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.
Please help me install this Skill:
Name: choreo-kernel-examples
Download link: https://github.com/LancerLab/croqtile-tuner/archive/main.zip#choreo-kernel-examples
Please download this .zip file, extract it, and install it in the .claude/skills/ directory.