tilelang-developer
Community
Design and optimize fast TileLang kernels.
Author: yzlnew
Version: 1.0.0
Installs: 0
System Documentation
What problem does it solve?
Designs, optimizes, and debugs high-performance TileLang GPU kernels for AI workloads.
Core Features & Use Cases
- End-to-end kernel scaffolding, memory management, and pipelining patterns for CUDA/HIP targets.
- Cross-vendor portability (NVIDIA, AMD, Ascend) and support for non-standard operators like DeepSeek MLA.
- Debugging practices, performance heuristics, and validation strategies with example workflows.
Quick Start
Design, implement, and benchmark a minimal TileLang kernel, starting with a simple 32x32 GEMM on a CUDA device.
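Before writing the kernel itself, it can help to have a host-side reference to validate against. The sketch below is plain Python, not TileLang code, and it does not use any TileLang API. It models the block-tiled accumulation pattern that a GEMM kernel typically follows (one accumulator per output tile, stepped along K) and checks it against a naive triple loop on a 32x32 problem. The function names and the tile sizes (`block_M`, `block_N`, `block_K`) are illustrative assumptions, not part of this Skill.

```python
"""Pure-Python model of a block-tiled GEMM (illustrative; not TileLang code)."""


def naive_gemm(A, B):
    """Reference C = A @ B using the textbook triple loop."""
    M, K, N = len(A), len(B), len(B[0])
    return [[sum(A[i][k] * B[k][j] for k in range(K)) for j in range(N)]
            for i in range(M)]


def tiled_gemm(A, B, block_M=16, block_N=16, block_K=8):
    """Compute C = A @ B one (block_M x block_N) output tile at a time."""
    M, K, N = len(A), len(B), len(B[0])
    # Assumption: every dimension divides evenly into tiles (no edge handling).
    assert M % block_M == 0 and N % block_N == 0 and K % block_K == 0
    C = [[0.0] * N for _ in range(M)]
    for by in range(0, M, block_M):        # tile row (plays the role of a block index)
        for bx in range(0, N, block_N):    # tile column
            # Per-tile accumulator, analogous to a register/fragment buffer.
            acc = [[0.0] * block_N for _ in range(block_M)]
            for ko in range(0, K, block_K):  # march along the reduction axis
                for i in range(block_M):
                    a_row = A[by + i][ko:ko + block_K]
                    for j in range(block_N):
                        for k in range(block_K):
                            acc[i][j] += a_row[k] * B[ko + k][bx + j]
            # Write the finished tile back to the output matrix.
            for i in range(block_M):
                C[by + i][bx:bx + block_N] = acc[i]
    return C


def max_abs_diff(X, Y):
    """Largest element-wise absolute difference between two matrices."""
    return max(abs(x - y) for rx, ry in zip(X, Y) for x, y in zip(rx, ry))


if __name__ == "__main__":
    n = 32
    A = [[float((i * 3 + j) % 7) for j in range(n)] for i in range(n)]
    B = [[float((i + 2 * j) % 5) for j in range(n)] for i in range(n)]
    err = max_abs_diff(tiled_gemm(A, B), naive_gemm(A, B))
    print("max abs error:", err)
```

The same comparison (kernel output against a trusted reference, within a tolerance) is the usual first validation step once a real GPU kernel is in place. With low-precision inputs such as float16, a looser tolerance than the one used here would be needed.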
Dependency Matrix
Required Modules
None required
Components
Standard package
💻 Claude Code Installation
Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.
Please help me install this Skill:
Name: tilelang-developer
Download link: https://github.com/yzlnew/infra-skills/archive/main.zip#tilelang-developer
Please download this .zip file, extract it, and install it in the .claude/skills/ directory.