tilelang-developer

Community

Design and optimize fast TileLang kernels.

Author: yzlnew
Version: 1.0.0
Installs: 0

System Documentation

What problem does it solve?

Designs, optimizes, and debugs high-performance TileLang GPU kernels for AI workloads.

Core Features & Use Cases

  • End-to-end kernel scaffolding, memory management, and pipelining patterns for CUDA/HIP targets.
  • Cross-vendor portability (NVIDIA, AMD, Ascend) and support for non-standard operators like DeepSeek MLA.
  • Debugging practices, performance heuristics, and validation strategies with example workflows.

Quick Start

Design, implement, and benchmark a minimal TileLang kernel by executing a simple 32x32 GEMM on a CUDA device.
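As a companion to the 32x32 GEMM above, the following is a minimal CPU reference sketch of the tiling pattern such a kernel would typically express, written in plain Python so it runs without a GPU. It deliberately does not use the TileLang API. The function names (`tiled_gemm`, `naive_gemm`, `max_abs_diff`) and the block sizes are illustrative assumptions, not part of the skill or of TileLang itself.

```python
import random


def naive_gemm(a, b, m, n, k):
    """Straightforward triple-loop C = A @ B, used as the ground truth."""
    return [[sum(a[i][p] * b[p][j] for p in range(k)) for j in range(n)]
            for i in range(m)]


def tiled_gemm(a, b, m, n, k, block_m=16, block_n=16, block_k=8):
    """Compute C = A @ B one output tile at a time.

    Each (m0, n0) pair plays the role of one GPU thread block; `acc` stands in
    for the per-block accumulator that a kernel would keep in registers.
    Block sizes here are hypothetical defaults chosen for illustration.
    """
    c = [[0.0] * n for _ in range(m)]
    for m0 in range(0, m, block_m):
        for n0 in range(0, n, block_n):
            rows = min(block_m, m - m0)
            cols = min(block_n, n - n0)
            acc = [[0.0] * cols for _ in range(rows)]
            # March along the K dimension one slab at a time.
            for k0 in range(0, k, block_k):
                k_end = min(k0 + block_k, k)
                for i in range(rows):
                    for j in range(cols):
                        partial = 0.0
                        for p in range(k0, k_end):
                            partial += a[m0 + i][p] * b[p][n0 + j]
                        acc[i][j] += partial
            # Write the finished tile back to the output matrix.
            for i in range(rows):
                for j in range(cols):
                    c[m0 + i][n0 + j] = acc[i][j]
    return c


def max_abs_diff(x, y):
    """Largest element-wise absolute difference between two matrices."""
    return max(abs(xv - yv) for xr, yr in zip(x, y) for xv, yv in zip(xr, yr))


if __name__ == "__main__":
    random.seed(0)
    M = N = K = 32
    A = [[random.uniform(-1, 1) for _ in range(K)] for _ in range(M)]
    B = [[random.uniform(-1, 1) for _ in range(N)] for _ in range(K)]
    diff = max_abs_diff(tiled_gemm(A, B, M, N, K), naive_gemm(A, B, M, N, K))
    print("max abs diff vs. naive reference:", diff)
```

The same comparison can serve as a validation step for a real kernel: run the kernel, compare its output against the reference, and choose a tolerance that matches the data type (half-precision inputs generally need a looser bound than the double-precision values used here).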

Dependency Matrix

Required Modules

None required

Components

Standard package

💻 Claude Code Installation

Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.

Please help me install this Skill:
Name: tilelang-developer
Download link: https://github.com/yzlnew/infra-skills/archive/main.zip#tilelang-developer

Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
