flydsl-kernel-authoring
Official
Author high-performance FlyDSL kernels for AMD GPUs.
Author: ROCm
Version: 1.0.0
Installs: 0
System Documentation
What problem does it solve?
Comprehensive reference for authoring FlyDSL GPU kernels on AMD GPUs. Covers the layout algebra, tiled copy/MMA, buffer ops, scf.for loops, SmemAllocator, autotuning, and common patterns. Use when writing, reviewing, or understanding FlyDSL kernel code.
Core Features & Use Cases
- Clear explanations of layout algebra, tiling, memory movement, and ROCm intrinsics.
- Practical patterns and recipes for element-wise kernels, data movement, and autotuning workflows.
- Use cases: implement custom kernels (GEMM, vector ops) on MI300X/MI350 with explicit layouts.
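As a rough illustration of what "explicit layouts" means in the list above, the sketch below maps a logical tile coordinate to a linear memory offset. It assumes the shape-and-stride formulation common to layout-based kernel DSLs. The function name `layout_offset` and the tile sizes are hypothetical and are not part of FlyDSL's API; consult the skill's own reference for the real layout constructors.

```python
from typing import Sequence


def layout_offset(coord: Sequence[int], shape: Sequence[int], stride: Sequence[int]) -> int:
    """Map a logical coordinate to a linear offset (hypothetical helper, not a FlyDSL call)."""
    if not (len(coord) == len(shape) == len(stride)):
        raise ValueError("coord, shape, and stride must have the same rank")
    for c, extent in zip(coord, shape):
        if not 0 <= c < extent:
            raise IndexError(f"coordinate {c} out of bounds for extent {extent}")
    # The offset is the inner product of the coordinate with the strides.
    return sum(c * d for c, d in zip(coord, stride))


# A 4x8 tile stored row-major: moving down one row skips 8 elements.
row_major = ((4, 8), (8, 1))
# The same 4x8 tile stored column-major: moving right one column skips 4 elements.
col_major = ((4, 8), (1, 4))

print(layout_offset((2, 3), *row_major))  # 2*8 + 3*1 = 19
print(layout_offset((2, 3), *col_major))  # 2*1 + 3*4 = 14
```

The same logical coordinate lands at different offsets depending only on the strides, which is why a kernel written against a layout can change its memory order without touching the indexing logic.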
Quick Start
Read the overview first, then implement a minimal FlyDSL kernel from the provided templates and run it on your MI300X/MI350 GPU.
Dependency Matrix
Required Modules
None required
Components
Standard package
💻 Claude Code Installation
Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.
Please help me install this Skill:
Name: flydsl-kernel-authoring
Download link: https://github.com/ROCm/FlyDSL/archive/main.zip#flydsl-kernel-authoring
Please download this .zip file, extract it, and install it in the .claude/skills/ directory.