flydsl-kernel-authoring

Official

Author high-performance FlyDSL kernels for AMD GPUs.

AuthorROCm
Version1.0.0
Installs0

System Documentation

What problem does it solve?

Comprehensive reference for authoring FlyDSL GPU kernels on AMD GPUs. Covers the layout algebra, tiled copy/MMA, buffer ops, scf.for loops, SmemAllocator, autotuning, and common patterns. Use when writing, reviewing, or understanding FlyDSL kernel code.

Core Features & Use Cases

  • Clear explanations of layout algebra, tiling, memory movement, and ROCm intrinsics.
  • Practical patterns and recipes for element-wise kernels, data movement, and autotuning workflows.
  • Use cases: implement custom kernels (GEMM, vector ops) on MI300X/MI350 with explicit layouts.

Quick Start

Begin by studying the overview and then implement a minimal FlyDSL kernel using the provided templates on your MI300X/MI350 GPU.

Dependency Matrix

Required Modules

None required

Components

Standard package

💻 Claude Code Installation

Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.

Please help me install this Skill:
Name: flydsl-kernel-authoring
Download link: https://github.com/ROCm/FlyDSL/archive/main.zip#flydsl-kernel-authoring

Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
View Source Repository

Agent Skills Search Helper

Install a tiny helper to your Agent, search and equip skill from 471,000+ vetted skills library on demand.