triton-cuda-basics

Community

Master Triton CUDA basics for efficient kernels.

Authorxchang1121
Version1.0.0
Installs0

System Documentation

What problem does it solve?

此技能提供对 Triton CUDA 编程的系统入门,聚焦核心概念、网格/块结构、内核模式以及常用装饰器与代码模式,帮助开发者快速掌握高效的 GPU 内核实现。

Core Features & Use Cases

  • 明确讲解 Triton 的内核定义、program_id 的使用、全局/共享/寄存器内存以及内存访问模式。
  • 展示五步内核结构、边界处理、调试流程和 autotune 的应用场景,提供可复用的代码模板。
  • 场景示例包括自定义算子实现、GPU 上数据处理与并行化优化等实际任务。

Quick Start

Build and run a minimal Triton kernel following the five-step pattern shown in the guide.

Dependency Matrix

Required Modules

None required

Components

Standard package

💻 Claude Code Installation

Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.

Please help me install this Skill:
Name: triton-cuda-basics
Download link: https://github.com/xchang1121/AutoResearch-CC-hook/archive/main.zip#triton-cuda-basics

Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
View Source Repository

Agent Skills Search Helper

Install a tiny helper to your Agent, search and equip skill from 471,000+ vetted skills library on demand.