vector-triton-ascend-ops-optimizer

Official

Maximizes Ascend Triton vector op performance.

AuthorAscend
Version1.0.0
Installs0

System Documentation

What problem does it solve?

This Skill provides expert-level optimization for Triton vector operators on Ascend NPU, enabling deeper performance improvements for single-operator workloads.

Core Features & Use Cases

  • Single-operator optimization: Focused tuning of a single Triton op to meet or exceed performance targets.
  • Profiling and validation: Baseline performance measurement, correctness verification, and iterative improvements with strict guardrails.
  • Hardware-aware optimization: Techniques such as UB capacity planning, masking, double buffering, and vector-unit utilization to maximize throughput on Ascend NPU.

Quick Start

Baseline the target Triton op, then iteratively apply targeted kernel optimizations to meet or exceed the performance uplift.

Dependency Matrix

Required Modules

None required

Components

Standard package

💻 Claude Code Installation

Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.

Please help me install this Skill:
Name: vector-triton-ascend-ops-optimizer
Download link: https://github.com/Ascend/agent-skills/archive/main.zip#vector-triton-ascend-ops-optimizer

Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
View Source Repository

Agent Skills Search Helper

Install a tiny helper to your Agent, search and equip skill from 471,000+ vetted skills library on demand.