vector-triton-ascend-ops-optimizer
OfficialMaximizes Ascend Triton vector op performance.
Software Engineering#memory-bandwidth#triton#performance-optimization#ascend-npu#kernel-tuning#vector-ops
AuthorAscend
Version1.0.0
Installs0
System Documentation
What problem does it solve?
This Skill provides expert-level optimization for Triton vector operators on Ascend NPU, enabling deeper performance improvements for single-operator workloads.
Core Features & Use Cases
- Single-operator optimization: Focused tuning of a single Triton op to meet or exceed performance targets.
- Profiling and validation: Baseline performance measurement, correctness verification, and iterative improvements with strict guardrails.
- Hardware-aware optimization: Techniques such as UB capacity planning, masking, double buffering, and vector-unit utilization to maximize throughput on Ascend NPU.
Quick Start
Baseline the target Triton op, then iteratively apply targeted kernel optimizations to meet or exceed the performance uplift.
Dependency Matrix
Required Modules
None requiredComponents
Standard package💻 Claude Code Installation
Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.
Please help me install this Skill: Name: vector-triton-ascend-ops-optimizer Download link: https://github.com/Ascend/agent-skills/archive/main.zip#vector-triton-ascend-ops-optimizer Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
Agent Skills Search Helper
Install a tiny helper to your Agent, search and equip skill from 471,000+ vetted skills library on demand.