pypto-optimization

Community

Tune PyPTO optimizations for peak tile performance

Author: xchang1121
Version: 1.0.0
Installs: 0

System Documentation

What problem does it solve?

PyPTO optimization rules provide a structured approach to tuning tensor operation performance: you select tile shapes, loop counts, and reduction-axis strategies to maximize efficiency while respecting hardware constraints. The rules also cover when to merge reduction axes and how to sequence experiments to find the best balance between throughput and latency.

Core Features & Use Cases

  • Rule-based tile shaping and auto-tiling guidance for common operators (softmax, logsoftmax, norm, reduction, loss).
  • Loop-count sweep strategies with middle-ground optimization to balance task granularity and scheduling overhead.
  • Guidance on merging consecutive reduction axes to reduce intermediate tensors and improve kernel performance.
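
To illustrate the last point, here is a minimal sketch in plain Python (it does not use the PyPTO API) of why merging two consecutive reduction axes can help. The function names and the flat row-major layout are assumptions made for this example only. Reducing axis by axis materializes an intermediate of shape (b, m); treating the two axes as one axis of length m * n gives the same result in a single pass.

```python
def reduce_two_steps(data, shape):
    """Sum over the last two axes one at a time, building an intermediate."""
    b, m, n = shape
    # Intermediate of shape (b, m): reduce over the innermost axis first.
    intermediate = [
        [sum(data[(i * m + j) * n + k] for k in range(n)) for j in range(m)]
        for i in range(b)
    ]
    # Second reduction over the remaining axis of length m.
    return [sum(row) for row in intermediate]


def reduce_merged(data, shape):
    """Treat the last two axes as one axis of length m * n and reduce once."""
    b, m, n = shape
    merged = m * n
    # Row-major layout makes the merged axis a contiguous slice per batch.
    return [sum(data[i * merged:(i + 1) * merged]) for i in range(b)]


shape = (2, 3, 4)
data = list(range(2 * 3 * 4))  # flat row-major buffer for a (2, 3, 4) tensor
two_step = reduce_two_steps(data, shape)
merged = reduce_merged(data, shape)
print(two_step == merged)
```

The merge is only this simple when the two axes are adjacent in memory and the reduction is associative, as with sum or max. Whether it actually speeds up a given kernel still has to be confirmed by profiling.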

Quick Start

Apply the PyPTO optimization rules to your operator to choose tile shapes and loop counts, then profile the results to select the best configuration.
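
The sweep-then-profile workflow can be sketched as follows. This is a hypothetical harness, not part of PyPTO: `candidate_loop_counts`, `sweep`, and `toy_cost` are names invented for this example, and the cost model is made up to mimic the trade-off between task granularity and scheduling overhead. In practice `measure` would be a function that runs your operator with the given loop count and returns the measured time.

```python
def candidate_loop_counts(total_rows, max_count=None):
    """Loop counts that split total_rows into equal chunks.

    Restricting to exact divisors is an assumption of this sketch.
    """
    limit = total_rows if max_count is None else min(max_count, total_rows)
    return [c for c in range(1, limit + 1) if total_rows % c == 0]


def sweep(candidates, measure):
    """Measure every candidate and return (best_candidate, all_results)."""
    results = {c: measure(c) for c in candidates}
    best = min(results, key=results.get)
    return best, results


def toy_cost(loop_count, total_rows=64, parallel_units=8, overhead_per_task=1.0):
    """Made-up cost: per-task scheduling overhead plus work split across units."""
    effective_parallelism = min(loop_count, parallel_units)
    return overhead_per_task * loop_count + total_rows / effective_parallelism


candidates = candidate_loop_counts(64)
best, results = sweep(candidates, toy_cost)
print(best)
for count in candidates:
    print(count, results[count])
```

Under this toy model the lowest cost falls at a loop count of 8, between the extremes of 1 and 64: too few loops leave parallel units idle, while too many add scheduling overhead. Real measurements may place the minimum elsewhere, which is why the full sweep is kept in `results` for inspection.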

Dependency Matrix

Required Modules

None required

Components

Standard package

💻 Claude Code Installation

Recommended: Let Claude install automatically. Copy the text below and paste it into Claude Code.

Please help me install this Skill:
Name: pypto-optimization
Download link: https://github.com/xchang1121/AutoResearch-CC-hook/archive/main.zip#pypto-optimization

Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
