ascend-benchmark-evaluator

Community

Evaluate Lingxi-Code Ascend C code generation.

AuthorJust-it
Version1.0.0
Installs0

System Documentation

What problem does it solve?

This Skill provides a structured framework to evaluate the Lingxi-Code Agent's Ascend C code generation across standardized datasets, producing both correctness verification and performance benchmarks.

Core Features & Use Cases

  • Batch operator evaluation across multiple operators to generate per-operator reports.
  • Correctness validation by comparing generated outputs against reference implementations to ensure functional parity.
  • Performance benchmarking with timing comparisons and a consolidated benchmark report.
  • Automated dataset handling (NPUKernelBench format) and end-to-end evaluation workflow.

Quick Start

Run the evaluator against your NPUKernelbench dataset to produce a complete per-operator and global benchmark report.

Dependency Matrix

Required Modules

torchtorch_npu

Components

scripts

💻 Claude Code Installation

Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.

Please help me install this Skill:
Name: ascend-benchmark-evaluator
Download link: https://github.com/Just-it/AscendOpGenAgent/archive/main.zip#ascend-benchmark-evaluator

Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
View Source Repository

Agent Skills Search Helper

Install a tiny helper to your Agent, search and equip skill from 471,000+ vetted skills library on demand.