onnxruntime

Community

High-performance ONNX model inference and training.

Author: jstzwj
Version: 1.0.0
Installs: 0

System Documentation

What problem does it solve?

This Skill enables efficient inference and training of ONNX models across multiple hardware backends, simplifying deployment and accelerating AI workflows.

Core Features & Use Cases

  • Cross-platform Deployment: Supports inference on CPUs, GPUs, NPUs, and edge devices with hardware-specific acceleration.
  • Model Optimization: Applies graph transformations, quantization, and format conversions for faster startup and lower memory footprint.
  • Use Case: Deploy a transformer model to both server and mobile environments for low-latency, real-time language understanding.
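
The cross-platform bullet above can be illustrated with a minimal sketch of how a caller might pick ONNX Runtime execution providers. The preference order and the helper names `choose_providers` and `open_session` are assumptions made for illustration; they are not taken from this Skill's scripts.

```python
from typing import Iterable, List

# Illustrative preference order (an assumption, not prescribed by this Skill).
PREFERRED_PROVIDERS = [
    "CUDAExecutionProvider",
    "CoreMLExecutionProvider",
    "CPUExecutionProvider",
]


def choose_providers(
    available: Iterable[str],
    preferred: Iterable[str] = PREFERRED_PROVIDERS,
) -> List[str]:
    """Return the preferred providers that are available, in preference order."""
    available_set = set(available)
    chosen = [name for name in preferred if name in available_set]
    # Always end with the CPU provider so the list is never empty.
    if "CPUExecutionProvider" not in chosen:
        chosen.append("CPUExecutionProvider")
    return chosen


def open_session(model_path: str):
    """Create an InferenceSession on the best available hardware backend."""
    # Imported lazily so choose_providers() works without onnxruntime installed.
    import onnxruntime as ort

    providers = choose_providers(ort.get_available_providers())
    return ort.InferenceSession(model_path, providers=providers)
```

Keeping the selection logic separate from session creation means the ordering can be checked on a machine that has no accelerator or no onnxruntime install.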

Quick Start

Invoke the onnxruntime skill to run a pre-trained image classification model on your dataset.
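
As a rough idea of what such a run involves, here is a sketch of image classification with ONNX Runtime. It assumes the model takes one input tensor and returns one logits tensor of shape (1, num_classes); the function names, the `labels` list, and the preprocessing that produces `batch` are hypothetical and depend on the model you use.

```python
import math
from typing import List, Sequence, Tuple


def softmax(logits: Sequence[float]) -> List[float]:
    """Convert raw scores to probabilities (shifted by the max for stability)."""
    peak = max(logits)
    exps = [math.exp(value - peak) for value in logits]
    total = sum(exps)
    return [value / total for value in exps]


def top_k(
    logits: Sequence[float], labels: Sequence[str], k: int = 3
) -> List[Tuple[str, float]]:
    """Return the k most probable (label, probability) pairs, best first."""
    ranked = sorted(
        zip(labels, softmax(logits)), key=lambda pair: pair[1], reverse=True
    )
    return ranked[:k]


def classify(model_path: str, batch, labels: Sequence[str], k: int = 3):
    """Run one preprocessed batch through the model and rank the first image.

    `batch` is assumed to be a NumPy array already shaped and typed the way
    the model expects (for many vision models, float32 with shape 1x3xHxW).
    """
    # Imported lazily so the pure-Python helpers above work on their own.
    import onnxruntime as ort

    session = ort.InferenceSession(model_path, providers=["CPUExecutionProvider"])
    input_name = session.get_inputs()[0].name
    outputs = session.run(None, {input_name: batch})
    logits = outputs[0][0].tolist()  # first output tensor, first image
    return top_k(logits, labels, k)
```

Reading the input name from `session.get_inputs()` avoids hard-coding a tensor name that varies between exported models.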

Dependency Matrix

Required Modules

onnxruntime, protobuf, numpy
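
A quick way to confirm that these modules are importable before invoking the Skill is sketched below. The helper `missing_modules` is hypothetical; the only fact relied on is that the protobuf package is imported as `google.protobuf`, while the other two import under their own names.

```python
import importlib.util
from typing import Dict, List

# Package name as listed above -> module name that Python imports.
IMPORT_NAMES = {
    "onnxruntime": "onnxruntime",
    "protobuf": "google.protobuf",
    "numpy": "numpy",
}


def missing_modules(import_names: Dict[str, str] = IMPORT_NAMES) -> List[str]:
    """Return the package names whose modules cannot be found."""
    missing = []
    for package, module in import_names.items():
        try:
            found = importlib.util.find_spec(module) is not None
        except (ImportError, ValueError):
            # find_spec raises when a parent package such as "google" is absent.
            found = False
        if not found:
            missing.append(package)
    return missing


if __name__ == "__main__":
    absent = missing_modules()
    print("Missing:", ", ".join(absent) if absent else "none")
```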

Components

scripts, references

💻 Claude Code Installation

Recommended: Let Claude install it automatically. Copy the text below and paste it into Claude Code.

Please help me install this Skill:
Name: onnxruntime
Download link: https://github.com/jstzwj/ai-infra-plugins/archive/main.zip#onnxruntime

Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
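
If you prefer to do the extraction step by hand, the sketch below shows one way it might look. It assumes the downloaded archive contains a folder named after the Skill somewhere in its tree; the actual archive layout is not described on this page, and `extract_skill` is a hypothetical helper.

```python
import zipfile
from pathlib import Path, PurePosixPath
from typing import List


def extract_skill(zip_path: str, skill_name: str, skills_dir: str) -> List[Path]:
    """Copy files found under a folder named `skill_name` into skills_dir/skill_name."""
    target_root = Path(skills_dir) / skill_name
    written = []
    with zipfile.ZipFile(zip_path) as archive:
        for info in archive.infolist():
            if info.is_dir():
                continue
            parts = PurePosixPath(info.filename).parts
            # Only keep files that sit inside a directory called skill_name.
            if skill_name not in parts[:-1]:
                continue
            relative = parts[parts.index(skill_name) + 1:]
            if ".." in relative:
                continue  # skip entries that try to escape the target folder
            destination = target_root.joinpath(*relative)
            destination.parent.mkdir(parents=True, exist_ok=True)
            destination.write_bytes(archive.read(info))
            written.append(destination)
    return written
```

For example, `extract_skill("main.zip", "onnxruntime", ".claude/skills")` would place the matching files under `.claude/skills/onnxruntime/`, provided the archive is laid out as assumed.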
