Name: pinchbench
Availability: InStock
Author: pinchbench

System Documentation

What problem does it solve?

PinchBench benchmarks how well AI models perform as the brains of OpenClaw agents by executing real-world tasks and surfacing results on a public leaderboard.

Core Features & Use Cases

Real-world, end-to-end task execution across productivity, research, writing, coding, analysis, and memory
Flexible scoring models: automated, llm_judge, and hybrid with per-task rubrics
Leaderboard submission and model comparison to drive improvements

Quick Start

Run uv run benchmark.py --model <provider/model> to start benchmarking an OpenClaw agent.

Please help me install this Skill: Name: pinchbench Download link: https://github.com/pinchbench/skill/archive/main.zip#pinchbench Please download this .zip file, extract it, and install it in the .claude/skills/ directory.

pinchbench

System Documentation

What problem does it solve?

Core Features & Use Cases

Quick Start

Dependency Matrix

Required Modules

Components

💻 Claude Code Installation

Agent Skills Search Helper