Name: 01fish-llm-radar
Availability: InStock
Author: OrangeViolin

System Documentation

What problem does it solve?

Benchmark OpenAI-compatible LLMs across seven objective tests to help teams compare models quickly and reproducibly.

Core Features & Use Cases

One-shot benchmarking across GSM8K, MATH, MMLU, MMLU-Pro, C-Eval, IFEval, HumanEval for a broad capability snapshot.
Multi-model horizontally comparative evaluation across providers (DeepSeek, OpenRouter, SiliconFlow, Volces, self-hosted endpoints).
Transparent data provenance with per-sample results and aggregated summaries, plus deep analysis reports.

Quick Start

Install dependencies, configure API keys, and run a sample evaluation with a chosen model and benchmark.

Please help me install this Skill: Name: 01fish-llm-radar Download link: https://github.com/OrangeViolin/01fish-llm-radar/archive/main.zip#01fish-llm-radar Please download this .zip file, extract it, and install it in the .claude/skills/ directory.

01fish-llm-radar

System Documentation

What problem does it solve?

Core Features & Use Cases

Quick Start

Dependency Matrix

Required Modules

Components

💻 Claude Code Installation

Agent Skills Search Helper