Name: langsmith-evaluator
Availability: InStock
Author: langchain-ai

System Documentation

What problem does it solve?

LangSmith-based evaluation pipelines for agent outputs with reproducible, code-driven evaluators that ensure consistent scoring across tasks and datasets.

Core Features & Use Cases

Define deterministic run evaluators in Python or TypeScript to validate outputs, trajectories, and schema conformity.
Upload offline evaluators attached to datasets or online evaluators attached to projects for real-time quality checks.
Align evaluation with LangSmith workflows, enabling structured scoring and debugging insights.

Quick Start

Capture agent outputs, write a small code evaluator in Python or TypeScript, and upload it to LangSmith to validate results against your dataset or project.

Please help me install this Skill: Name: langsmith-evaluator Download link: https://github.com/langchain-ai/skills-benchmarks/archive/main.zip#langsmith-evaluator Please download this .zip file, extract it, and install it in the .claude/skills/ directory.

langsmith-evaluator

System Documentation

What problem does it solve?

Core Features & Use Cases

Quick Start

Dependency Matrix

Required Modules

Components

💻 Claude Code Installation

Agent Skills Search Helper