LangSmith Dataset

Official

Create evaluation datasets from traces.

Authorlangchain-ai
Version1.0.0
Installs0

System Documentation

What problem does it solve?

This Skill automates the creation of evaluation datasets from your LangSmith traces, streamlining the process of testing and validating your AI applications.

Core Features & Use Cases

  • Dataset Generation: Automatically generate datasets in various formats (final_response, single_step, trajectory, RAG) from exported trace files.
  • LangSmith Upload: Directly upload generated datasets to your LangSmith account for centralized management and analysis.
  • Use Case: After running your agent on a set of prompts, export the traces and use this Skill to generate a final_response dataset to evaluate the agent's overall performance, or a trajectory dataset to analyze its tool-use patterns.

Quick Start

Use the LangSmith Dataset skill to generate a final_response dataset from the traces in the './traces' directory and save it to './datasets/final_response.json'.

Dependency Matrix

Required Modules

langsmithclickrichpython-dotenvcommanderchalkcli-table3dotenv

Components

scriptsreferences

💻 Claude Code Installation

Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.

Please help me install this Skill:
Name: LangSmith Dataset
Download link: https://github.com/langchain-ai/langchain-skills/archive/main.zip#langsmith-dataset

Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
View Source Repository

Agent Skills Search Helper

Install a tiny helper to your Agent, search and equip skill from 471,000+ vetted skills library on demand.