hive-test

Community

Test and refine AI agents with goal-based eval.

Authoradenhq
Version1.0.0
Installs0

System Documentation

What problem does it solve?

This Skill provides a structured framework to validate that AI agents meet their defined goals by running goal-based evaluation tests, surfacing failures, and guiding iterations.

Core Features & Use Cases

  • Verifies that agents meet their goals through repeatable evaluation tests.
  • Provides debugging support with the debug_test workflow to diagnose failures.
  • Enables iterative improvements by re-running tests after modifications to agent code or goals.
  • Integrates with hive-create and MCP tooling to generate test guidelines and orchestrate test runs.

Quick Start

To start, ensure your agent is exported to exports/your_agent and you have a goal_id and goal_json. Use the MCP tools to generate success test guidelines and then write and run tests:

  • List existing tests with mcp__agent-builder__list_tests(goal_id=..., agent_path="exports/your_agent")
  • Generate success criteria test guidelines with the MCP tool
  • Write tests following the guidelines, then run tests with mcp__agent-builder__run_tests(goal_id=..., agent_path="exports/your_agent")
  • If a test fails, diagnose with mcp__agent-builder__debug_test(goal_id=..., test_name="...", agent_path="exports/your_agent")

Dependency Matrix

Required Modules

None required

Components

Standard package

💻 Claude Code Installation

Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.

Please help me install this Skill:
Name: hive-test
Download link: https://github.com/adenhq/hive/archive/main.zip#hive-test

Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
View Source Repository

Agent Skills Search Helper

Install a tiny helper to your Agent, search and equip skill from 471,000+ vetted skills library on demand.