add-behavioral-tests
Official
Add EvalHub behavioral tests to any agent
Category: Software Engineering
Tags: pytest, tool selection, behavioral tests, mlflow tracing, evalhub, golden queries, e2e evaluation
Author: red-hat-data-services
Version: 1.0.0
Installs: 0
System Documentation
What problem does it solve?
Creates a complete, repo-aligned behavioral testing setup for an agent by generating pytest suites and an EvalHub fixture, along with the required verification and documentation steps.
Core Features & Use Cases
- End-to-end behavioral test workflow: guides the full sequence from Jira scope intake through validation and E2E EvalHub execution.
- MLflow-backed tool-call scoring: checks runner and tracing compatibility, and ensures tool_calls are read from MLflow traces rather than inferred from unreliable content heuristics.
- Strict boundary and change control: prevents modifying the agent under test while allowing test-only artifacts and README updates in scope.
- Use cases: implementing behavioral tests for a new agent, adding behavioral testing when users mention btest/eval coverage/test harness integration, and ensuring MLflow tracing is present for accurate tool scoring.
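The trace-based scoring idea in the list above can be sketched as follows. This is a minimal illustration, not the skill's generated code: the golden queries, the tool names, the `RECORDED_SPANS` data, and the helpers `extract_tool_calls` and `score_tool_selection` are all hypothetical, and spans are simplified to plain dicts whose `span_type` values imitate MLflow's span type names. A real suite would read spans from actual MLflow traces and would likely use `pytest.mark.parametrize` instead of a loop.

```python
from typing import Any

# Hypothetical golden queries: each pairs a prompt with the tool the agent should select.
GOLDEN_QUERIES = [
    {"query": "What is the weather in Boston?", "expected_tool": "get_weather"},
    {"query": "Convert 10 USD to EUR", "expected_tool": "convert_currency"},
]

# Stand-in for spans recorded while the agent handled each query (assumed shape).
RECORDED_SPANS: dict[str, list[dict[str, Any]]] = {
    "What is the weather in Boston?": [
        {"name": "agent_run", "span_type": "AGENT"},
        {"name": "get_weather", "span_type": "TOOL"},
    ],
    "Convert 10 USD to EUR": [
        {"name": "agent_run", "span_type": "AGENT"},
        # The model's text mentions another tool, but no such tool span exists.
        {"name": "llm", "span_type": "CHAT_MODEL", "output": "Not calling get_weather."},
        {"name": "convert_currency", "span_type": "TOOL"},
    ],
}


def extract_tool_calls(spans: list[dict[str, Any]]) -> list[str]:
    """Return the names of tool spans in recorded order, ignoring all other spans."""
    return [span["name"] for span in spans if span.get("span_type") == "TOOL"]


def score_tool_selection(spans: list[dict[str, Any]], expected_tool: str) -> bool:
    """True when the expected tool appears among the traced tool calls."""
    return expected_tool in extract_tool_calls(spans)


def test_golden_queries_select_expected_tool() -> None:
    """pytest-style test: every golden query must trace a call to its expected tool."""
    for case in GOLDEN_QUERIES:
        spans = RECORDED_SPANS[case["query"]]
        assert score_tool_selection(spans, case["expected_tool"]), case["query"]
```

Scoring from spans means a tool name that merely appears in model output (as in the second query) is never counted as a call, which is the failure mode content heuristics are prone to.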
Quick Start
Invoke the skill with your agent path and, when available, the Jira key. For example, run: /agentic-starter-kits-skills:add-behavioral-tests <agent_path> [JIRA-KEY]
Dependency Matrix
Required Modules
None required
Components
references
💻 Claude Code Installation
Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.
Please help me install this Skill:
Name: add-behavioral-tests
Download link: https://github.com/red-hat-data-services/agentic-starter-kits-skills/archive/main.zip#add-behavioral-tests
Please download this .zip file, extract it, and install it in the .claude/skills/ directory.