hamel-husain
Community
Hamel Husain’s evals-driven thinking for engineers
Personal & Entrepreneur
#llm-as-judge #maven #evals #trace-analysis #llm-evals #independent-consultant #hamel-husain
Author: voidborne-d
Version: 1.0.0
Installs: 0
System Documentation
What problem does it solve?
Identify and close AI agent reliability gaps by building production-grade eval sets before making prompt or model changes.
Core Features & Use Cases
- Evals-first workflow: manual trace review, human-validated rubrics, and LLM-as-judge alignment.
- Role-play and identity guidance: define role-specific agent behavior and governance.
- Production tracing: instrumentation and measurement to support continuous improvement.
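As a rough illustration of the LLM-as-judge alignment step named above, the sketch below compares a judge's pass/fail verdicts against human labels on the same traces. It is not taken from the skill itself: the function name `judge_alignment`, the `Alignment` record, and the boolean pass/fail encoding are all assumptions made for this example.

```python
from dataclasses import dataclass
import math


@dataclass
class Alignment:
    """Agreement between human labels and an LLM judge (hypothetical record)."""
    true_positive_rate: float  # share of human "pass" traces the judge also passed
    true_negative_rate: float  # share of human "fail" traces the judge also failed
    agreement: float           # share of traces where both verdicts match


def judge_alignment(human: list[bool], judge: list[bool]) -> Alignment:
    """Compare judge verdicts to human verdicts on the same traces.

    Assumes True means "pass" and False means "fail", and that both lists
    are in the same trace order.
    """
    if len(human) != len(judge) or not human:
        raise ValueError("label lists must be non-empty and the same length")

    pairs = list(zip(human, judge))
    tp = sum(1 for h, j in pairs if h and j)
    tn = sum(1 for h, j in pairs if not h and not j)
    positives = sum(1 for h in human if h)
    negatives = len(human) - positives

    # A rate is undefined (NaN) when the human labels contain no such class.
    tpr = tp / positives if positives else math.nan
    tnr = tn / negatives if negatives else math.nan
    return Alignment(tpr, tnr, (tp + tn) / len(human))
```

Reporting the pass and fail rates separately, rather than a single agreement figure, helps when failures are rare: a judge that passes everything can still score high overall agreement while catching no failures at all.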
Quick Start
Identify the eval gaps in your current agent, then align on an evals-first engagement plan.
Dependency Matrix
Required Modules
None required
Components
Standard package
💻 Claude Code Installation
Recommended: Let Claude install automatically. Copy the text below and paste it into Claude Code.
Please help me install this Skill: Name: hamel-husain Download link: https://github.com/voidborne-d/master-skill/archive/main.zip#hamel-husain Please download this .zip file, extract it, and install it in the .claude/skills/ directory.