view-results
Community
Instantly view Hawk evaluation results.
Author: tbroadley
Version: 1.0.0
Installs: 0
System Documentation
What problem does it solve?
This skill helps teams quickly access, understand, and analyze Hawk evaluation outputs after a run, including eval sets, evaluations, samples, and transcripts.
Core Features & Use Cases
- List Eval Sets: hawk list eval-sets to view available eval sets with IDs, dates, and creators.
- List Evaluations and Samples: hawk list evals [EVAL_SET_ID] and hawk list samples [EVAL_SET_ID] to inspect tasks, models, statuses, and sample counts.
- Retrieve Transcripts: hawk transcript <UUID> to obtain full conversations, with optional --raw for JSON.
- Bulk Access: hawk transcripts <EVAL_SET_ID> to export all transcripts to a directory, with limit controls.
Quick Start
Use the hawk command to list available eval sets, then list evaluations or samples for an eval set, and fetch a transcript for a selected sample.
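The steps above can be sketched as a short shell script. This is a minimal sketch and not an official workflow: the subcommands are the ones listed under Core Features, while the `run` helper, the `DRY_RUN` switch, and the placeholder values for `EVAL_SET_ID` and `SAMPLE_UUID` are assumptions made for illustration. Placing `--raw` after the UUID is also an assumption; the listing only says the flag is optional.

```shell
#!/bin/sh
# Quick-start sketch. Placeholders (hypothetical values): replace them with
# an eval set ID and a sample UUID copied from the listing output.
EVAL_SET_ID="${EVAL_SET_ID:-your-eval-set-id}"
SAMPLE_UUID="${SAMPLE_UUID:-your-sample-uuid}"

# Hypothetical helper: print each command, and execute it only when
# DRY_RUN is set to something other than 1 (the default is a dry run).
run() {
  echo "+ $*"
  if [ "${DRY_RUN:-1}" != "1" ]; then
    "$@"
  fi
}

# 1. List the available eval sets (IDs, dates, creators).
run hawk list eval-sets

# 2. Inspect the evaluations and samples in one eval set.
run hawk list evals "$EVAL_SET_ID"
run hawk list samples "$EVAL_SET_ID"

# 3. Fetch the transcript for a selected sample, then again as JSON.
run hawk transcript "$SAMPLE_UUID"
run hawk transcript "$SAMPLE_UUID" --raw
```

By default the script only prints the commands it would run. Setting `DRY_RUN=0` together with real values for `EVAL_SET_ID` and `SAMPLE_UUID` executes them against an installed `hawk` command.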
Dependency Matrix
Required Modules
None required
Components
Standard package
💻 Claude Code Installation
Recommended: Let Claude install it automatically. Copy the text below and paste it into Claude Code.
Please help me install this Skill:
Name: view-results
Download link: https://github.com/tbroadley/dotfiles/archive/main.zip#view-results
Please download this .zip file, extract it, and install it in the .claude/skills/ directory.