view-results

Community

Instantly view Hawk evaluation results.

Authortbroadley
Version1.0.0
Installs0

System Documentation

What problem does it solve?

This skill helps teams quickly access, understand, and analyze Hawk evaluation outputs after a run, including eval sets, evaluations, samples, and transcripts.

Core Features & Use Cases

  • List Eval Sets: hawk list eval-sets to view available eval sets with IDs, dates, and creators.
  • List Evaluations and Samples: hawk list evals [EVAL_SET_ID] and hawk list samples [EVAL_SET_ID] to inspect tasks, models, statuses, and sample counts.
  • Retrieve Transcripts: hawk transcript <UUID> to obtain full conversations, with optional --raw for JSON.
  • Bulk Access: hawk transcripts <EVAL_SET_ID> to export all transcripts to a directory, with limit controls.

Quick Start

Use the hawk command to list available eval sets, then list evaluations or samples for an eval set, and fetch a transcript for a selected sample.

Dependency Matrix

Required Modules

None required

Components

Standard package

💻 Claude Code Installation

Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.

Please help me install this Skill:
Name: view-results
Download link: https://github.com/tbroadley/dotfiles/archive/main.zip#view-results

Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
View Source Repository

Agent Skills Search Helper

Install a tiny helper to your Agent, search and equip skill from 471,000+ vetted skills library on demand.