Name: shiken
Availability: InStock
Author: ntholm86

System Documentation

What problem does it solve?

It helps identify whether an AI agent is truly reasoning or merely pattern-matching by constructing and analyzing deliberate examination scenarios.

Core Features & Use Cases

Designs targeted probes to test the agent's reasoning capabilities under novel or complex conditions.
Creates paired test cases that share surface features but differ in key details to reveal reasoning divergence.
Analyzes reasoning trails at predicted divergence points to assess the agent's situational understanding and genuine reasoning skills.
Use Case: Test whether an AI correctly interprets nuanced differences in scenarios where routines fail but interpretive reasoning should succeed.

Quick Start

Provide a pair of similar cases with expected divergence points to evaluate the agent's reasoning behavior in complex decision-making situations.

Please help me install this Skill: Name: shiken Download link: https://github.com/ntholm86/autonomous-agent-skills/archive/main.zip#shiken Please download this .zip file, extract it, and install it in the .claude/skills/ directory.

shiken

System Documentation

What problem does it solve?

Core Features & Use Cases

Quick Start

Dependency Matrix

Required Modules

Components

💻 Claude Code Installation

Agent Skills Search Helper