agentic-hyperparm
CommunityTune an agent’s behavior for better outcomes
Software Engineering#end-to-end testing#experiment tracking#context budget#evaluation framework#agent tuning#retrieval depth#abstention policy
Authorthistleknot
Version1.0.0
Installs0
System Documentation
What problem does it solve?
It solves the problem of an agent’s performance, cost, and safety being miscalibrated because its behavioral “dial set” (retrieval, planning, verification, abstention, and sampling) is not tuned to the deployment context.
Core Features & Use Cases
- Layerwise behavioral tuning: Tunes retrieval, context management, reasoning depth, verification passes, abstention policy, then sampling in a deliberate order to avoid wasted search.
- Interaction-aware parameter planning: Models key interaction effects (e.g., retrieval depth vs context budget, temperature vs verification) so configurations are evaluated realistically.
- End-to-end episode evaluation: Uses multi-seed scoring and composite objectives (quality, cost, latency, safety) rather than cheap proxy metrics.
- Trial loop with persistence and validation: Defines how to materialize configs, run tune/holdout banks, persist trial artifacts, and detect tune/holdout overfitting.
Quick Start
Tune the agent’s behavior by running the skill’s layerwise protocol on your tune bank, selecting the best composite-scalar configuration, and then validating it once on a separate holdout bank.
Dependency Matrix
Required Modules
None requiredComponents
Standard package💻 Claude Code Installation
Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.
Please help me install this Skill: Name: agentic-hyperparm Download link: https://github.com/thistleknot/skills/archive/main.zip#agentic-hyperparm Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
Agent Skills Search Helper
Install a tiny helper to your Agent, search and equip skill from 471,000+ vetted skills library on demand.