ArXiv Agentic Verifier

Name: ArXiv Agentic Verifier
Availability: InStock
Author: Wanli-Lee

Community

Find code bugs with targeted edge-case tests.

Software Engineering #test generation #edge cases #verify #generate #code correctness #sandbox execution

AuthorWanli-Lee

Version1.0.0

Installs0

System Documentation

What problem does it solve?

Verifying competitive-coding solutions is hard because edge cases and logic flaws often escape simple samples, so this Skill helps you automatically create discriminative tests and check whether candidate code is correct.

Core Features & Use Cases

Analyze Code Logic: Uses an LLM to reason about the problem statement and candidate code to identify likely failure modes.
Generate Targeted Test Cases: Produces specific inputs plus expected outputs aimed at breaking incorrect logic (not random sampling).
Execute and Verify: Runs the candidate code with the generated input and reports pass/fail based on output equality.

Use case examples: verifying a Python/JavaScript solution in a coding interview harness, diagnosing a wrong-answer submission by generating a counterexample, or stress-testing a small algorithm implementation against tricky boundary conditions.

Quick Start

Create an AgenticVerifier instance and call verify(problem, code, language) to generate a discriminative test case, execute the candidate program, and return whether it passed.

ArXiv Agentic Verifier

System Documentation

What problem does it solve?

Core Features & Use Cases

Quick Start

Dependency Matrix

Required Modules

Components

💻 Claude Code Installation

Agent Skills Search Helper