llm-judge

Community

AI-powered code comparison and scoring.

Author: javierhbr
Version: 1.0.0
Installs: 0

System Documentation

What problem does it solve?

This Skill automates evaluating and comparing code implementations across multiple repositories against predefined criteria, a task that is complex and time-consuming to do by hand.

Core Features & Use Cases

  • Multi-Repo Code Evaluation: Compares code quality, functionality, security, and other rubric criteria across different codebases.
  • LLM-as-Judge Methodology: Leverages AI agents to perform detailed, rubric-based scoring.
  • Structured Reporting: Generates a ranked report with detailed justifications for each score.
  • Use Case: When deciding which of several competing feature implementations to merge, use this Skill to score each one against the same rubric for functionality, security, and maintainability.
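
To make the rubric-based scoring and ranked report concrete, here is a minimal Python sketch of how per-criterion scores might be combined and ordered. It is not the Skill's actual implementation: the `Evaluation` type, the `RUBRIC` weights, the 1-5 scale, and the function names are all assumptions for illustration. In practice the scores and justifications would come from the judging agents.

```python
from dataclasses import dataclass

# Hypothetical rubric: criterion -> weight. The criteria are taken from the
# feature list; the weights are illustrative assumptions, not the Skill's own.
RUBRIC = {"functionality": 0.5, "security": 0.3, "maintainability": 0.2}


@dataclass
class Evaluation:
    repo: str
    scores: dict[str, float]        # criterion -> score on an assumed 1-5 scale
    justifications: dict[str, str]  # criterion -> the judge's stated reason


def weighted_total(evaluation: Evaluation, rubric: dict[str, float] = RUBRIC) -> float:
    """Combine per-criterion scores into one weighted total."""
    missing = set(rubric) - set(evaluation.scores)
    if missing:
        raise ValueError(f"{evaluation.repo} is missing scores for: {sorted(missing)}")
    return sum(weight * evaluation.scores[name] for name, weight in rubric.items())


def rank(evaluations: list[Evaluation], rubric: dict[str, float] = RUBRIC) -> list[Evaluation]:
    """Order evaluations from highest to lowest weighted total."""
    return sorted(evaluations, key=lambda e: weighted_total(e, rubric), reverse=True)


def render_report(evaluations: list[Evaluation], rubric: dict[str, float] = RUBRIC) -> str:
    """Render a ranked plain-text report with one justification per criterion."""
    lines = []
    for position, ev in enumerate(rank(evaluations, rubric), start=1):
        lines.append(f"{position}. {ev.repo}: {weighted_total(ev, rubric):.2f}")
        for name in rubric:
            reason = ev.justifications.get(name, "no justification recorded")
            lines.append(f"   - {name}: {ev.scores[name]} ({reason})")
    return "\n".join(lines)
```

Keeping the weights in one dictionary means every repository is scored against the same rubric, and the justification stored next to each score lets a reviewer audit why one implementation ranked above another.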

Quick Start

Use the llm-judge skill to compare the code in '/path/to/repo-a' and '/path/to/repo-b' against the spec in '/path/to/spec.md'.

Dependency Matrix

Required Modules

@beagle:llm-artifacts-detection

Components

references

💻 Claude Code Installation

Recommended: let Claude install the Skill automatically. Copy and paste the text below into Claude Code.

Please help me install this Skill:
Name: llm-judge
Download link: https://github.com/javierhbr/random-poc/archive/main.zip#llm-judge

Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
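
For readers who prefer to install by hand, the extraction step above could look roughly like the Python sketch below. It assumes the archive has already been downloaded and that it contains a folder named after the Skill somewhere inside it; the archive's real internal layout is not documented here, and `install_skill_from_zip` is a hypothetical helper, not part of Claude Code.

```python
import zipfile
from pathlib import Path, PurePosixPath


def install_skill_from_zip(zip_path, skill_name, skills_dir):
    """Copy the folder named `skill_name` out of a zip into skills_dir/skill_name.

    Assumption: the archive contains a directory called `skill_name` at some
    depth. Files outside that directory are ignored.
    """
    target_root = Path(skills_dir) / skill_name
    written = []
    with zipfile.ZipFile(zip_path) as archive:
        for info in archive.infolist():
            if info.is_dir():
                continue
            parts = PurePosixPath(info.filename).parts
            if skill_name not in parts:
                continue
            # Keep only the path below the skill folder.
            relative = parts[parts.index(skill_name) + 1:]
            if not relative or ".." in relative:
                continue  # skip odd entries and path-traversal attempts
            destination = target_root.joinpath(*relative)
            destination.parent.mkdir(parents=True, exist_ok=True)
            destination.write_bytes(archive.read(info))
            written.append(destination)
    if not written:
        raise FileNotFoundError(f"no '{skill_name}' folder found in {zip_path}")
    return written
```

A call such as `install_skill_from_zip("main.zip", "llm-judge", ".claude/skills")` would then place the Skill's files under `.claude/skills/llm-judge/`, provided the layout assumption holds. Inspect the archive contents before relying on it.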