llm-judge
Community
AI-powered code comparison and scoring.
Software Engineering | #code quality, #testing, #code review, #security analysis, #llm-as-judge, #repository comparison
Author: javierhbr
Version: 1.0.0
Installs: 0
System Documentation
What problem does it solve?
This Skill automates the time-consuming work of evaluating and comparing code implementations across multiple repositories against predefined criteria.
Core Features & Use Cases
- Multi-Repo Code Evaluation: Compares code quality, functionality, security, and more across different codebases.
- LLM-as-Judge Methodology: Leverages AI agents to perform detailed, rubric-based scoring.
- Structured Reporting: Generates a ranked report with detailed justifications for each score.
- Use Case: When deciding which of several competing feature implementations to merge, use this Skill to score each one against the same rubric for functionality, security, and maintainability.
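The rubric-based scoring and ranked report described above can be sketched roughly as follows. This is a minimal illustration, not the Skill's actual implementation: the `JudgeResult` type, the 0-10 score scale, and the weights in `RUBRIC_WEIGHTS` are all assumptions made for the example; only the three criterion names come from the use case above. In practice the per-criterion scores and justifications would come from the judging agents.

```python
from __future__ import annotations

from dataclasses import dataclass, field

# Hypothetical rubric: criterion -> weight. The weights are illustrative
# assumptions, not values defined by the Skill.
RUBRIC_WEIGHTS = {"functionality": 0.40, "security": 0.35, "maintainability": 0.25}


@dataclass
class JudgeResult:
    """Scores one judge produced for one repository (assumed 0-10 scale)."""
    repo: str
    scores: dict[str, float]
    justifications: dict[str, str] = field(default_factory=dict)


def weighted_total(result: JudgeResult, weights: dict[str, float] = RUBRIC_WEIGHTS) -> float:
    """Combine per-criterion scores into one weighted score."""
    missing = sorted(set(weights) - set(result.scores))
    if missing:
        raise ValueError(f"{result.repo} is missing scores for: {', '.join(missing)}")
    return sum(weight * result.scores[name] for name, weight in weights.items())


def rank(results: list[JudgeResult], weights: dict[str, float] = RUBRIC_WEIGHTS) -> list[JudgeResult]:
    """Order repositories from highest to lowest weighted score."""
    return sorted(results, key=lambda r: weighted_total(r, weights), reverse=True)


def render_report(results: list[JudgeResult], weights: dict[str, float] = RUBRIC_WEIGHTS) -> str:
    """Produce a plain-text ranked report with a justification per criterion."""
    lines = []
    for position, result in enumerate(rank(results, weights), start=1):
        lines.append(f"{position}. {result.repo}: {weighted_total(result, weights):.2f}")
        for name in weights:
            reason = result.justifications.get(name, "no justification recorded")
            lines.append(f"   {name}: {result.scores[name]} ({reason})")
    return "\n".join(lines)
```

Keeping the weights in one table makes the trade-off explicit: changing how much security counts relative to maintainability changes the ranking without touching the judges' individual scores.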
Quick Start
Use the llm-judge skill to compare the code in '/path/to/repo-a' and '/path/to/repo-b' against the spec in '/path/to/spec.md'.
Dependency Matrix
Required Modules
@beagle:llm-artifacts-detection
Components
references
💻 Claude Code Installation
Recommended: Let Claude install it automatically. Copy the text below and paste it into Claude Code.
Please help me install this Skill:
Name: llm-judge
Download link: https://github.com/javierhbr/random-poc/archive/main.zip#llm-judge
Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
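For a manual install, the extract-and-copy step could look something like the sketch below. It assumes the archive has already been downloaded and that the Skill lives in a folder named `llm-judge` somewhere inside it; the listing does not state the archive layout, so that is a guess. The `extract_skill` helper is hypothetical and uses only the Python standard library.

```python
import zipfile
from pathlib import Path, PurePosixPath


def extract_skill(archive, skill_name, skills_dir):
    """Copy the files under a folder named `skill_name` out of a zip archive.

    `archive` may be a path or a file-like object. Files end up in
    `skills_dir/skill_name/`, with the folder structure below the skill
    folder preserved. Returns the list of paths written.
    """
    target_root = Path(skills_dir) / skill_name
    written = []
    with zipfile.ZipFile(archive) as zf:
        for info in zf.infolist():
            if info.is_dir():
                continue
            parts = PurePosixPath(info.filename).parts
            # Only keep entries that sit inside a directory named skill_name.
            if skill_name not in parts[:-1]:
                continue
            relative = parts[parts.index(skill_name) + 1:]
            if ".." in relative:
                continue  # skip entries that try to escape the target folder
            destination = target_root.joinpath(*relative)
            destination.parent.mkdir(parents=True, exist_ok=True)
            destination.write_bytes(zf.read(info))
            written.append(destination)
    return written
```

A call such as `extract_skill("main.zip", "llm-judge", ".claude/skills")` would then place the files under `.claude/skills/llm-judge/`, provided the assumed folder name matches the archive.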