extraction-v2

Community

Turn exam PDFs into structured data for grading

AuthorAKCqhzdy
Version1.0.0
Installs0

System Documentation

What problem does it solve?

Extract structured text from exam-paper PDFs / images (questions, student answers, rubrics) into v1-schema JSON (questions.json / rubrics.json) for downstream grading workflows.

Core Features & Use Cases

  • Converts exam documents into a hierarchical v1 schema suitable for grading tools; outputs include questions.json and rubrics.json with optional image data alignment.
  • Embeds [IMAGE_DATA] tokens in content when visuals matter and provides aligned images metadata to export page images later.
  • Maintains compatibility with legacy extraction workflows via a symlinked scripts path and a pluggable prompt module.

Quick Start

Run the extraction-v2 pipeline on a PDF or directory to produce questions.json and rubrics.json and review the wrapped paper envelope artifacts in the outputs.

Dependency Matrix

Required Modules

None required

Components

Standard package

💻 Claude Code Installation

Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.

Please help me install this Skill:
Name: extraction-v2
Download link: https://github.com/AKCqhzdy/dse-subject-grading/archive/main.zip#extraction-v2

Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
View Source Repository

Agent Skills Search Helper

Install a tiny helper to your Agent, search and equip skill from 471,000+ vetted skills library on demand.