scan-organizer
Community
Turn scanned PDFs into categorized folders.
Software Engineering #ocr #pdf #document organization #openai-compatible api #llm classification #metadata sidecar #manifest undo
Author: markuskreitzer
Version: 1.0.0
Installs: 0
System Documentation
What problem does it solve?
It removes the manual work of opening scanned PDFs, figuring out what they are, extracting usable text, and reorganizing them into the right places.
Core Features & Use Cases
- Dual extraction (Docling + vision OCR): Converts structured PDFs with Docling and falls back to vision OCR for pages with sparse text.
- LLM-based classification: Assigns each document to one category (medical, financial, insurance, tax, legal, personal, household, other) and generates structured classification output.
- Organized output with auditability: Moves PDFs into category subfolders and writes markdown plus JSON sidecar metadata, while recording each move in a manifest to support undo and reclassification.
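The three features above can be sketched in a few functions. This is a minimal illustration, not the skill's actual code: the function names, the 40-character sparse-text threshold, the JSON-lines manifest layout, and the sidecar fields are all assumptions made for the example. Only the category list comes from the description above.

```python
import json
import shutil
from pathlib import Path
from typing import Optional

# Category list taken from the feature description above.
CATEGORIES = ("medical", "financial", "insurance", "tax",
              "legal", "personal", "household", "other")

# Hypothetical threshold: pages with less text than this count as sparse.
MIN_CHARS_PER_PAGE = 40


def needs_vision_ocr(page_text: str, min_chars: int = MIN_CHARS_PER_PAGE) -> bool:
    """Return True when the extracted text is too sparse to rely on."""
    return len(page_text.strip()) < min_chars


def normalize_category(label: str) -> str:
    """Map a classifier label onto the fixed list, defaulting to 'other'."""
    cleaned = label.strip().lower()
    return cleaned if cleaned in CATEGORIES else "other"


def organize(pdf: Path, out_root: Path, label: str, text: str, manifest: Path) -> Path:
    """Move a PDF into its category folder, write sidecars, record the move."""
    category = normalize_category(label)
    dest_dir = out_root / category
    dest_dir.mkdir(parents=True, exist_ok=True)
    dest = dest_dir / pdf.name
    shutil.move(str(pdf), str(dest))
    # Markdown and JSON sidecars sit next to the moved PDF (assumed layout).
    dest.with_suffix(".md").write_text(text, encoding="utf-8")
    dest.with_suffix(".json").write_text(
        json.dumps({"category": category, "source": str(pdf)}, indent=2),
        encoding="utf-8",
    )
    # One JSON line per move, so the last line is always the latest move.
    with manifest.open("a", encoding="utf-8") as fh:
        fh.write(json.dumps({"src": str(pdf), "dst": str(dest)}) + "\n")
    return dest


def undo_last(manifest: Path) -> Optional[Path]:
    """Reverse the most recent recorded move and drop it from the manifest."""
    if not manifest.exists():
        return None
    lines = manifest.read_text(encoding="utf-8").splitlines()
    if not lines:
        return None
    entry = json.loads(lines[-1])
    src, dst = Path(entry["src"]), Path(entry["dst"])
    shutil.move(str(dst), str(src))
    for suffix in (".md", ".json"):
        dst.with_suffix(suffix).unlink(missing_ok=True)
    manifest.write_text("".join(line + "\n" for line in lines[:-1]), encoding="utf-8")
    return src
```

Because every move is appended to the manifest before the function returns, an undo or a later reclassification only needs the manifest itself, not a rescan of the output folders. Unknown classifier labels fall back to "other" so a stray label never creates an unplanned folder.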
Quick Start
Run the scan-organizer command's process step on your watch directory. New PDFs are then extracted, classified, and moved into categorized subfolders.
Dependency Matrix
Required Modules
None required
Components
Standard package
💻 Claude Code Installation
Recommended: Let Claude install it automatically. Copy the text below and paste it into Claude Code.
Please help me install this Skill:
Name: scan-organizer
Download link: https://github.com/markuskreitzer/scan-organizer/archive/main.zip#scan-organizer
Please download this .zip file, extract it, and install it in the .claude/skills/ directory.