scan-organizer

Community

Turn scanned PDFs into categorized folders.

Authormarkuskreitzer
Version1.0.0
Installs0

System Documentation

What problem does it solve?

It removes the manual work of opening scanned PDFs, figuring out what they are, extracting usable text, and reorganizing them into the right places.

Core Features & Use Cases

  • Dual extraction (Docling + vision OCR): Converts structured PDFs with Docling and falls back to vision OCR for pages with sparse text.
  • LLM-based classification: Assigns each document to one category (medical, financial, insurance, tax, legal, personal, household, other) and generates structured classification output.
  • Organized output with auditability: Moves PDFs into category subfolders and writes markdown plus JSON sidecar metadata, while recording each move in a manifest to support undo and reclassification.

Quick Start

Use the scan-organizer command to process your watch directory by running process so new PDFs get extracted, classified, and moved into categorized subfolders.

Dependency Matrix

Required Modules

None required

Components

Standard package

💻 Claude Code Installation

Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.

Please help me install this Skill:
Name: scan-organizer
Download link: https://github.com/markuskreitzer/scan-organizer/archive/main.zip#scan-organizer

Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
View Source Repository

Agent Skills Search Helper

Install a tiny helper to your Agent, search and equip skill from 471,000+ vetted skills library on demand.