extract-book
OfficialTurn PDF books into structured Markdown fast.
Content & Communication#pdf#markdown#metadata#pdfplumber#text-extraction#chapter-detection#vision-pass
AuthorNoiseMeldOrg
Version1.0.0
Installs0
System Documentation
What problem does it solve?
This skill converts PDFs of books into well-structured Markdown with chapters, metadata blocks, and cleaned text.
Core Features & Use Cases
- Detects chapters using multiple strategies (text markers, single-number markers, named sections, and TOC-based detection) to produce reliable, multi-chapter outputs.
- Builds a clean, navigable Markdown document with per-chapter sections, metadata blocks, and optional image-rendered placeholders for pages that cannot be OCRed.
- Supports end-to-end processing from PDF extraction to Markdown assembly, with automatic metadata extraction (title, author, publisher, ISBN, copyright).
Quick Start
Convert a PDF book into structured Markdown with chapters, metadata, and clean text.
Dependency Matrix
Required Modules
pdfplumberpypdfium2
Components
scripts
💻 Claude Code Installation
Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.
Please help me install this Skill: Name: extract-book Download link: https://github.com/NoiseMeldOrg/skills/archive/main.zip#extract-book Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
Agent Skills Search Helper
Install a tiny helper to your Agent, search and equip skill from 471,000+ vetted skills library on demand.