extract-pdf

Name: extract-pdf
Availability: InStock
Author: KaihuaTang

Community

Turn any PDF into structured markdown + images.

Software Engineering #pdf #pymupdf #markdown extraction #pdfminer #page rendering #table detection #multimodal reading

AuthorKaihuaTang

Version1.0.0

Installs0

System Documentation

What problem does it solve?

It converts PDFs into Claude-readable content without losing structure, figures, tables, or scanned-page visuals that naive text extraction often mangles.

Core Features & Use Cases

Structured Markdown extraction: Preserves headings, lists, and tables so you can summarize and analyze like it’s a readable document.
Per-page rendered images: Produces PNGs so Claude can directly interpret charts, equations, and scanned pages in a multimodal way.
Complementary plain-text extraction: Uses a second extractor to recover text when the first pass misreads tricky layouts, encoding, or typography.
Use Case: When you ask Claude to “look inside this PDF,” this skill prepares a full workspace (manifest + files) for fast, reliable downstream reading and analysis.

Quick Start

Run the extract-pdf skill on your PDF at a given path or URL, writing outputs into an extraction folder, for example by instructing Claude to execute the provided command on your target PDF input.

extract-pdf

System Documentation

What problem does it solve?

Core Features & Use Cases

Quick Start

Dependency Matrix

Required Modules

Components

💻 Claude Code Installation

Agent Skills Search Helper