extract-tables-from-pdf

Community

Turn PDF tables into clean data fast.

AuthorGozei
Version1.0.0
Installs0

System Documentation

What problem does it solve?

PDF tables are often embedded in documents and are hard to extract into structured data for analysis. This Skill automates detection and extraction of structured table data from both native and scanned PDFs, preserving row and column layouts, and supports OCR for scanned documents. It helps data analysts, researchers, and teams quickly convert PDF tables into usable formats.

Core Features & Use Cases

  • Automatic table detection in PDFs with preserved row/column structure.
  • OCR mode for scanned PDFs and support for local files or URLs.
  • Use cases include extracting financial tables, research results, or reports into CSV/JSON for downstream analytics.

Quick Start

Ask me to extract tables from a PDF such as report.pdf and return the results as a structured table.

Dependency Matrix

Required Modules

None required

Components

Standard package

💻 Claude Code Installation

Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.

Please help me install this Skill:
Name: extract-tables-from-pdf
Download link: https://github.com/Gozei/ClawX/archive/main.zip#extract-tables-from-pdf

Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
View Source Repository

Agent Skills Search Helper

Install a tiny helper to your Agent, search and equip skill from 471,000+ vetted skills library on demand.