ingest-pdf

Community

Instantly extract editable text from PDFs

Author: RonanCodes
Version: 1.0.0
Installs: 0

System Documentation

What problem does it solve?

Extracts the selectable text from PDF files and prepares it for automated wiki ingestion. This removes manual copy-paste and makes the text available for downstream summarization and citation.

Core Features & Use Cases

  • Layout-preserving extraction: Runs pdftotext in layout mode (the -layout flag) to keep columns and basic formatting intact, which makes the extracted text easier to parse.
  • Page-range support: Extract specific page ranges for large documents or targeted imports.
  • Repository integration: Recommends copying the original PDF into the vault's raw/ folder, then passes the extracted text to the ingest pipeline for page creation.
  • Use Case: Convert a research paper or technical manual PDF into plain text to generate wiki pages, summaries, and source notes.
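
The extraction features above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the skill's actual implementation: the function names are hypothetical, pdftotext (shipped with Poppler) is assumed to be on the PATH, and the vault is assumed to keep originals in a raw/ subfolder as described above. The -layout, -f, and -l flags are standard pdftotext options, and "-" as the output file sends the text to stdout.

```python
import shutil
import subprocess
from pathlib import Path
from typing import List, Optional


def build_pdftotext_cmd(pdf_path: str,
                        first_page: Optional[int] = None,
                        last_page: Optional[int] = None) -> List[str]:
    """Build a pdftotext command line (hypothetical helper)."""
    if first_page is not None and first_page < 1:
        raise ValueError("first_page must be >= 1")
    if (first_page is not None and last_page is not None
            and last_page < first_page):
        raise ValueError("last_page must be >= first_page")
    cmd = ["pdftotext", "-layout"]       # layout mode keeps columns aligned
    if first_page is not None:
        cmd += ["-f", str(first_page)]   # first page to convert
    if last_page is not None:
        cmd += ["-l", str(last_page)]    # last page to convert
    cmd += [str(pdf_path), "-"]          # "-" writes the text to stdout
    return cmd


def extract_text(pdf_path: str,
                 first_page: Optional[int] = None,
                 last_page: Optional[int] = None) -> str:
    """Run pdftotext and return the extracted text."""
    result = subprocess.run(
        build_pdftotext_cmd(pdf_path, first_page, last_page),
        capture_output=True, text=True, check=True,
    )
    return result.stdout


def stage_original(pdf_path: str, vault_dir: str) -> Path:
    """Copy the source PDF into <vault>/raw/ (assumed vault layout)."""
    raw_dir = Path(vault_dir) / "raw"
    raw_dir.mkdir(parents=True, exist_ok=True)
    return Path(shutil.copy2(pdf_path, raw_dir))
```

For example, extract_text("/path/to/file.pdf", first_page=3, last_page=7) would convert only pages 3 through 7. Building the command as a list rather than a shell string avoids quoting problems with file names that contain spaces.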

Quick Start

Ingest the PDF at /path/to/file.pdf and extract its text for wiki ingestion.

Dependency Matrix

Required Modules

None required

Components

Standard package

💻 Claude Code Installation

Recommended: let Claude install the skill automatically by pasting the text below into Claude Code.

Please help me install this Skill:
Name: ingest-pdf
Download link: https://github.com/RonanCodes/llm-wiki/archive/main.zip#ingest-pdf

Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
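
If you prefer to install by hand, the extraction step might look roughly like the sketch below. It assumes the archive has already been downloaded and that the skill lives in a folder named ingest-pdf somewhere inside it; the archive's internal layout is not documented here, so the function (a hypothetical helper) searches for that folder name rather than relying on a fixed path.

```python
import zipfile
from pathlib import Path, PurePosixPath
from typing import List


def extract_skill(zip_path: str, skill_name: str, skills_dir: str) -> List[Path]:
    """Copy files found under a folder named `skill_name` in the archive
    into <skills_dir>/<skill_name>/ and return the paths written."""
    dest_root = Path(skills_dir) / skill_name
    written = []
    with zipfile.ZipFile(zip_path) as zf:
        for info in zf.infolist():
            parts = PurePosixPath(info.filename).parts
            # Skip directories and files outside the skill folder.
            if info.is_dir() or skill_name not in parts[:-1]:
                continue
            rel = parts[parts.index(skill_name) + 1:]
            if ".." in rel:  # refuse entries that try to escape the target
                continue
            target = dest_root.joinpath(*rel)
            target.parent.mkdir(parents=True, exist_ok=True)
            target.write_bytes(zf.read(info))
            written.append(target)
    return written
```

Calling extract_skill("main.zip", "ingest-pdf", ".claude/skills") would place the skill's files under .claude/skills/ingest-pdf/, where "main.zip" stands for wherever you saved the download.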