ingest-pdf
Community
Instantly extract editable text from PDFs
Author: RonanCodes
Version: 1.0.0
Installs: 0
System Documentation
What problem does it solve?
Extract selectable text from PDF files and prepare it for automated wiki ingestion, removing the need for manual copy-paste and enabling downstream summarization and citation.
Core Features & Use Cases
- Layout-preserving extraction: Uses pdftotext with layout mode to retain columns and basic formatting for more accurate parsing.
- Page-range support: Extract specific page ranges for large documents or targeted imports.
- Repository integration: Advises copying the original PDF to the vault raw/ folder and routes extracted text to the ingest pipeline for page creation.
- Use Case: Convert a research paper or technical manual PDF into plain text to generate wiki pages, summaries, and source notes.
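The extraction features above can be sketched as a thin wrapper around the `pdftotext` command-line tool from Poppler. This is a minimal illustration, not the skill's actual implementation: the function names `build_pdftotext_cmd` and `ingest_pdf`, the `vault` parameter, and the choice to write the `.txt` file next to the copied PDF in `raw/` are all assumptions made for the example. The only flags used are `-layout` (layout mode), `-f` (first page), and `-l` (last page).

```python
import shutil
import subprocess
from pathlib import Path
from typing import List, Optional


def build_pdftotext_cmd(pdf: Path, out: Path,
                        first_page: Optional[int] = None,
                        last_page: Optional[int] = None) -> List[str]:
    """Build a pdftotext command with layout mode and an optional page range."""
    cmd = ["pdftotext", "-layout"]
    if first_page is not None:
        cmd += ["-f", str(first_page)]   # first page to extract
    if last_page is not None:
        cmd += ["-l", str(last_page)]    # last page to extract
    cmd += [str(pdf), str(out)]
    return cmd


def ingest_pdf(pdf: Path, vault: Path,
               first_page: Optional[int] = None,
               last_page: Optional[int] = None) -> Path:
    """Copy the original PDF into <vault>/raw/ and extract its text.

    Hypothetical helper: the output location is an assumption,
    not something the skill documents.
    """
    raw_dir = vault / "raw"
    raw_dir.mkdir(parents=True, exist_ok=True)
    shutil.copy2(pdf, raw_dir / pdf.name)          # keep the original alongside the text
    out = raw_dir / (pdf.stem + ".txt")
    # Requires pdftotext (Poppler) on PATH; raises CalledProcessError on failure.
    subprocess.run(build_pdftotext_cmd(pdf, out, first_page, last_page), check=True)
    return out
```

For example, `ingest_pdf(Path("/path/to/file.pdf"), Path("my-vault"), first_page=10, last_page=25)` would extract only pages 10 through 25. The command builder is kept separate from the subprocess call so the arguments can be inspected without Poppler installed.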
Quick Start
Ingest the PDF at /path/to/file.pdf and extract its text for wiki ingestion.
Dependency Matrix
Required Modules: None required
Components: Standard package
Claude Code Installation
Recommended: Let Claude install the skill automatically. Copy the text below and paste it into Claude Code.
Please help me install this Skill:
Name: ingest-pdf
Download link: https://github.com/RonanCodes/llm-wiki/archive/main.zip#ingest-pdf
Please download this .zip file, extract it, and install it in the .claude/skills/ directory.