docetl

Official

Build and run LLM-powered data pipelines.

Authorucbepic
Version1.0.0
Installs0

System Documentation

What problem does it solve?

DocETL provides a framework to design, orchestrate, and execute end-to-end pipelines that leverage large language models to process documents, extract information, and transform data for downstream use.

Core Features & Use Cases

  • End-to-end pipeline authoring for data collection, extraction, transformation, and execution
  • Interactive UI playground for iterative prompt engineering and pipeline development plus a Python package for production use
  • Use cases include extracting structured data from unstructured documents, validating results, and optimizing prompts and schemas

Quick Start

Use the docetl skill to design and run a minimal pipeline that ingests a sample JSON file and prints a short summary of the results.

Dependency Matrix

Required Modules

None required

Components

Standard package

💻 Claude Code Installation

Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.

Please help me install this Skill:
Name: docetl
Download link: https://github.com/ucbepic/docetl/archive/main.zip#docetl

Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
View Source Repository

Agent Skills Search Helper

Install a tiny helper to your Agent, search and equip skill from 471,000+ vetted skills library on demand.