ls-video-lens
CommunityIndex and caption YouTube videos frame-by-frame.
System Documentation
What problem does it solve?
This Skill eliminates the manual effort of watching, summarizing, and extracting visual information from videos by automatically downloading a YouTube video, sampling it into frames, and generating per-frame visual descriptions.
Core Features & Use Cases
- Frame extraction at configurable intervals: Samples a video into JPEG frames every N seconds for dense or lightweight visual coverage.
- Visual captioning/indexing via VLM backends: Produces a structured index by captioning each frame using either LM Studio (local) or the Claude API (hosted).
- Caption-free extraction mode: Supports an extract-only workflow for scenarios like manual review or Claude Code-style reading of frames directly.
- Outputs for downstream use: Writes a machine-readable JSON index and a human-readable Markdown summary with timestamps and captions.
Use case: When a user says “index this video” or “what happens in this video,” this Skill creates a searchable timeline of visual events so you can quickly locate moments that matter (e.g., tutorials, on-screen text, product demos).
Quick Start
Tell the assistant to download a YouTube URL, extract one frame every 5 seconds, and caption/index each frame using the LM Studio backend.
Dependency Matrix
Required Modules
Components
Standard package💻 Claude Code Installation
Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.
Please help me install this Skill: Name: ls-video-lens Download link: https://github.com/ahostbr/liteharness/archive/main.zip#ls-video-lens Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
Agent Skills Search Helper
Install a tiny helper to your Agent, search and equip skill from 471,000+ vetted skills library on demand.