System Documentation

What problem does it solve?

Vox adds low-latency, local speech interfaces (text-to-speech and speech-to-text) to your workflow without relying on cloud APIs.

Core Features & Use Cases

  • Local TTS + STT via MCP tools: Use say/enqueue_say to speak and listen/converse to transcribe what you say in real time.
  • Background listening + inbox retrieval: Use start_listening, then poll check_inbox to retrieve transcriptions asynchronously.
  • On-device DSP calibration for better accuracy: Use calibrate, plus reset_dsp, to tune microphone processing to your room.
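The background-listening pattern above (start listening, then poll for results) can be sketched as a small polling loop. This is a minimal illustration, not Vox's own client code: the check_inbox argument is a hypothetical stand-in callable, since the real tool is invoked through MCP and its exact return shape is not documented here. The sketch assumes it returns a list of transcribed strings.

```python
import time
from typing import Callable, List


def poll_inbox(
    check_inbox: Callable[[], List[str]],
    interval_s: float = 0.5,
    timeout_s: float = 10.0,
    sleep: Callable[[float], None] = time.sleep,
    clock: Callable[[], float] = time.monotonic,
) -> List[str]:
    """Call check_inbox repeatedly until it yields items or the timeout passes.

    check_inbox is a hypothetical wrapper around the MCP tool of the same
    name; it is assumed to return a (possibly empty) list of transcriptions.
    """
    deadline = clock() + timeout_s
    while True:
        items = check_inbox()
        if items:
            # Something was transcribed in the background; hand it back.
            return list(items)
        if clock() >= deadline:
            # Nothing arrived in time; an empty list signals "no speech yet".
            return []
        sleep(interval_s)
```

Passing sleep and clock as parameters keeps the loop easy to exercise without a microphone or real delays; in normal use the defaults apply.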

Quick Start

Use the Vox skill to run the MCP tool listen, which records from your microphone and returns the transcribed text.

Dependency Matrix

Required Modules

None required

Components

Standard package

💻 Claude Code Installation

Recommended: Let Claude install it automatically. Copy and paste the text below into Claude Code.

Please help me install this Skill:
Name: vox
Download link: https://github.com/trvon/vox/archive/main.zip#vox

Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
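For a manual install, the download-extract-install steps can be sketched roughly as follows. This is an illustration under assumptions, not an official installer: it assumes the archive has a single top-level folder (GitHub branch archives usually unpack to a folder such as vox-main/) and that the skill files should end up directly under .claude/skills/vox. The function names extract_skill and download_zip are made up for this example.

```python
import io
import urllib.request
import zipfile
from pathlib import Path


def download_zip(url: str) -> bytes:
    """Fetch the archive bytes from the given URL."""
    with urllib.request.urlopen(url) as resp:
        return resp.read()


def extract_skill(zip_bytes: bytes, skills_dir: Path, name: str = "vox") -> Path:
    """Unpack the archive into skills_dir/name, dropping the top-level folder."""
    target = Path(skills_dir) / name
    target.mkdir(parents=True, exist_ok=True)
    with zipfile.ZipFile(io.BytesIO(zip_bytes)) as zf:
        for info in zf.infolist():
            # Assumption: the first path component is the archive's wrapper folder.
            parts = Path(info.filename).parts[1:]
            if not parts or info.is_dir():
                continue
            if ".." in parts:
                # Skip entries that would escape the target directory.
                continue
            dest = target.joinpath(*parts)
            dest.parent.mkdir(parents=True, exist_ok=True)
            dest.write_bytes(zf.read(info))
    return target
```

A typical call would be extract_skill(download_zip("https://github.com/trvon/vox/archive/main.zip"), Path(".claude/skills")), run from the project root.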
