cartesia-api

Official

Build Cartesia TTS/STT into your app

Authorcartesia-ai
Version1.0.0
Installs0

System Documentation

What problem does it solve?

This skill solves the problem of integrating Cartesia speech capabilities—text-to-speech, speech-to-text, and voice selection—into application code and agent-assisted workflows without getting stuck on authentication, versioning, or API conventions.

Core Features & Use Cases

  • TTS (Sonic) integration: Generate audio from text using the correct HTTPS/WebSocket endpoints and the required version header or query parameter.
  • STT (Ink) integration: Convert spoken audio to text for conversational or real-time use cases while keeping request semantics consistent.
  • Voice management: Use voices via library/clone/localize workflows and select them correctly using API-supported voice identifiers.
  • Agent-ready guidance: Helps developers and coding agents choose REST/WebSocket vs SDK paths, handle concurrency/quota errors, and avoid mixing up Cartesia API vs Line deployments.

Quick Start

Use the cartesia-api skill to send a POST request to the Cartesia TTS endpoint with Authorization Bearer set to your Cartesia API key and include the Cartesia-Version header that matches your documentation.

Dependency Matrix

Required Modules

None required

Components

references

đź’» Claude Code Installation

Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.

Please help me install this Skill:
Name: cartesia-api
Download link: https://github.com/cartesia-ai/skills/archive/main.zip#cartesia-api

Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
View Source Repository

Agent Skills Search Helper

Install a tiny helper to your Agent, search and equip skill from 471,000+ vetted skills library on demand.