streaming

Community

Deliver real-time AI responses, effortlessly.

Author: juanre
Version: 1.0.0
Installs: 0

System Documentation

What problem does it solve?

Displaying LLM responses incrementally improves the user experience, especially for long generations, but managing async streams and tracking usage across different providers is complex. This Skill simplifies that by providing a unified streaming API across OpenAI, Anthropic, Google, and Ollama.

Core Features & Use Cases

  • Real-time Output: Display LLM responses as they are generated, improving user experience for chat interfaces and long-form content generation.
  • Unified Streaming API: Use a single, consistent API for streaming across OpenAI, Anthropic, Google, and Ollama, eliminating provider-specific complexities.
  • Usage Tracking: Automatically capture token usage and other statistics from the final stream chunk, enabling cost monitoring and optimization.
  • Use Case: Build a responsive AI chatbot that shows text as it is generated, making interactions feel more natural and immediate, or stream a long report as it is written to keep users engaged.
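
The pattern behind these features can be sketched in a few lines of Python. This is a minimal illustration, not the Skill's actual API: `Chunk`, `fake_provider_stream`, and `consume` are hypothetical names, the provider is simulated locally, and the "token" count is just a word count standing in for real usage data. It assumes only what the feature list states, namely that text arrives incrementally and that usage statistics arrive with the final chunk.

```python
import asyncio
from dataclasses import dataclass
from typing import AsyncIterator, Optional, Tuple


@dataclass
class Chunk:
    """Hypothetical provider-neutral stream chunk (not the Skill's real type)."""
    delta: str = ""
    usage: Optional[dict] = None  # assumed to be set only on the final chunk


async def fake_provider_stream(text: str) -> AsyncIterator[Chunk]:
    """Simulated provider: yields one word per chunk, then a usage-only chunk."""
    words = text.split()
    for word in words:
        await asyncio.sleep(0)  # hand control back to the loop, like a network read
        yield Chunk(delta=word + " ")
    # Word count is a stand-in for real token accounting.
    yield Chunk(usage={"completion_tokens": len(words)})


async def consume(stream: AsyncIterator[Chunk]) -> Tuple[str, Optional[dict]]:
    """Print deltas as they arrive and return the full text plus usage."""
    parts = []
    usage = None
    async for chunk in stream:
        if chunk.delta:
            print(chunk.delta, end="", flush=True)  # incremental display
            parts.append(chunk.delta)
        if chunk.usage is not None:
            usage = chunk.usage  # captured from the final chunk
    print()
    return "".join(parts), usage


if __name__ == "__main__":
    full_text, usage = asyncio.run(
        consume(fake_provider_stream("1 2 3 4 5 6 7 8 9 10"))
    )
    print(usage)
```

Because `consume` depends only on the chunk shape, the same loop would work for any provider whose stream is adapted to that shape, which is the idea behind a unified streaming API.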

Quick Start

Use the streaming skill to get a real-time response from the 'chatbot' alias asking it to 'Count to 10'.

Dependency Matrix

Required Modules

None required

Components

Standard package

💻 Claude Code Installation

Recommended: Let Claude install it automatically. Copy the text below and paste it into Claude Code.

Please help me install this Skill:
Name: streaming
Download link: https://github.com/juanre/llmring/archive/main.zip#streaming

Please download this .zip file, extract it, and install it in the .claude/skills/ directory.