streaming
Community
Deliver real-time AI responses, effortlessly.
Author: juanre
Version: 1.0.0
Installs: 0
System Documentation
What problem does it solve?
Displaying LLM responses incrementally improves the user experience, especially for long generations, but managing async streams and tracking usage across different providers is complex. This Skill simplifies that by providing a single streaming API across OpenAI, Anthropic, Google, and Ollama.
Core Features & Use Cases
- Real-time Output: Display LLM responses as they are generated, improving user experience for chat interfaces and long-form content generation.
- Unified Streaming API: Use a single, consistent API for streaming across OpenAI, Anthropic, Google, and Ollama, eliminating provider-specific complexities.
- Usage Tracking: Automatically capture token usage and other statistics from the final stream chunk, enabling cost monitoring and optimization.
- Use Case: Build a responsive AI chatbot that shows text as it is generated, making interactions feel more natural and immediate, or stream a long report as it is written to keep users engaged.
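The features above can be illustrated with a minimal sketch of the consumer side: print each text delta as it arrives and keep the usage statistics carried by the final chunk. This is not the Skill's actual API. `StreamChunk`, `fake_provider_stream`, and `consume` are hypothetical names, the provider is simulated with a local async generator, and the token counts are placeholder values.

```python
import asyncio
from dataclasses import dataclass
from typing import AsyncIterator, Optional, Tuple


@dataclass
class StreamChunk:
    """Hypothetical unified chunk shape (an assumption, not the Skill's type)."""
    delta: str = ""               # text produced since the previous chunk
    usage: Optional[dict] = None  # set only on the final chunk


async def fake_provider_stream(text: str) -> AsyncIterator[StreamChunk]:
    """Stand-in for a real provider stream: one word per chunk, then usage."""
    words = text.split()
    for i, word in enumerate(words):
        await asyncio.sleep(0)  # yield control, as a network read would
        yield StreamChunk(delta=word if i == 0 else " " + word)
    # The final chunk carries no text, only placeholder usage statistics.
    yield StreamChunk(usage={"prompt_tokens": 4, "completion_tokens": len(words)})


async def consume(stream: AsyncIterator[StreamChunk]) -> Tuple[str, Optional[dict]]:
    """Print deltas as they arrive; return the full text and the usage dict."""
    parts = []
    usage = None
    async for chunk in stream:
        if chunk.delta:
            print(chunk.delta, end="", flush=True)  # incremental display
            parts.append(chunk.delta)
        if chunk.usage is not None:
            usage = chunk.usage  # captured from the last chunk
    print()
    return "".join(parts), usage


if __name__ == "__main__":
    text, usage = asyncio.run(consume(fake_provider_stream("1 2 3 4 5 6 7 8 9 10")))
    print(usage)
```

Because the consumer depends only on the chunk shape, the same loop would work for any provider whose stream is adapted to that shape, which is the point of a unified streaming API.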
Quick Start
Use the streaming skill to get a real-time response from the 'chatbot' alias asking it to 'Count to 10'.
Dependency Matrix
Required Modules
None required
Components
Standard package
Claude Code Installation
Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.
Please help me install this Skill:
Name: streaming
Download link: https://github.com/juanre/llmring/archive/main.zip#streaming
Please download this .zip file, extract it, and install it in the .claude/skills/ directory.