ai-cost-finops
OfficialCut LLM spend with caching, batching, routing
Finance & Accounting#llm#model routing#finops#prompt caching#token economics#batch api#cost observability
AuthorMuvon
Version1.0.0
Installs0
System Documentation
What problem does it solve?
This Skill helps you reduce and forecast LLM costs by turning pricing, token economics, and provider features (prompt caching, Batch APIs, and model routing) into actionable optimization steps.
Core Features & Use Cases
- Cost projection with real price inputs: Uses provider-specific input/output token rates to estimate spend before you ship.
- Prompt caching optimization: Applies Anthropic and OpenAI caching mechanics (discount rates, TTL behavior, what to cache, and break-even reasoning).
- Batching and async execution strategy: Designs where Batch APIs fit to capture standardized discounts for non-real-time workloads.
- Model routing playbooks: Guides when to default to smaller models and when to escalate to larger/frontier models using audit-friendly rules.
- Token and output-length economics: Focuses on output discipline and structured outputs to control the dominant output token cost driver.
- RAG cost stack budgeting: Estimates spending across embeddings, reranking, and vector DB operations to find the biggest cost drivers in retrieval pipelines.
- FinOps observability and attribution: Recommends instrumentation patterns (Helicone, Langfuse, Phoenix, LangSmith, OpenLLMetry, Vantage) so you can measure per-feature waste and allocate spend accurately.
Quick Start
Ask the AI: "Project our 2026 monthly LLM cost for the 'customer support triage' feature, using your May 2026 pricing assumptions, then list the top 3 levers to reduce spend with expected impact."
Dependency Matrix
Required Modules
None requiredComponents
references
💻 Claude Code Installation
Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.
Please help me install this Skill: Name: ai-cost-finops Download link: https://github.com/Muvon/octomind-tap/archive/main.zip#ai-cost-finops Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
Agent Skills Search Helper
Install a tiny helper to your Agent, search and equip skill from 471,000+ vetted skills library on demand.