truefoundry-llm-deploy

Official

Deploy LLMs and GPU inference on TrueFoundry.

Authortruefoundry
Version1.0.0
Installs0

System Documentation

What problem does it solve?

Deploys LLM models and GPU-accelerated inference servers on TrueFoundry using YAML manifests and the tfy CLI or REST API.

Core Features & Use Cases

  • Production-grade LLM deployment with vLLM, TGI, or NVIDIA NIM
  • GPU provisioning, model caching, and health probes for reliable serving
  • Use cases include hosting Gemma, Llama, Mistral, or HuggingFace models for inference

Quick Start

Provide the HuggingFace model ID and workspace to generate a ready-to-deploy manifest.

Dependency Matrix

Required Modules

None required

Components

scriptsreferences

💻 Claude Code Installation

Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.

Please help me install this Skill:
Name: truefoundry-llm-deploy
Download link: https://github.com/truefoundry/tfy-deploy-skills/archive/main.zip#truefoundry-llm-deploy

Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
View Source Repository

Agent Skills Search Helper

Install a tiny helper to your Agent, search and equip skill from 471,000+ vetted skills library on demand.