transformers-docs
CommunityGet answers fast for Hugging Face Transformers
Education & Research#fine-tuning#quantization#huggingface#transformers#vllm#model loading#inference optimization
Authorwenerme
Version1.0.0
Installs0
System Documentation
What problem does it solve?
This Skill helps you quickly find the right guidance for using and optimizing the Hugging Face Transformers library, covering model loading, training, inference, tokenization, quantization, and serving integrations.
Core Features & Use Cases
- Transformers usage reference for core concepts like
AutoModel/AutoTokenizer,from_pretrained, thepipeline()API,generate(), and theTrainerworkflow. - Inference optimization & serving guidance including attention backends, KV-cache strategies, and integration pointers for vLLM/SGLang/llama.cpp.
- Training & distributed setup support spanning FSDP, DeepSpeed, Accelerate, and Trainer-related configuration topics.
- Quantization roadmap for GPTQ, AWQ, bitsandbytes, and GGUF, with links to the most relevant subsections.
Quick Start
Ask it for guidance on how to load a pretrained causal language model and choose the correct decoding strategy for generate() while also understanding how quantization affects that flow.
Dependency Matrix
Required Modules
None requiredComponents
references
💻 Claude Code Installation
Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.
Please help me install this Skill: Name: transformers-docs Download link: https://github.com/wenerme/ai/archive/main.zip#transformers-docs Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
Agent Skills Search Helper
Install a tiny helper to your Agent, search and equip skill from 471,000+ vetted skills library on demand.