gpu

Community

Real-time GPU monitoring for Ollama inference.

Authoratrawog
Version1.0.0
Installs0

System Documentation

What problem does it solve?

Monitor and optimize GPU usage for Ollama inference, ensuring performance and resource efficiency.

Core Features & Use Cases

  • Real-time GPU status monitoring (name, memory, utilization) during Ollama runs.
  • Track models loaded in GPU memory and their VRAM consumption.
  • Benchmark inference latency and throughput to identify bottlenecks in GPU-accelerated workflows.

Quick Start

Run a quick health check on the Ollama GPU setup and report current GPU usage and loaded models.

Dependency Matrix

Required Modules

None required

Components

Standard package

💻 Claude Code Installation

Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.

Please help me install this Skill:
Name: gpu
Download link: https://github.com/atrawog/overthink-plugins/archive/main.zip#gpu

Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
View Source Repository

Agent Skills Search Helper

Install a tiny helper to your Agent, search and equip skill from 471,000+ vetted skills library on demand.