troubleshoot-zymtrace-profiler

Official

Diagnose and fix a misbehaving GPU profiler agent

Authorzystem-io
Version1.0.0
Installs0

System Documentation

What problem does it solve?

This Skill troubleshoots the zymtrace profiler agent when it misbehaves, preventing reliable CPU/GPU profile collection.

Core Features & Use Cases

  • Symptom-to-root-cause routing: Guides diagnosis for CrashLoopBackOff, ImagePullBackOff, OOMKilled/restart cycles, missing NVML/GPU metrics, broken PC sampling, and profiler-side license/auth failures.
  • Agent-focused verification: Checks DaemonSet health, pod readiness, recent agent logs, and workload interception signals, while explicitly handing off to backend troubleshooting when the issue is downstream.
  • Kubernetes-first workflows: Uses kubectl and helm workflows to triage GPU vs CPU expectations, container configuration mismatches, and common cluster/runtime constraints.
  • Use Case: When the UI shows no GPU traces but the CPU path appears partially healthy, it walks the agent-side implant and GPU metrics prerequisites (cudaProfiler enabled, NVML discoverability, and PC sampling conditions) to determine whether the failure is agent-side or workload/backend-side.

Quick Start

Tell the Skill what you see (for example, “profiler pods CrashLoopBackOff” or “CPU profiles arrive but no GPU traces”) and confirm the profiler namespace and Helm release to get a targeted diagnosis and fix path.

Dependency Matrix

Required Modules

None required

Components

scripts

💻 Claude Code Installation

Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.

Please help me install this Skill:
Name: troubleshoot-zymtrace-profiler
Download link: https://github.com/zystem-io/zymtrace-skills/archive/main.zip#troubleshoot-zymtrace-profiler

Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
View Source Repository

Agent Skills Search Helper

Install a tiny helper to your Agent, search and equip skill from 471,000+ vetted skills library on demand.