ray-data

Community

Scalable data processing for ML workloads

Author: ovachiever
Version: 1.0.0
Installs: 0

System Documentation

What problem does it solve?

Ray Data provides scalable, distributed data processing for ML pipelines. It supports streaming execution, multi-modal data loading, and integration with PyTorch and TensorFlow across CPU and GPU clusters.

Core Features & Use Cases

  • Streaming execution and distributed transforms
  • Multi-modal data loading (Parquet/CSV/JSON/images)
  • Integration with Ray Train, PyTorch, and TensorFlow
  • Scales from laptop to large clusters

Quick Start

Read Parquet data, apply a map_batches transformation, and iterate over the resulting batches.
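
The three Quick Start steps can be sketched roughly as follows. This is a minimal illustration, not the skill's own code: the column name value, the helper names add_doubled_column and run_pipeline, and any file path passed in are hypothetical. It assumes Ray Data's default batch format, in which each batch is a dict mapping column names to NumPy arrays.

```python
import numpy as np


def add_doubled_column(batch: dict) -> dict:
    """Batch transform: add a column holding twice the 'value' column.

    Assumes the batch is a dict of column name -> NumPy array.
    The column name 'value' is hypothetical.
    """
    batch["value_doubled"] = np.asarray(batch["value"]) * 2
    return batch


def run_pipeline(parquet_path: str, batch_size: int = 1024) -> int:
    """Read Parquet, transform each batch, iterate, and return the row count."""
    # Imported lazily so the transform above can be tested without Ray installed.
    import ray

    # Step 1: build a dataset from a Parquet file or directory.
    ds = ray.data.read_parquet(parquet_path)

    # Step 2: apply the transform batch by batch.
    ds = ds.map_batches(add_doubled_column)

    # Step 3: iterate over the transformed batches.
    rows_seen = 0
    for batch in ds.iter_batches(batch_size=batch_size):
        rows_seen += len(batch["value_doubled"])
    return rows_seen
```

Calling run_pipeline with the path to a Parquet file that has a numeric value column should return the number of rows processed. Keeping the transform as a plain function makes it easy to check on a small in-memory batch before running it on a cluster.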

Dependency Matrix

Required Modules

ray[data], pyarrow, pandas

Components

references

💻 Claude Code Installation

Recommended: Let Claude install it automatically. Copy the text below and paste it into Claude Code.

Please help me install this Skill:
Name: ray-data
Download link: https://github.com/ovachiever/droid-tings/archive/main.zip#ray-data

Please download this .zip file, extract it, and install it in the .claude/skills/ directory.