ray-data
CommunityScalable data processing for ML workloads
Authorovachiever
Version1.0.0
Installs0
System Documentation
What problem does it solve?
Ray Data offers scalable, distributed data processing for ML pipelines, enabling streaming, multi-modal data loading, and integration with PyTorch/TensorFlow across CPU/GPU clusters.
Core Features & Use Cases
- Streaming execution and distributed transforms
- Multi-modal data loading (Parquet/CSV/JSON/images)
- Integration with Ray Train, PyTorch, and TensorFlow
- Scales from laptop to large clusters
Quick Start
Read Parquet data, apply a map_batches transformation, and iterate batches.
Dependency Matrix
Required Modules
ray[data]pyarrowpandas
Components
references
💻 Claude Code Installation
Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.
Please help me install this Skill: Name: ray-data Download link: https://github.com/ovachiever/droid-tings/archive/main.zip#ray-data Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
Agent Skills Search Helper
Install a tiny helper to your Agent, search and equip skill from 471,000+ vetted skills library on demand.