qwen_training_data_miner_prototype

Community

Prototype miner that extracts domain-specific training data.

Author: Foundup
Version: 1.0.0
Installs: 0

System Documentation

What problem does it solve?

Mine 012.txt to generate high-quality, domain-specific training examples for Gemma models, accelerating instruction-tuning workflows.

Core Features & Use Cases

  • Extracts domain patterns from 012.txt (mps_scoring, wsp_application, roadmap_analysis, readme_patterns, modlog_updates, first_principles)
  • Produces instruction-tuning datasets and pattern summaries for downstream training
  • Supports configurable inputs (source_file, domain, pattern_type, min_examples) and outputs for traceability
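The four configurable inputs listed above could be grouped and validated as in the following minimal sketch. The class name `MinerConfig`, the default values, and the choice to treat the listed pattern names as valid `domain` values are assumptions made for illustration; the listing does not document the skill's actual interface.

```python
from dataclasses import dataclass

# Pattern names copied from the feature list above. Treating them as the
# allowed values of `domain` is an assumption.
KNOWN_DOMAINS = (
    "mps_scoring",
    "wsp_application",
    "roadmap_analysis",
    "readme_patterns",
    "modlog_updates",
    "first_principles",
)


@dataclass(frozen=True)
class MinerConfig:
    """Hypothetical container for the four documented inputs."""

    source_file: str = "012.txt"
    domain: str = "mps_scoring"
    pattern_type: str = "instruction"  # assumed value, not documented
    min_examples: int = 10  # assumed default, not documented

    def __post_init__(self):
        # Reject domains that the feature list does not mention.
        if self.domain not in KNOWN_DOMAINS:
            raise ValueError(f"unknown domain: {self.domain!r}")
        if self.min_examples < 1:
            raise ValueError("min_examples must be at least 1")


config = MinerConfig(domain="roadmap_analysis", min_examples=5)
```

Validating the inputs up front keeps a mistyped domain from silently producing an empty dataset, and the frozen dataclass makes the configuration safe to log alongside the outputs for traceability.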

Quick Start

Load the 012.txt file and run the miner for the target domain to generate a training dataset and a domain pattern summary.
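As a rough illustration of that flow, the sketch below scans lines of text for one domain and returns instruction-style records plus a small summary. The function names `mine_lines` and `to_jsonl`, the keyword table, and the record fields are all hypothetical; the skill's real matching rules and output schema are not described in this listing.

```python
import json

# Hypothetical keyword per domain, used only for this illustration.
DOMAIN_KEYWORDS = {
    "mps_scoring": "mps",
    "wsp_application": "wsp",
    "roadmap_analysis": "roadmap",
}


def mine_lines(lines, domain, min_examples=1):
    """Return (examples, summary) for lines that mention the domain keyword."""
    keyword = DOMAIN_KEYWORDS[domain]
    examples = []
    for number, line in enumerate(lines, start=1):
        text = line.strip()
        if text and keyword in text.lower():
            examples.append({
                "instruction": f"Explain this {domain} passage.",
                "input": text,
                "source_line": number,  # kept for traceability
            })
    if len(examples) < min_examples:
        raise ValueError(
            f"found {len(examples)} examples, need at least {min_examples}"
        )
    summary = {"domain": domain, "example_count": len(examples)}
    return examples, summary


def to_jsonl(examples):
    """Serialize the records one JSON object per line."""
    return "\n".join(json.dumps(example) for example in examples)
```

In practice the `lines` argument would come from reading 012.txt, for example with `open("012.txt", encoding="utf-8").read().splitlines()`. The records carry no response field here; filling it in is left to the downstream training workflow.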

Dependency Matrix

Required Modules

pattern_memory
libido_monitor

Components

Standard package

💻 Claude Code Installation

Recommended: let Claude install it automatically by copying the text below and pasting it into Claude Code.

Please help me install this Skill:
Name: qwen_training_data_miner_prototype
Download link: https://github.com/Foundup/Foundups-Agent/archive/main.zip#qwen-training-data-miner-prototype

Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
