paper-writing-bench
CommunityTurn papers into end-to-end benchmark artifacts.
Education & Research#nlp#benchmark#experimentation#reverse-engineer#llm-pipeline#paper-writing#academic-pipelines
AuthorAr9av
Version1.0.0
Installs0
System Documentation
What problem does it solve?
PaperWritingBench provides a structured approach to reverse-engineer high-quality I/E materials from an existing AI research paper to create a repeatable benchmark for evaluating paper-writing pipelines.
Core Features & Use Cases
- Outputs three artifacts: idea_sparse.md, idea_dense.md, and experimental_log.md derived from a paper to serve as end-to-end benchmarks.
- Replicates the PaperWritingBench data construction process with anonymization, prompts, and deterministic scripts; applicable to any new paper to benchmark pipeline performance and autoraters.
Quick Start
Reverse-engineer the benchmark case from the provided paper to produce idea_sparse.md, idea_dense.md, and experimental_log.md.
Dependency Matrix
Required Modules
None requiredComponents
references
💻 Claude Code Installation
Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.
Please help me install this Skill: Name: paper-writing-bench Download link: https://github.com/Ar9av/PaperOrchestra/archive/main.zip#paper-writing-bench Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
Agent Skills Search Helper
Install a tiny helper to your Agent, search and equip skill from 471,000+ vetted skills library on demand.