Assembly Regression Testing

Official

Catch assembly regressions before release.

Author: broadinstitute
Version: 1.0.0
Installs: 0

System Documentation

What problem does it solve?

This Skill validates that changes to the assembly or variant-calling pipeline do not introduce unintended biological or technical differences. It does so by comparing the outputs of the old and new code across real Terra samples.

Core Features & Use Cases

  • End-to-end regression comparison: Pairs assemblies produced by old and new code for the same samples and checks equivalence or improvement.
  • Metric + alignment divergence analysis: Downloads assembly statistics TSVs, aligns FASTAs with mafft, and quantifies SNPs, indels (events and bp), ambiguity diffs, and terminal extensions.
  • Multi-segment genome correctness: Aligns segments independently (by FASTA header when possible) to prevent boundary artifacts from being misclassified as internal indels.
  • Optional VADR quality assessment: Uses dsub to batch-run VADR on divergent assemblies (especially helpful when indels could cause frameshifts and downstream annotation issues).
  • Aggregated visual and markdown reporting: Produces plots and a summary markdown report suitable for PR comments.
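The divergence metrics listed above can be illustrated with a minimal sketch. It assumes the two assemblies for one segment have already been aligned (for example with mafft) into two equal-length rows that use "-" for gaps. The function name compare_aligned and the exact classification rules are assumptions made for illustration, not the Skill's actual implementation.

```python
UNAMBIGUOUS = set("ACGT")


def _base_span(row):
    """Return (start, end) of the region between the first and last non-gap character."""
    start = len(row) - len(row.lstrip("-"))
    end = len(row.rstrip("-"))
    return start, end


def compare_aligned(old, new):
    """Count differences between two aligned rows (hypothetical helper).

    Columns outside the region covered by both rows are treated as terminal
    extensions, so they are never counted as internal indels.
    """
    if len(old) != len(new):
        raise ValueError("aligned rows must have equal length")
    old, new = old.upper(), new.upper()

    o_start, o_end = _base_span(old)
    n_start, n_end = _base_span(new)
    start = max(o_start, n_start)
    end = max(min(o_end, n_end), start)  # guard against an all-gap row

    counts = {
        "snps": 0,
        "indel_events": 0,
        "indel_bp": 0,
        "ambiguity_diffs": 0,
        "terminal_extension_bp": 0,
    }

    # Terminal columns: only one row has sequence there.
    for i in list(range(0, start)) + list(range(end, len(old))):
        if (old[i] == "-") != (new[i] == "-"):
            counts["terminal_extension_bp"] += 1

    prev_gap = None  # which row carried a gap in the previous column
    for i in range(start, end):
        a, b = old[i], new[i]
        if a == "-" and b == "-":
            continue  # shared gap column carries no information
        gap = "old" if a == "-" else "new" if b == "-" else None
        if gap:
            counts["indel_bp"] += 1
            if gap != prev_gap:
                counts["indel_events"] += 1  # a new run of gaps starts here
        elif a != b:
            if a in UNAMBIGUOUS and b in UNAMBIGUOUS:
                counts["snps"] += 1
            else:
                counts["ambiguity_diffs"] += 1  # e.g. N vs. a called base
        prev_gap = gap
    return counts


example = compare_aligned("--ACGTAC-TACGNACGG", "TTACGAACGTA--TACG-")
```

For a multi-segment genome, the same function would be applied to each segment's alignment separately and the counts summed, which keeps segment boundaries from being read as internal indels.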

Quick Start

Give the Skill the old and new Terra submission IDs, then ask it to run discovery, compare all paired samples, and generate the report.
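As a rough illustration of what pairing samples means here, the sketch below matches assemblies from two submissions by sample name. It assumes discovery has already produced one mapping of sample name to assembly path per submission. The function pair_samples and the file paths are hypothetical and are not taken from the Skill's source.

```python
def pair_samples(old_outputs, new_outputs):
    """Pair assemblies by sample name (hypothetical helper).

    Returns the matched pairs plus the samples present in only one submission,
    so unmatched samples can be reported rather than silently dropped.
    """
    shared = sorted(old_outputs.keys() & new_outputs.keys())
    pairs = [(name, old_outputs[name], new_outputs[name]) for name in shared]
    only_old = sorted(old_outputs.keys() - new_outputs.keys())
    only_new = sorted(new_outputs.keys() - old_outputs.keys())
    return pairs, only_old, only_new


# Hypothetical discovery results for two submissions.
old_outputs = {"S1": "old/S1.fasta", "S2": "old/S2.fasta"}
new_outputs = {"S2": "new/S2.fasta", "S3": "new/S3.fasta"}
pairs, only_old, only_new = pair_samples(old_outputs, new_outputs)
```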

Dependency Matrix

Required Modules

None required

Components

Standard package

💻 Claude Code Installation

Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.

Please help me install this Skill:
Name: Assembly Regression Testing
Download link: https://github.com/broadinstitute/viral-ngs/archive/main.zip#assembly-regression-testing

Please download this .zip file, extract it, and install it in the .claude/skills/ directory.