alterlab-geniml
CommunityCreate genomic embeddings to accelerate analyses.
Data & Analytics#embeddings#tokenization#scembed#bedspace#region2vec#genomic-intervals#universe-building
AuthorAlterLab-IEU
Version1.0.0
Installs0
System Documentation
What problem does it solve?
Geniml brings a complete toolkit to build unsupervised embeddings from genomic interval data, enabling similarity searches, clustering, and downstream ML analyses across BED files, scATAC-seq data, and consensus peak sets.
Core Features & Use Cases
- Region2Vec: learn embeddings for genomic regions to reduce dimensionality and enable region-level analyses.
- BEDspace: jointly embed regions and metadata labels for metadata-aware searches across regions and labels.
- scEmbed: generate cell embeddings from scATAC-seq data for clustering and annotation.
- Universe building: construct consensus peak universes to standardize tokenization references.
- Utilities: caching, randomization, evaluation, tokenization, and search backends for reproducible pipelines.
Use cases include clustering cells, performing similarity queries across datasets, and building tokenization universes for cross-project analyses.
Quick Start
Run Geniml with a prepared universe and tokenized BED files to train embeddings and evaluate them on a sample metadata file.
Dependency Matrix
Required Modules
None requiredComponents
references
💻 Claude Code Installation
Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.
Please help me install this Skill: Name: alterlab-geniml Download link: https://github.com/AlterLab-IEU/AlterLab-Academic-Skills/archive/main.zip#alterlab-geniml Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
Agent Skills Search Helper
Install a tiny helper to your Agent, search and equip skill from 471,000+ vetted skills library on demand.