0170-similarity-search-patterns
CommunityBuild fast, scalable semantic retrieval.
Software Engineering#vector database#RAG#hybrid search#similarity search#semantic retrieval#vector indexing#ANN
AuthorMrJmpl3
Version1.0.0
Installs0
System Documentation
What problem does it solve?
Similarity search systems struggle to balance relevance, latency, and cost when retrieving the nearest vectors from large embedding collections.
Core Features & Use Cases
- Distance & Metric Selection: Choose cosine, L2, dot product, or L1 based on embedding characteristics and desired scoring behavior.
- Index Strategy: Select the right index type (flat exact, HNSW graph-based, IVF+PQ quantized) to trade off recall, speed, and memory.
- Operational Best Practices: Tune retrieval parameters, implement hybrid (vector + keyword) search, pre-filter candidates, and continuously monitor recall and tail latency (P99).
- Use Case: Implement semantic search for RAG by retrieving top-k relevant chunks, optionally with reranking to improve final answer quality.
Quick Start
Ask an AI to generate a production-ready similarity search module using the included Pinecone, Qdrant, pgvector, or Weaviate templates, tuned for cosine distance and hybrid retrieval.
Dependency Matrix
Required Modules
None requiredComponents
Standard package💻 Claude Code Installation
Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.
Please help me install this Skill: Name: 0170-similarity-search-patterns Download link: https://github.com/MrJmpl3/codex_____data_____configuration/archive/main.zip#0170-similarity-search-patterns Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
Agent Skills Search Helper
Install a tiny helper to your Agent, search and equip skill from 471,000+ vetted skills library on demand.