finding-data-lake-assets

Community

Resolve data lake tables to real assets

Authormreferre
Version1.0.0
Installs0

System Documentation

What problem does it solve?

This Skill resolves ambiguous data lake or lakehouse asset references into concrete Glue, S3, S3 Tables, and Redshift catalog entries so you can quickly identify the right dataset and where it lives.

Core Features & Use Cases

  • Guided asset resolution: Turns a table name, keyword, column name, or S3 path into a specific table reference with format and location where available.
  • Layered discovery with early stopping: Searches Glue first for high-confidence matches, then uses S3 reverse lookup or Redshift only when appropriate to minimize latency and token usage.
  • Confidence-gated disambiguation: Returns immediately on high confidence, lists top candidates for medium confidence, and reports what was searched or skipped when results are weak or absent.

Use case example: You say “what is our customer_id dataset in us-east-1?” and the Skill finds the best matching Glue table (or S3/S3 Tables/Redshift counterpart when relevant) and returns the resolved reference plus the sources searched/skipped.

Quick Start

Ask: "Resolve customer_id data in us-east-1 and tell me the matching Glue/S3/S3 Tables/Redshift table reference and location."

Dependency Matrix

Required Modules

None required

Components

references

💻 Claude Code Installation

Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.

Please help me install this Skill:
Name: finding-data-lake-assets
Download link: https://github.com/mreferre/aws-agent-toolkit-skills/archive/main.zip#finding-data-lake-assets

Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
View Source Repository

Agent Skills Search Helper

Install a tiny helper to your Agent, search and equip skill from 471,000+ vetted skills library on demand.