Search and Explore AWS Open Data
MCP server that searches and retrieves datasets from the AWS Open Data Registry with fuzzy matching and automatic caching for fast lookups.
Why it matters
Discover datasets in the AWS Open Data Registry and retrieve their details without browsing the registry by hand. Fuzzy search handles inexact queries, and a cached copy of the registry keeps lookups fast.
Outcomes
What it gets done
Search datasets by name, description, or tags using fuzzy matching.
Retrieve detailed information for specific datasets.
Leverage automatic caching for faster query responses.
Explore datasets from the AWS Open Data Registry.
Install
Add it to your toolbox
Run in your project directory:
curl -fsSL https://spark.entire.vc/get/vb-aws-open-data | bash
Capabilities
Tools your agent gets
Search datasets by name, description, or tags with fuzzy search.
Get detailed information about a specific dataset by its ID (the registry file name without the .yaml extension).
Overview
AWS Open Data MCP Server
What it does
An MCP server providing programmatic access to the AWS Open Data Registry through search and retrieval tools.
How it connects
Use when you need to search for public datasets on AWS by keyword or retrieve detailed metadata for specific datasets without manual registry navigation.
Source README
AWS Open Data Registry MCP Server
An MCP server that allows searching and exploring datasets from the AWS Open Data Registry.
Features
- search_datasets: Search for datasets by name, description, or tags with fuzzy matching
- get_dataset: Get detailed information about a specific dataset
- Automatic caching of the AWS Open Data Registry for fast lookups
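The README does not say how the cache is implemented. As a rough illustration of the idea only, the sketch below wraps a loader so that the registry is reloaded at most once per time window. The names createCachedLoader, ttlMs, and now are hypothetical and are not taken from the server's code.

```typescript
// Sketch only: a time-based cache around any loader function.
// All names here are hypothetical, not the server's actual API.
function createCachedLoader<T>(
  load: () => T,
  ttlMs: number,
  now: () => number = Date.now, // injectable clock, handy for testing
): () => T {
  let entry: { value: T; loadedAt: number } | undefined;

  return () => {
    const t = now();
    // Reload when nothing is cached yet or the cached value has expired.
    if (entry === undefined || t - entry.loadedAt >= ttlMs) {
      entry = { value: load(), loadedAt: t };
    }
    return entry.value;
  };
}
```

If load returns a promise, the promise itself is cached, so concurrent callers share one request. A rejected promise would also stay cached until the window expires, which a real implementation would likely want to handle.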
Installation
npm install
Usage
npm start
The server will be available at http://localhost:3000/mcp using the streamable HTTP transport. You can install it in Claude Code with:
claude mcp add aws-open-data --transport http http://localhost:3000/mcp
You'll then need to restart Claude Code to pick up the new MCP server.
Tools
search_datasets
Search for datasets matching a query. If no query is provided, returns all datasets.
Arguments:
- query (string, optional): Search query to match against dataset names, descriptions, and tags. Empty query returns all datasets
- limit (number, optional): Maximum number of results to return (default: 25)
- detail (enum, optional): Detail level - 'nameOnly' (just names), 'minimal' (name and description), or 'full' (complete dataset info). Default: 'minimal'
Example:
{
"query": "satellite",
"limit": 5,
"detail": "minimal"
}
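To make the interplay of query, limit, and detail concrete, here is a minimal sketch of how such a search could behave. It is not the server's implementation. The README does not name its fuzzy matching algorithm, so a per-word edit-distance check stands in for it, and the Dataset shape, the tolerance rule, and all function names are assumptions.

```typescript
type Detail = "nameOnly" | "minimal" | "full";

// Hypothetical record shape; the real registry entries carry more fields.
interface Dataset {
  id: string;
  name: string;
  description: string;
  tags: string[];
}

// Classic Levenshtein distance, computed one row at a time.
function editDistance(a: string, b: string): number {
  let prev = Array.from({ length: b.length + 1 }, (_, j) => j);
  for (let i = 1; i <= a.length; i++) {
    const curr = [i];
    for (let j = 1; j <= b.length; j++) {
      const cost = a[i - 1] === b[j - 1] ? 0 : 1;
      curr.push(Math.min(prev[j] + 1, curr[j - 1] + 1, prev[j - 1] + cost));
    }
    prev = curr;
  }
  return prev[b.length];
}

// Every query term must appear as a substring or be close to some word.
function fuzzyMatches(query: string, dataset: Dataset): boolean {
  const terms = query.toLowerCase().split(/\s+/).filter((t) => t !== "");
  if (terms.length === 0) return true; // empty query matches everything
  const text = [dataset.name, dataset.description, ...dataset.tags]
    .join(" ")
    .toLowerCase();
  const words = text.split(/[^a-z0-9]+/).filter((w) => w !== "");
  return terms.every((term) => {
    if (text.includes(term)) return true;
    const maxEdits = Math.floor(term.length / 4); // assumed tolerance
    return words.some((word) => editDistance(term, word) <= maxEdits);
  });
}

// Trim a record down to the requested detail level.
function project(dataset: Dataset, detail: Detail): Partial<Dataset> {
  if (detail === "nameOnly") return { name: dataset.name };
  if (detail === "minimal") {
    return { name: dataset.name, description: dataset.description };
  }
  return dataset;
}

// Applies the documented defaults: limit 25, detail "minimal".
function searchDatasets(
  all: Dataset[],
  args: { query?: string; limit?: number; detail?: Detail } = {},
): Partial<Dataset>[] {
  const { query = "", limit = 25, detail = "minimal" } = args;
  return all
    .filter((d) => fuzzyMatches(query, d))
    .slice(0, limit)
    .map((d) => project(d, detail));
}
```

With this sketch, a misspelled query such as "satelite" still finds a record whose description contains "satellite", while calling searchDatasets with no arguments returns up to 25 records containing only a name and description.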
get_dataset
Get detailed information about a specific dataset.
Arguments:
id(string, required): The ID of the dataset (without .yaml extension)
Example:
{
"id": "sentinel-1"
}
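When calling the server directly instead of through Claude Code, the arguments above travel inside an MCP tools/call request, which is a JSON-RPC 2.0 message. The sketch below only builds that message. It does not perform the initialization handshake or the HTTP exchange, which an MCP client library would normally handle. The helpers buildToolCall and toDatasetId are hypothetical names introduced for illustration.

```typescript
// JSON-RPC 2.0 envelope for an MCP tool invocation.
interface ToolCallRequest {
  jsonrpc: "2.0";
  id: number;
  method: "tools/call";
  params: { name: string; arguments: Record<string, unknown> };
}

// Hypothetical helper: wraps a tool name and its arguments in the envelope.
function buildToolCall(
  id: number,
  name: string,
  args: Record<string, unknown>,
): ToolCallRequest {
  return {
    jsonrpc: "2.0",
    id,
    method: "tools/call",
    params: { name, arguments: args },
  };
}

// Hypothetical helper: get_dataset expects the ID without ".yaml".
function toDatasetId(fileName: string): string {
  return fileName.replace(/\.yaml$/i, "");
}

const getDatasetRequest = buildToolCall(1, "get_dataset", {
  id: toDatasetId("sentinel-1.yaml"),
});
```

The same buildToolCall helper works for search_datasets by passing that tool's name and its query, limit, and detail arguments.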