MCP

Enable Persistent Memory for AI Agents

Name: Enable Persistent Memory for AI Agents
Availability: OnlineOnly
Author: Topoteretes

Cognee is an open-source AI memory platform that gives AI agents persistent long-term memory by ingesting data and building a self-hosted knowledge graph with

Connect

Works with openai postgresneo4jchromadb

Topoteretes

Maintainer?

Spark score

out of 100

Updated 2 days ago

Version 1.2.1

Models

claude

Add to Favorites

Why it matters

Integrate persistent memory capabilities into your AI agents using a knowledge graph. This asset provides a GraphRAG memory server that enhances AI agent functionality with robust data storage and retrieval.

Outcomes

What it gets done

Provide persistent memory for AI agents via knowledge graph storage.

Enable efficient data retrieval for AI agent applications.

Support multiple transport modes including HTTP and SSE.

Facilitate local file imports for memory enrichment.

Install

Add it to your toolbox

Run in your project directory:

curl -fsSL https://spark.entire.vc/get/vb-cognee-mcp | bash

Capabilities

Tools your agent gets

add_data

Add data to the knowledge graph for memory storage and retrieval.

cognify

Process and structure data into knowledge graph nodes and relationships.

codify

Generate code based on knowledge graph context and developer rules.

query

Query the knowledge graph to retrieve relevant memory and context.

prune

Clear and reset the entire memory and knowledge graph.

import_file

Import local files (.md, source code, rules) into the knowledge graph.

get_pipeline_status

Check the status of running cognify and codify background tasks.

Overview

cognee-mcp server

What it does

Cognee is an open-source AI memory platform that provides persistent long-term memory for AI agents through a self-hosted knowledge graph combining vector embeddings and graph reasoning.

How it connects

Use Cognee when you need AI agents to maintain context across sessions, build a company knowledge base from various data sources, or enable agents to recall and connect information through relationship reasoning rather than losing context after each interaction.

Source README

Cognee - The Open-Source AI Memory Platform for Agents

Demo . Docs . Learn More · Join Discord · Join r/AIMemory . Community Plugins & Add-ons

Cognee is the open-source AI memory platform that gives AI agents persistent long-term memory across sessions. Ingest data in any format, build a self-hosted knowledge graph, and let every agent recall, connect, and act with full context

📄 Read the research paper: Optimizing the Interface Between Knowledge Graphs and LLMs for Complex Reasoning - Markovic et al., 2025

About Cognee

Cognee is an open-source AI memory platform for AI Agents. Ingest data in any format, and Cognee continuously builds a self-hosted knowledge graph that gives your agents persistent long-term memory across sessions. Cognee combines vector embeddings, graph reasoning, and cognitive-science-grounded ontology generation to make documents both searchable by meaning and connected by relationships that evolve as your knowledge does.

:star: Help us reach more developers and grow the cognee community. Star this repo!

:books: Check our detailed documentation for setup and configuration.

:crab: Available as a plugin for your OpenClaw - cognee-openclaw

✴️ Available as a plugin for your Claude Code - claude-code-plugin

Why use Cognee:

Easily Build Company Brain - unify data from various sources in one place and enable Agents with your domain knowledge
Knowledge infrastructure - unified ingestion, graph/vector search, runs locally, ontology grounding, multimodal
Persistent and Learning Agents - learn from feedback, context management, cross-agent knowledge sharing
Reliable and Trustworthy Agents - agentic user/tenant isolation, traceability, OTEL collector, audit traits

Product Features

Cognee Products

Basic Usage & Feature Guide

To learn more, check out this short, end-to-end Colab walkthrough of Cognee's core features.

Quickstart

Let’s try Cognee in just a few lines of code.

Prerequisites

Python 3.10 to 3.14

Step 1: Install Cognee

You can install Cognee with pip, poetry, uv, or your preferred Python package manager.

uv pip install cognee

Step 2: Configure the LLM

import os
os.environ["LLM_API_KEY"] = "YOUR OPENAI_API_KEY"

Alternatively, create a .env file using our template.

To integrate other LLM providers, see our LLM Provider Documentation.

Step 3: Run the Pipeline

Cognee's API gives you four operations - remember, recall, forget, and improve:

import cognee
import asyncio


async def main():
    # Store permanently in the knowledge graph (runs add + cognify + improve)
    await cognee.remember("Cognee turns documents into AI memory.")

    # Store in session memory (fast cache, syncs to graph in background)
    await cognee.remember("User prefers detailed explanations.", session_id="chat_1")

    # Query with auto-routing (picks best search strategy automatically)
    results = await cognee.recall("What does Cognee do?")
    for result in results:
        print(result)

    # Query session memory first, fall through to graph if needed
    results = await cognee.recall("What does the user prefer?", session_id="chat_1")
    for result in results:
        print(result)

    # Delete when done
    await cognee.forget(dataset="main_dataset")


if __name__ == '__main__':
    asyncio.run(main())

Use the Cognee CLI

cognee-cli remember "Cognee turns documents into AI memory."

cognee-cli recall "What does Cognee do?"

cognee-cli forget --all

To open the local UI, run:

cognee-cli -ui

Note: The MCP server launched by cognee-cli -ui runs inside a Docker container.
Docker Desktop, Colima, or any OCI-compatible runtime with a working docker CLI is
required. See Docker & Colima Setup for details.

Run with Docker

Prefer containers? Cognee publishes prebuilt images to Docker Hub on every push to main:
cognee/cognee (the API server) and
cognee/cognee-mcp (the MCP server).

Option A - Docker Compose (build from source)

Clone the repo, create a .env with at least LLM_API_KEY, then:

cp .env.template .env   # then edit .env and set LLM_API_KEY

# Start the API server (http://localhost:8000)
docker compose up

# Optional profiles (combine as needed):
docker compose --profile ui up        # + frontend on http://localhost:3000
docker compose --profile mcp up       # + MCP server on http://localhost:8001
docker compose --profile postgres up  # + Postgres/PGVector
docker compose --profile neo4j up     # + Neo4j

The cognee and cognee-mcp services publish different host ports (8000 vs 8001),
so you can run both at once.

Option B - Pull the prebuilt image (no clone required)

# Create a minimal .env in the current directory
echo 'LLM_API_KEY="YOUR_OPENAI_API_KEY"' > .env

# API server
docker run --env-file ./.env -p 8000:8000 --rm -it cognee/cognee:main

# MCP server (HTTP transport)
docker pull cognee/cognee-mcp:main
docker run -e TRANSPORT_MODE=http --env-file ./.env -p 8000:8000 --rm -it cognee/cognee-mcp:main

See the MCP server README for SSE/stdio transports, optional
extras, and MCP client configuration.

Use with AI Agents

Claude Code

Install the Cognee memory plugin to give Claude Code persistent memory across sessions. The plugin automatically captures tool calls into session memory via hooks and syncs to the permanent knowledge graph at session end.

Setup:

# Install cognee
pip install cognee

# Configure
export LLM_API_KEY="your-openai-key"

# Clone the plugin
git clone https://github.com/topoteretes/cognee-integrations.git

# Enable it (add to ~/.zshrc for permanent use)
claude --plugin-dir ./cognee-integrations/integrations/claude-code

Or connect to Cognee Cloud instead of running locally:

export COGNEE_SERVICE_URL="https://your-instance.cognee.ai"
export COGNEE_API_KEY="ck_..."

The plugin hooks into Claude Code's lifecycle - SessionStart initializes memory, PostToolUse captures actions, UserPromptSubmit injects relevant context, PreCompact preserves memory across context resets, and SessionEnd bridges session data into the permanent graph.

Connect to Cognee Cloud

Point any Python agent at a managed Cognee instance - all SDK calls route to the cloud:

import cognee

await cognee.serve(url="https://your-instance.cognee.ai", api_key="ck_...")

await cognee.remember("important context")
results = await cognee.recall("what happened?")

await cognee.disconnect()

Examples

Browse more examples in the examples/ folder - demos, guides, custom pipelines, and database configurations.

Use Case 1 - Customer Support Agent

Goal: Resolve customer issues using their personal data across finance, support, and product history.

User: "My invoice looks wrong and the issue is still not resolved."

Cognee tracks: past interactions, failed actions, resolved cases, product history

# Agent response:
Agent: "I found 2 similar billing cases resolved last month.
        The issue was caused by a sync delay between payment
        and invoice systems - a fix was applied on your account."

# What happens under the hood:
- Unifies data sources from various company channels
- Reconstructs the interaction timeline and tracks outcomes
- Retrieves similar resolved cases
- Maps to the best resolution strategy
- Updates memory after execution so the agent never repeats the same mistake

Use Case 2 - Expert Knowledge Distillation (SQL Copilot)

Goal: Help junior analysts solve tasks by reusing expert-level queries, patterns, and reasoning.

User: "How do I calculate customer retention for this dataset?"

Cognee tracks: expert SQL queries, workflow patterns, schema structures, successful implementations

# Agent response:
Agent: "Here's how senior analysts solved a similar retention query.
        Cognee matched your schema to a known structure and adapted
        the expert's logic to fit your dataset."

# What happens under the hood:
- Extracts and stores patterns from expert SQL queries and workflows
- Maps the current schema to previously seen structures
- Retrieves similar tasks and their successful implementations
- Adapts expert reasoning to the current context
- Updates memory with new successful patterns so junior analysts perform at near-expert level

Deploy Cognee

Use Cognee Cloud for a fully managed experience, or self-host with one of the 1-click deployment configurations below.

Platform	Best For	Command
Cognee Cloud	Managed service, no infrastructure to maintain	Sign up or `await cognee.serve()`
Modal	Serverless, auto-scaling, GPU workloads	`bash distributed/deploy/modal-deploy.sh`
Railway	Simplest PaaS, native Postgres	`railway init && railway up`
Fly.io	Edge deployment, persistent volumes	`bash distributed/deploy/fly-deploy.sh`
Render	Simple PaaS with managed Postgres	Deploy to Render button
Daytona	Cloud sandboxes (SDK or CLI)	See `distributed/deploy/daytona_sandbox.py`

See the distributed/ folder for deploy scripts, worker configurations, and additional details.

Latest News

Community & Support

Research & Citation

We recently published a research paper on optimizing knowledge graphs for LLM reasoning:

@misc{markovic2025optimizinginterfaceknowledgegraphs,
      title={Optimizing the Interface Between Knowledge Graphs and LLMs for Complex Reasoning},
      author={Vasilije Markovic and Lazar Obradovic and Laszlo Hajdu and Jovan Pavlovic},
      year={2025},
      eprint={2505.24478},
      archivePrefix={arXiv},
      primaryClass={cs.AI},
      url={https://arxiv.org/abs/2505.24478},
}

Discussion