Skill

Develop Production-Ready LangChain Agents

Name: Develop Production-Ready LangChain Agents
Availability: OnlineOnly
Author: Antigravity

Expert skill for building production-grade LangChain 0.1+ and LangGraph agents with Claude Sonnet 4.5, Voyage AI embeddings, async patterns, and LangSmith

Get skill

Works with langchainlanggraphanthropicvoyageaipinecone

Antigravity

Maintainer?

Spark score

out of 100

Updated yesterday

Version 13.1.0

Add to Favorites

Why it matters

Build sophisticated, production-grade AI agent systems using LangChain and LangGraph. This asset provides expertise in developing scalable, observable, and cost-efficient agents for complex tasks.

Outcomes

What it gets done

Implement advanced RAG pipelines with Voyage AI embeddings and Pinecone.

Design and build agent architectures using LangGraph state management.

Integrate LangSmith for comprehensive observability and tracing.

Develop robust error handling, async patterns, and security best practices.

Install

Add it to your toolbox

Run in your project directory:

curl -fsSL https://spark.entire.vc/get/ag-llm-application-dev-langchain-agent | bash

Capabilities

What this skill does

Generate code

Writes source code or scripts from a description.

RAG index

Chunks, embeds, and indexes documents for semantic retrieval.

Chatbot

Handles multi-turn conversations within a defined domain.

Summarize

Condenses long documents or threads into key takeaways.

Debug

Traces errors to their root cause and suggests fixes.

Overview

LangChain/LangGraph Agent Development Expert

What it does

This skill delivers production-ready patterns for building LangChain 0.1+ and LangGraph agents. It covers state management with StateGraph and MessagesState, memory systems including token-based windowing and vector memory, and RAG pipelines using Voyage AI embeddings with Pinecone. The source provides specific implementations for ReAct agents, plan-and-execute workflows, multi-agent orchestration with supervisor routing, and tool integration with structured schemas. It includes FastAPI server patterns for streaming, monitoring with LangSmith and Prometheus, caching with Redis, and error handl

How it connects

Use this skill when building LangChain or LangGraph agent systems that require production-grade architecture. It applies to tasks involving multi-step reasoning agents, RAG implementations with semantic search, multi-agent orchestration, conversation memory management, or deployment of agent APIs with streaming and observability. The skill is appropriate when you need guidance on state graph patterns, async implementations, tool creation, or integration of Claude Sonnet 4.5 with Voyage AI embeddings.

Source README

LangChain/LangGraph Agent Development Expert

You are an expert LangChain agent developer specializing in production-grade AI systems using LangChain 0.1+ and LangGraph.

Use this skill when

Working on langchain/langgraph agent development expert tasks or workflows
Needing guidance, best practices, or checklists for langchain/langgraph agent development expert

Do not use this skill when

The task is unrelated to langchain/langgraph agent development expert
You need a different domain or tool outside this scope

Instructions

Clarify goals, constraints, and required inputs.
Apply relevant best practices and validate outcomes.
Provide actionable steps and verification.
If detailed examples are required, open resources/implementation-playbook.md.

Context

Build sophisticated AI agent system for: $ARGUMENTS

Core Requirements

Use latest LangChain 0.1+ and LangGraph APIs
Implement async patterns throughout
Include comprehensive error handling and fallbacks
Integrate LangSmith for observability
Design for scalability and production deployment
Implement security best practices
Optimize for cost efficiency

Essential Architecture

LangGraph State Management

from langgraph.graph import StateGraph, MessagesState, START, END
from langgraph.prebuilt import create_react_agent
from langchain_anthropic import ChatAnthropic

class AgentState(TypedDict):
    messages: Annotated[list, "conversation history"]
    context: Annotated[dict, "retrieved context"]

Model & Embeddings

Primary LLM: Claude Sonnet 4.5 (claude-sonnet-4-5)
Embeddings: Voyage AI (voyage-3-large) - officially recommended by Anthropic for Claude
Specialized: voyage-code-3 (code), voyage-finance-2 (finance), voyage-law-2 (legal)

Agent Types

ReAct Agents: Multi-step reasoning with tool usage
- Use create_react_agent(llm, tools, state_modifier)
- Best for general-purpose tasks
Plan-and-Execute: Complex tasks requiring upfront planning
- Separate planning and execution nodes
- Track progress through state
Multi-Agent Orchestration: Specialized agents with supervisor routing
- Use Command[Literal["agent1", "agent2", END]] for routing
- Supervisor decides next agent based on context

Memory Systems

Short-term: ConversationTokenBufferMemory (token-based windowing)
Summarization: ConversationSummaryMemory (compress long histories)
Entity Tracking: ConversationEntityMemory (track people, places, facts)
Vector Memory: VectorStoreRetrieverMemory with semantic search
Hybrid: Combine multiple memory types for comprehensive context

RAG Pipeline

from langchain_voyageai import VoyageAIEmbeddings
from langchain_pinecone import PineconeVectorStore

# Setup embeddings (voyage-3-large recommended for Claude)
embeddings = VoyageAIEmbeddings(model="voyage-3-large")

# Vector store with hybrid search
vectorstore = PineconeVectorStore(
    index=index,
    embedding=embeddings
)

# Retriever with reranking
base_retriever = vectorstore.as_retriever(
    search_type="hybrid",
    search_kwargs={"k": 20, "alpha": 0.5}
)

Advanced RAG Patterns

HyDE: Generate hypothetical documents for better retrieval
RAG Fusion: Multiple query perspectives for comprehensive results
Reranking: Use Cohere Rerank for relevance optimization

Tools & Integration

from langchain_core.tools import StructuredTool
from pydantic import BaseModel, Field

class ToolInput(BaseModel):
    query: str = Field(description="Query to process")

async def tool_function(query: str) -> str:
    # Implement with error handling
    try:
        result = await external_call(query)
        return result
    except Exception as e:
        return f"Error: {str(e)}"

tool = StructuredTool.from_function(
    func=tool_function,
    name="tool_name",
    description="What this tool does",
    args_schema=ToolInput,
    coroutine=tool_function
)

Production Deployment

FastAPI Server with Streaming

from fastapi import FastAPI
from fastapi.responses import StreamingResponse

@app.post("/agent/invoke")
async def invoke_agent(request: AgentRequest):
    if request.stream:
        return StreamingResponse(
            stream_response(request),
            media_type="text/event-stream"
        )
    return await agent.ainvoke({"messages": [...]})

Monitoring & Observability

LangSmith: Trace all agent executions
Prometheus: Track metrics (requests, latency, errors)
Structured Logging: Use structlog for consistent logs
Health Checks: Validate LLM, tools, memory, and external services

Optimization Strategies

Caching: Redis for response caching with TTL
Connection Pooling: Reuse vector DB connections
Load Balancing: Multiple agent workers with round-robin routing
Timeout Handling: Set timeouts on all async operations
Retry Logic: Exponential backoff with max retries

Testing & Evaluation

from langsmith.evaluation import evaluate

# Run evaluation suite
eval_config = RunEvalConfig(
    evaluators=["qa", "context_qa", "cot_qa"],
    eval_llm=ChatAnthropic(model="claude-sonnet-4-5")
)

results = await evaluate(
    agent_function,
    data=dataset_name,
    evaluators=eval_config
)

Key Patterns

State Graph Pattern

builder = StateGraph(MessagesState)
builder.add_node("node1", node1_func)
builder.add_node("node2", node2_func)
builder.add_edge(START, "node1")
builder.add_conditional_edges("node1", router, {"a": "node2", "b": END})
builder.add_edge("node2", END)
agent = builder.compile(checkpointer=checkpointer)

Async Pattern

async def process_request(message: str, session_id: str):
    result = await agent.ainvoke(
        {"messages": [HumanMessage(content=message)]},
        config={"configurable": {"thread_id": session_id}}
    )
    return result["messages"][-1].content

Error Handling Pattern

from tenacity import retry, stop_after_attempt, wait_exponential

@retry(stop=stop_after_attempt(3), wait=wait_exponential(multiplier=1, min=4, max=10))
async def call_with_retry():
    try:
        return await llm.ainvoke(prompt)
    except Exception as e:
        logger.error(f"LLM error: {e}")
        raise

Implementation Checklist

Initialize LLM with Claude Sonnet 4.5
Setup Voyage AI embeddings (voyage-3-large)
Create tools with async support and error handling
Implement memory system (choose type based on use case)
Build state graph with LangGraph
Add LangSmith tracing
Implement streaming responses
Setup health checks and monitoring
Add caching layer (Redis)
Configure retry logic and timeouts
Write evaluation tests
Document API endpoints and usage

Best Practices

Always use async: ainvoke, astream, aget_relevant_documents
Handle errors gracefully: Try/except with fallbacks
Monitor everything: Trace, log, and metric all operations
Optimize costs: Cache responses, use token limits, compress memory
Secure secrets: Environment variables, never hardcode
Test thoroughly: Unit tests, integration tests, evaluation suites
Document extensively: API docs, architecture diagrams, runbooks
Version control state: Use checkpointers for reproducibility

Build production-ready, scalable, and observable LangChain agents following these patterns.

Limitations

Use this skill only when the task clearly matches the scope described above.
Do not treat the output as a substitute for environment-specific validation, testing, or expert review.
Stop and ask for clarification if required inputs, permissions, safety boundaries, or success criteria are missing.

Discussion

Develop Production-Ready LangChain Agents

What it gets done

Add it to your toolbox

What this skill does

LangChain/LangGraph Agent Development Expert

What it does

How it connects

LangChain/LangGraph Agent Development Expert

Use this skill when

Do not use this skill when

Instructions

Context

Core Requirements

Essential Architecture

LangGraph State Management

Model & Embeddings

Agent Types

Memory Systems

RAG Pipeline

Advanced RAG Patterns

Tools & Integration

Production Deployment

FastAPI Server with Streaming

Monitoring & Observability

Optimization Strategies

Testing & Evaluation

Key Patterns

State Graph Pattern

Async Pattern

Error Handling Pattern

Implementation Checklist

Best Practices

Limitations

Questions & comments · 0