Develop Production-Ready LangChain Agents
Expert skill for building production-grade LangChain 0.1+ and LangGraph agents with Claude Sonnet 4.5, Voyage AI embeddings, async patterns, and LangSmith
Why it matters
Build sophisticated, production-grade AI agent systems using LangChain and LangGraph. This asset provides expertise in developing scalable, observable, and cost-efficient agents for complex tasks.
Outcomes
What it gets done
Implement advanced RAG pipelines with Voyage AI embeddings and Pinecone.
Design and build agent architectures using LangGraph state management.
Integrate LangSmith for comprehensive observability and tracing.
Develop robust error handling, async patterns, and security best practices.
Install
Add it to your toolbox
Run in your project directory:
curl -fsSL https://spark.entire.vc/get/ag-llm-application-dev-langchain-agent | bash Capabilities
What this skill does
Writes source code or scripts from a description.
Chunks, embeds, and indexes documents for semantic retrieval.
Handles multi-turn conversations within a defined domain.
Condenses long documents or threads into key takeaways.
Traces errors to their root cause and suggests fixes.
Overview
LangChain/LangGraph Agent Development Expert
What it does
This skill delivers production-ready patterns for building LangChain 0.1+ and LangGraph agents. It covers state management with StateGraph and MessagesState, memory systems including token-based windowing and vector memory, and RAG pipelines using Voyage AI embeddings with Pinecone. The source provides specific implementations for ReAct agents, plan-and-execute workflows, multi-agent orchestration with supervisor routing, and tool integration with structured schemas. It includes FastAPI server patterns for streaming, monitoring with LangSmith and Prometheus, caching with Redis, and error handl
How it connects
Use this skill when building LangChain or LangGraph agent systems that require production-grade architecture. It applies to tasks involving multi-step reasoning agents, RAG implementations with semantic search, multi-agent orchestration, conversation memory management, or deployment of agent APIs with streaming and observability. The skill is appropriate when you need guidance on state graph patterns, async implementations, tool creation, or integration of Claude Sonnet 4.5 with Voyage AI embeddings.
Source README
LangChain/LangGraph Agent Development Expert
You are an expert LangChain agent developer specializing in production-grade AI systems using LangChain 0.1+ and LangGraph.
Use this skill when
- Working on langchain/langgraph agent development expert tasks or workflows
- Needing guidance, best practices, or checklists for langchain/langgraph agent development expert
Do not use this skill when
- The task is unrelated to langchain/langgraph agent development expert
- You need a different domain or tool outside this scope
Instructions
- Clarify goals, constraints, and required inputs.
- Apply relevant best practices and validate outcomes.
- Provide actionable steps and verification.
- If detailed examples are required, open
resources/implementation-playbook.md.
Context
Build sophisticated AI agent system for: $ARGUMENTS
Core Requirements
- Use latest LangChain 0.1+ and LangGraph APIs
- Implement async patterns throughout
- Include comprehensive error handling and fallbacks
- Integrate LangSmith for observability
- Design for scalability and production deployment
- Implement security best practices
- Optimize for cost efficiency
Essential Architecture
LangGraph State Management
from langgraph.graph import StateGraph, MessagesState, START, END
from langgraph.prebuilt import create_react_agent
from langchain_anthropic import ChatAnthropic
class AgentState(TypedDict):
messages: Annotated[list, "conversation history"]
context: Annotated[dict, "retrieved context"]
Model & Embeddings
- Primary LLM: Claude Sonnet 4.5 (
claude-sonnet-4-5) - Embeddings: Voyage AI (
voyage-3-large) - officially recommended by Anthropic for Claude - Specialized:
voyage-code-3(code),voyage-finance-2(finance),voyage-law-2(legal)
Agent Types
ReAct Agents: Multi-step reasoning with tool usage
- Use
create_react_agent(llm, tools, state_modifier) - Best for general-purpose tasks
- Use
Plan-and-Execute: Complex tasks requiring upfront planning
- Separate planning and execution nodes
- Track progress through state
Multi-Agent Orchestration: Specialized agents with supervisor routing
- Use
Command[Literal["agent1", "agent2", END]]for routing - Supervisor decides next agent based on context
- Use
Memory Systems
- Short-term:
ConversationTokenBufferMemory(token-based windowing) - Summarization:
ConversationSummaryMemory(compress long histories) - Entity Tracking:
ConversationEntityMemory(track people, places, facts) - Vector Memory:
VectorStoreRetrieverMemorywith semantic search - Hybrid: Combine multiple memory types for comprehensive context
RAG Pipeline
from langchain_voyageai import VoyageAIEmbeddings
from langchain_pinecone import PineconeVectorStore
# Setup embeddings (voyage-3-large recommended for Claude)
embeddings = VoyageAIEmbeddings(model="voyage-3-large")
# Vector store with hybrid search
vectorstore = PineconeVectorStore(
index=index,
embedding=embeddings
)
# Retriever with reranking
base_retriever = vectorstore.as_retriever(
search_type="hybrid",
search_kwargs={"k": 20, "alpha": 0.5}
)
Advanced RAG Patterns
- HyDE: Generate hypothetical documents for better retrieval
- RAG Fusion: Multiple query perspectives for comprehensive results
- Reranking: Use Cohere Rerank for relevance optimization
Tools & Integration
from langchain_core.tools import StructuredTool
from pydantic import BaseModel, Field
class ToolInput(BaseModel):
query: str = Field(description="Query to process")
async def tool_function(query: str) -> str:
# Implement with error handling
try:
result = await external_call(query)
return result
except Exception as e:
return f"Error: {str(e)}"
tool = StructuredTool.from_function(
func=tool_function,
name="tool_name",
description="What this tool does",
args_schema=ToolInput,
coroutine=tool_function
)
Production Deployment
FastAPI Server with Streaming
from fastapi import FastAPI
from fastapi.responses import StreamingResponse
@app.post("/agent/invoke")
async def invoke_agent(request: AgentRequest):
if request.stream:
return StreamingResponse(
stream_response(request),
media_type="text/event-stream"
)
return await agent.ainvoke({"messages": [...]})
Monitoring & Observability
- LangSmith: Trace all agent executions
- Prometheus: Track metrics (requests, latency, errors)
- Structured Logging: Use
structlogfor consistent logs - Health Checks: Validate LLM, tools, memory, and external services
Optimization Strategies
- Caching: Redis for response caching with TTL
- Connection Pooling: Reuse vector DB connections
- Load Balancing: Multiple agent workers with round-robin routing
- Timeout Handling: Set timeouts on all async operations
- Retry Logic: Exponential backoff with max retries
Testing & Evaluation
from langsmith.evaluation import evaluate
# Run evaluation suite
eval_config = RunEvalConfig(
evaluators=["qa", "context_qa", "cot_qa"],
eval_llm=ChatAnthropic(model="claude-sonnet-4-5")
)
results = await evaluate(
agent_function,
data=dataset_name,
evaluators=eval_config
)
Key Patterns
State Graph Pattern
builder = StateGraph(MessagesState)
builder.add_node("node1", node1_func)
builder.add_node("node2", node2_func)
builder.add_edge(START, "node1")
builder.add_conditional_edges("node1", router, {"a": "node2", "b": END})
builder.add_edge("node2", END)
agent = builder.compile(checkpointer=checkpointer)
Async Pattern
async def process_request(message: str, session_id: str):
result = await agent.ainvoke(
{"messages": [HumanMessage(content=message)]},
config={"configurable": {"thread_id": session_id}}
)
return result["messages"][-1].content
Error Handling Pattern
from tenacity import retry, stop_after_attempt, wait_exponential
@retry(stop=stop_after_attempt(3), wait=wait_exponential(multiplier=1, min=4, max=10))
async def call_with_retry():
try:
return await llm.ainvoke(prompt)
except Exception as e:
logger.error(f"LLM error: {e}")
raise
Implementation Checklist
- Initialize LLM with Claude Sonnet 4.5
- Setup Voyage AI embeddings (voyage-3-large)
- Create tools with async support and error handling
- Implement memory system (choose type based on use case)
- Build state graph with LangGraph
- Add LangSmith tracing
- Implement streaming responses
- Setup health checks and monitoring
- Add caching layer (Redis)
- Configure retry logic and timeouts
- Write evaluation tests
- Document API endpoints and usage
Best Practices
- Always use async:
ainvoke,astream,aget_relevant_documents - Handle errors gracefully: Try/except with fallbacks
- Monitor everything: Trace, log, and metric all operations
- Optimize costs: Cache responses, use token limits, compress memory
- Secure secrets: Environment variables, never hardcode
- Test thoroughly: Unit tests, integration tests, evaluation suites
- Document extensively: API docs, architecture diagrams, runbooks
- Version control state: Use checkpointers for reproducibility
Build production-ready, scalable, and observable LangChain agents following these patterns.
Limitations
- Use this skill only when the task clearly matches the scope described above.
- Do not treat the output as a substitute for environment-specific validation, testing, or expert review.
- Stop and ask for clarification if required inputs, permissions, safety boundaries, or success criteria are missing.
Discussion
Questions & comments · 0
Sign In Sign in to leave a comment.