Catalog

32
Skill

Agent Evaluation

Testing and benchmarking LLM agents including behavioral testing, capability assessment, reliability metrics, and production monitoring—where even top agents achieve less than 50% on re...

claude-code
3.0 0
Testing
Skill

AI Product Development

Every product will be AI-powered. The question is whether you'll build it right or ship a demo that falls apart in production. This skill covers LLM integration patterns, RAG architecture, prompt ...

claude-code
3.0 0
Architecture
Skill

Context Manager

Elite AI context engineering specialist mastering dynamic context management, vector databases, knowledge graphs, and intelligent memory systems. Orchestrates context across multi-agent workflows, enterprise AI systems, and long-running projects with 2024/2025 best practices. Use PROACTIVELY for complex AI orchestration.

claude-code
3.0 0
Data Analysis
Skill

Context Optimization Techniques

Apply compaction, masking, and caching strategies

claude-code
3.0 0
Code Generation
Skill

Langfuse

Expert in Langfuse - the open-source LLM observability platform. Covers tracing, prompt management, evaluation, datasets, and integration with LangChain, LlamaIndex, and OpenAI. Essential for debug...

claude-code
3.0 0
Data Analysis
Skill

🤖 LLM Application Patterns

Production-ready patterns for building LLM applications. Covers RAG pipelines, agent architectures, prompt IDEs, and LLMOps monitoring. Use when designing AI applications, implementing RAG, buildin...

claude-code
3.0 0
Architecture
Skill

ML Pipeline Workflow

Build end-to-end MLOps pipelines from data preparation through model training, validation, and production deployment. Use when creating ML pipelines, implementing MLOps practices, or automating mod...

claude-code
3.0 0
DevOps
Skill

Python Performance Optimization

Profile and optimize Python code using cProfile, memory profilers, and performance best practices. Use when debugging slow Python code, optimizing bottlenecks, or improving application performance.

claude-code
3.0 0
Code Generation
Agent

AI Engineer Pro

Autonomously designs and implements production-ready AI systems including RAG pipelines, agent architectures, and MLOps workflows.

claude-opusclaude
5.0 0
Architecture
Agent

Test Results Analyzer

Autonomously analyzes test execution data, generates comprehensive quality metrics, and provides actionable insights for improving test coverage and reliability.

claude-sonnetclaude
4.0 0
MLOps
Skill

Airflow DAG Builder Agent

Transforms Claude into an expert in creating, optimizing, and troubleshooting Apache Airflow DAGs with best practices for production workflows.

claude
5.0 0
MLOps
Skill

Argo Workflow Generator Agent

Helps Claude generate, optimize, and diagnose Argo Workflows with expert knowledge of YAML specifications, templates, and best practices.

claude
5.0 0
DevOps