Multi-step prompt sequences for complex AI workflows.
89 tools found
Integrate Moonshot AI provider into promptfoo for testing and evaluating code generation prompts.
Execute hard coding tasks with Claude Fable 5's adaptive thinking at maximum effort level for complex problem-solving.
Multi-step prompt workflow that migrates legacy code by running an OpenAI agent outside sandboxed execution environments, validating each repo shard with tests
Three-phase agent workflow (review, repair, validate) that uses Codex CLI to detect and fix stale API documentation through iterative feedback loops.
A multi-step prompt workflow teaching how to use persistent Goals in Codex to keep long-running tasks-like profiling, benchmarking, or flaky test
Multi-step prompt workflow that uses GPT-5.5 to generate structured office layouts from empty floorplans, furniture catalogs, and spatial constraints.
A prompt workflow that runs Promptfoo evaluations against Anthropic Messages API using an existing local Claude Code session instead of creating a separate API
Azure Mai is a multi-step prompt workflow example demonstrating how to run prompt chains with promptfoo's Azure integration for testing and evaluation.
A prompt workflow that benchmarks and compares different GPT model tiers side-by-side to evaluate performance, cost, and output quality across OpenAI's model
A promptfoo example demonstrating how to configure and test prompts using the Fireworks AI provider for LLM evaluation and benchmarking workflows.
Prompt workflow example demonstrating MLflow AI Gateway integration as an LLM provider in promptfoo for governed model access and testing.
Automate code generation and review for Nvidia provider integrations, enhancing development efficiency and code quality.