Vellum

Use Vellum to build and ship reliable AI solutions. Define, evaluate and monitor AI solutions through test-driven development for AI.

Agent Platform

Visit Website

Founded

2023

Location

New York, New York

Employees

Funding

$25.5M

Vellum: Enterprise Platform for Building, Evaluating, and Operating AI Agents

Vellum is an enterprise-grade platform for designing, testing, deploying, and monitoring AI agents and LLM-powered applications. It unifies orchestration, evaluations, versioning, routing, and production observability so teams can ship reliable AI into real workflows—without stitching together disparate tools.

YC batch: **W23**

HQ: **New York, NY (Madison Ave)**

Ideal for: Product, engineering, data/platform, and operations teams building AI into production workflows with governance and compliance needs

Start free: See the **Free plan** on the [pricing page](https://www.vellum.ai/pricing)

Explore Vellum: [Homepage](https://www.vellum.ai/) | [Docs](https://docs.vellum.ai/home/getting-started/overview) | [LLM Leaderboard](https://www.vellum.ai/llm-leaderboard) | [Orchestration](https://www.vellum.ai/products/orchestration)

---

Why Vellum

Build agents in plain English or as **visual workflows** with tools and function calling

Centralize **orchestration, evaluations, observability, routing, and versioning** in one platform

Integrate any model (OpenAI, Anthropic, Google, Azure OpenAI, open-source) and connect external systems via APIs and webhooks

Ship safely with **SOC 2 Type 2, HIPAA, SSO, RBAC, VPC/private networking**, audit logs, and governance

Improve quality over time with **evals, datasets, A/B testing, benchmarking**, and production traces

Learn more: [Orchestration & Agent Builder](https://www.vellum.ai/products/orchestration) | [Evaluations](https://www.vellum.ai/blog/introducing-vellum-evaluations) | [Levels of Agentic Behavior](https://www.vellum.ai/blog/levels-of-agentic-behavior)

---

Key Capabilities

**Agent Builder**

Design agents by chatting in natural language or assembling **node-based visual workflows**

Add tool use via **function calls** and custom actions; connect to external APIs and webhooks

Mix and match **internal and external models**; enable routing by cost, quality, or latency

**Evaluations & Experimentation**

Run **evals** before rollout using datasets and benchmarks

**Compare prompts, models, and versions**; A/B test workflows to quantify improvements

Use the public [LLM Leaderboard](https://www.vellum.ai/llm-leaderboard) to compare models by benchmarks, cost, and context windows

**Production Observability**

End-to-end **tracing, logging, and feedback capture** for real-world monitoring

**Versioning and safe deployments:** promote/rollback versions with auditability

**Governance & Collaboration**

**SSO, RBAC, audit logs**, and private networking options

Share agents across teams; track changes and debug with **production traces**

**Deployment Options**

Ship via **API**, generate internal UIs with one click, or embed widgets

Get started: [Docs Overview](https://docs.vellum.ai/home/getting-started/overview)

---

Security & Compliance

Certifications and controls: **SOC 2 Type 2**, **HIPAA**, **SSO**, **RBAC**, **VPC/private networking**, audit logs

Built for enterprises needing **governance, auditability, and controlled rollout**

Details: [Security features referenced across product pages](https://www.vellum.ai/)

---

Who It’s For

**Product managers and engineers** shipping AI features into production

**Data and platform teams** standardizing LLM ops, governance, and observability

**Enterprise teams** in SaaS, healthcare, fintech, support, and operations requiring evals, auditability, and compliance

---

Common Use Cases

Customer support and internal assistants with tool use and retrieval

Sales enablement and content generation with approval steps

Knowledge assistants with **RAG** and guardrails

Operations automations that call internal APIs and back-office systems

**Evaluation pipelines** and A/B testing for prompts, models, and workflows

SEO and content workflows from research to publishing

Example: [Notion Article Generator Template](https://www.vellum.ai/template/generate-article-in-notion)

---

Integrations

Models: **OpenAI, Anthropic, Google, Azure OpenAI**, and **open-source models**

Data & memory: Retrieval over your data, vector stores, and in-product datasets

Tools & actions: Function calling, APIs, webhooks, SDK, and custom tools

Enterprise: **SSO, RBAC, audit logs, private networking**

Observability: Production tracing, logging, and feedback capture

More on orchestration and integrations: [Product Overview](https://www.vellum.ai/products/orchestration)

---

Pricing

**Free plan available** to start building and testing

Usage-based and enterprise tiers for scale, governance, and advanced features

See details: [Pricing](https://www.vellum.ai/pricing)

---

Customer Sentiment (Pros & Cons)

Pros

Strong visual builder and collaboration—non-dev teammates can contribute to workflows

Source: [Reddit: r/AI_Agents](https://www.reddit.com/r/AI_Agents/comments/1nqyy1r/tried_a_bunch_of_aiagent_platforms_and_what/)

Workflows and function calling reduce custom code and save engineering time

Source: [G2 Reviews](https://www.g2.com/products/vellum/reviews)

Fast way to prototype, test, deploy, and observe LLM workflows in one place

Source: [AWS Marketplace](https://aws.amazon.com/marketplace/pp/prodview-skbe6ou2xiqc4)

Built-in evals and versioning support safer iteration and quality gates

Source: [G2 Pros & Cons](https://www.g2.com/products/vellum/reviews?qs=pros-and-cons)

Helpful, responsive support for complex setups

Source: [AWS Marketplace Reviews](https://aws.amazon.com/marketplace/reviews/reviews-list/prodview-skbe6ou2xiqc4)

Cons

Advanced flows can feel complex without guidance

Source: [G2 Pros & Cons](https://www.g2.com/products/vellum/reviews?qs=pros-and-cons)

Pricing transparency has been a concern for some users

Source: [Reddit: r/ChatGPT](https://www.reddit.com/r/ChatGPT/comments/1fzny2e/how_much_does_vellumai_cost/)

Higher cost vs. DIY stacks, offset by time savings at scale

Source: [Reddit: r/AI_Agents](https://www.reddit.com/r/AI_Agents/comments/1nqyy1r/tried_a_bunch_of_aiagent_platforms_and_what/)

---

Notable Resources

Product and positioning: [Homepage](https://www.vellum.ai/)

Orchestration and agent builder: [Product](https://www.vellum.ai/products/orchestration)

Evaluations: [Introducing Vellum Evaluations](https://www.vellum.ai/blog/introducing-vellum-evaluations)

Best practices: [Ultimate LLM Agent Build Guide](https://www.vellum.ai/blog/the-ultimate-llm-agent-build-guide)

Agent maturity: [Levels of Agentic Behavior](https://www.vellum.ai/blog/levels-of-agentic-behavior)

Model comparisons: [LLM Leaderboard](https://www.vellum.ai/llm-leaderboard)

Company updates: [LinkedIn](https://www.linkedin.com/company/vellumai)

---

Quick Facts

Company: **Vellum**

What it does: **Enterprise platform to build, evaluate, deploy, and monitor AI agents and LLM apps**

HQ: **New York, NY, USA**

Team size: ~37 (LinkedIn)

YC batch: **W23**

Funding: Seed and growth rounds; a **$20M raise** was shared on [LinkedIn](https://www.linkedin.com/company/vellumai)

Product pillars: **Orchestration, Evaluations, Observability, Versioning, Routing, Governance**

LLM Leaderboard: [Public comparison page](https://www.vellum.ai/llm-leaderboard)

Pricing: **Free plan** available; paid tiers on the [pricing page](https://www.vellum.ai/pricing)

Related Companies

Botpress

Botpress is a fully extensible chatbot and AI agent platform. The all-in-one conversational AI Platform-as-a-Service (PaaS) allows users to build, deploy, and monitor LLM-powered solutions. Applied across industries, use cases, and business processes, Botpress projects are always scalable, secure, and on-brand. With 900,000+ users and millions of bots deployed worldwide, Botpress is the platform of choice for companies, developers, and new coders alike. In 2025, Botpress raised a $25 million Series B round of funding led by investors like FRAMEWORK, Deloitte, HubSpot, and Inovia. Botpress has been deploying chatbots since 2017. The company is headquartered in Quebec.

Lyzr

"Lyzr AI - The Full-Stack Agent Framework Build fully autonomous AI agents with Lyzr. Lyzr agents run locally on your cloud server, ensuring 100% data privacy and compliance. AI Worker Agents 1. Jazon - The AI SDR 2. Skott - The AI Marketer Productivity Agents 1. Chat Agent 2. Knowledge Search Book a demo today - https://www.lyzr.ai/book-demo"

Relevance AI

Relevance AI is the home of the AI workforce: where anyone can build and recruit teams of AI agents to complete tasks on autopilot. Our no-code platform is built for ops teams, no technical background required. Subject-matter experts can use Relevance to design powerful AI agents and AI teams without relying on developer resources. Scale excellence across every area or team with your intelligent, purpose-built AI workforce.

Retool AI Agents

Build internal software better with AI. Create apps, agents, and workflows with any LLM, datasource, or API to deploy AI across your business. Retool is the application layer for AI and leading platform for internal software development, trusted by over 10,000 companies worldwide, including Amazon, Stripe, Brex, and Orangetheory Fitness. Using Retool, developers deploy sophisticated apps and agents dramatically faster without sacrificing quality or control, combining powerful building blocks with the flexibility of custom code. To learn more and start building for free today, visit https://retool.com

Stack AI

Enterprise AI trusted by IT - power your GenAI strategy with Stack AI. Backed by YC and Google. We are a small dedicated team of insanely talented individuals relentlessly pushing the boundaries of what’s possible with AI. We are pioneering a new horizontal platform allowing anyone to build and deploy AI agents and automations. Companies across industries—including healthcare, legal, financial, logistics, and defense—use StackAI make their organizations faster, more efficient, and scalable, leveraging the power of AI. With strong backing from YC and Google, we're set to double our team size in 2025. This is an exciting time to join if you want to build a category-defining product with strong customer momentum. We're looking for user-centric, craft-focused, creative minds who work hard but don't take themselves too seriously. We're ambitious yet pragmatic. We move fast, but care about the important details. We're hiring for a range of roles. Reach out if you'd like to learn more!

TinyFish

The Enterprise Web Agent company. Our AI agents run complex business workflows at web scale millions of times to deliver measurable outcomes.