Vespa
Vespa.ai operates Vespa Cloud - used by companies to run Big Data serving with AI, online. We maintain the Vespa open-source project, continuously released and used by organizations with high performance, availability, and functional requirements. We are hiring! See the Jobs page, or visit our website.
Founded
2023
Location
Trondheim
Employees
57
Funding
OSS
Vespa — Open-Source Search and Vector Engine for Large-Scale AI
Overview
**Vespa** is an open-source search and vector engine purpose-built for production AI workloads. It stores and serves vectors, text, and structured data in one system, then ranks results using on-node machine learning for ultra‑low latency at scale. Use it for **hybrid retrieval**, **RAG**, **recommendation**, and **personalization**—self-managed or on the managed **Vespa Cloud**.
Originating from Yahoo’s search stack and open-sourced in 2017, Vespa is maintained by Vespa.ai (HQ Trondheim, Norway) and operated by a senior, compact team.
Why Vespa
Vespa’s pitch: a search platform with native vector support beats a pure vector database for production AI. You get ANN search with filters, text retrieval, aggregations, custom ranking features, and fresh updates in one system—plus **on-node inference** to cut glue code and latency. Notably, **Perplexity** brought its search in-house with Vespa to scale more effectively .
Key Capabilities
Primary Use Cases
Who It’s For
Integrations and Ecosystem
Deployment, Pricing, and Free Trial
Proof Points and References
User Sentiment Snapshot
Pros
Cons
Technical Highlights
Getting Started
Company Snapshot
---
In short: Vespa is a production‑ready, open‑source search and vector engine that unifies vectors, text, and structured data with on‑node ML for fast, scalable hybrid search, RAG, and recommendations—self‑managed or fully managed on Vespa Cloud.
Related Companies
Arcade
Baseten
Inference is everything. Baseten is an AI infrastructure platform giving you the tooling, expertise, and hardware needed to bring great AI products to market - fast. Our proprietary Inference Stack utilizes the cutting-edge of performance research combined with highly performant and reliable infrastructure to give you out-of-the-box global availability with 99.99% of uptime.
Cast AI
Increase your profit margin without additional work. CAST AI cuts your cloud bill in half, automates DevOps tasks, and prevents downtime in one Autonomous Kubernetes platform.
Ciroos
Ciroos (pronounced "Sai-rose") offers an AI SRE teammate that empowers site reliability engineers (SREs), DevOps and operations teams to be superheroes. Built from the ground up with the power of multi-agentic AI, Ciroos enables operations teams to reduce toil, investigate incidents, explain anomalies, and drive autonomous operations, across complex multi-domain environments, all while leaving humans in control. Reach out to us at www.ciroos.ai to learn more about what an AI SRE Teammate can do for you.
Context.ai
Context is the first AI Office Suite that automates your workflow by creating documents, presentations, spreadsheets, and more using your data, tools, and style.
Databricks Mosaic AI
Databricks is the Data and AI company. More than 15,000 organizations worldwide — including Block, Comcast, Condé Nast, Rivian, Shell and over 60% of the Fortune 500 — rely on the Databricks Data Intelligence Platform to take control of their data and put it to work with AI. Databricks is headquartered in San Francisco, with offices around the globe, and was founded by the original creators of Lakehouse, Apache Spark, Delta Lake and MLflow. --- Databricks applicants Please apply through our official Careers page at databricks.com/company/careers. All official communication from Databricks will come from email addresses ending with @databricks.com or @goodtime.io (our meeting tool).