
Qbeast 0.6 marks a structural shift: from a library to a distributed, lakehouse-adjacent service.With SOC 2 compliance and multi-cloud deployment across AWS, Azure, and GCP, Qbeast now runs where your data lives—collocating optimization with your lakehouse estate. It is available both as a fully managed SaaS and deployable within your own resources, giving enterprises flexibility across security and operational models.
This release also introduces AI-native access via MCP, along with meaningful query performance gains through new storage and execution optimizations.
Here's what's new.
Qbeast runs natively on Kubernetes, and you can now run it as a managed service or in a provided Kubernetes cluster on AWS, Azure, or GCP — in your VPC. For the lakehouse catalog, we support Hive Metastore (Thrift), JDBC, and AWS Glue.
Deployments are orchestrated with Crossplane, enabling fully reproducible and maintainable infrastructure: cluster provisioning, networking, storage and observability are defined as code rather than bespoke setup.
We support two deployment models:
This release also introduces foundational platform capabilities:
qbctl, a CLI that wraps the Control Plane API for operators who'd rather not hit the REST endpoints directly.Qbeast now includes a Model Context Protocol (MCP) server, enabling direct integration with AI tools — e.g., Claude Code, OpenAI Codex, Cursor, and other MCP-compatible agents.This makes Qbeast a first-class data source for agentic workflows.
Key capabilities:
The MCP server is available across all supported clouds and includes built-in observability (Prometheus + Grafana). Auth0 is supported as an OIDC provider.The practical outcome: analysts and agents can query optimized lakehouse data conversationally while benefiting from Qbeast’s indexing—without additional pipelines.

Qbeast continues to focus on its core objective: reducing query cost through data layout optimization. Two major enhancements in 0.6:
Page-Cube Alignment (PCA)
A cube-aware Parquet writer that co-locates data by cube within files.
This improves data locality and reduces I/O during query execution—amplifying the benefits of multi-dimensional indexing.
Storage Partition Joins (SPJ)
Qbeast now aligns its weighting model with Iceberg bucketing metadata, allowing Spark to eliminate shuffle operations for joins on bucketed tables. For join-heavy workloads, this translates directly into lower compute cost and faster execution.
Additional improvements
Qbeast supports both new and existing tables, with flexible onboarding strategies. For large datasets, full rewrites are often impractical. Qbeast allows:
This capability is particularly relevant for enterprises operating at multi-terabyte or petabyte scale.

We have additionally included the following:
With 0.6.0, Qbeast now supports all three major open table formats (Iceberg, Delta Lake, Hudi) across all three major cloud providers (AWS, GCP, Azure), with automated deployment into the customer environment. Query performance — the thing we care most about — also sees a number of enhancements in this release.
What's next: enabling agentic AI over the lakehouse. As autonomous agents increasingly operate directly on data, efficient storage layout and query performance become critical to controlling compute costs. Qbeast is positioning itself as the optimization layer that makes this viable at scale.