Kyvos · Cloudflare

Kyvos turned the semantic layer into the AI layer.
What's running under it?

The framing on kyvosinsights.com is clear: "A universal semantic layer for AI and BI" — sub-second analytics on petabyte data in Snowflake, Databricks, BigQuery, Synapse, and Redshift. Pre-aggregated cubes, governed metric definitions, the same semantic model feeding both dashboards and LLMs.

That puts Kyvos in the path of every customer's most sensitive workflow: an LLM, a copilot, or an agent asking questions of their enterprise data. The cube is the answer. The question — and what comes back — needs a runtime that handles cost, identity, PII, and latency before it ever hits a model provider. That's where Cloudflare slots in.

GA · June 2026
The piece an AI semantic layer was missing.
AI Gateway just shipped dollar-denominated spend limits, per-tenant cost attribution, PII redaction at the gateway, and a unified log across every model provider: OpenAI, Anthropic, Google, Bedrock, Workers AI. That is the kind of governance Kyvos could embed once and offer to every enterprise customer, without building (or maintaining) an LLM platform team per cloud. Read the announcement.

The primitives that map cleanly to where Kyvos is going:

AI Gateway — under every text-to-cube and copilot call. Caching, fallback routing, PII redaction, per-tenant spend caps before the model provider sees the prompt.
Workers AI — for inference paths where the customer doesn't want their semantic dictionary or query patterns sitting in a third-party model log.
Vectorize — for the embeddings index over metric definitions, hierarchies, and business glossaries that grounds RAG against the cube.
R2 — egress-free object store for cube materializations, exports, and snapshots. Complements Snowflake/Databricks, doesn't compete with them.
Durable Objects — per-tenant, per-session state for long-running analytical conversations and shared exploration.
Workers — semantic API and embedded dashboards served from 330+ POPs, close to whichever office or region is querying.
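
To make the AI Gateway item concrete, here is a minimal sketch of how a copilot call might be tagged per tenant before it reaches a model provider. It only builds the request and does not send it. The gateway URL pattern and the cf-aig-metadata header reflect our reading of Cloudflare's AI Gateway documentation and should be checked against the current docs. The function name, account and gateway IDs, tenant ID, and model name are hypothetical placeholders, not anything Kyvos ships.

```typescript
// Sketch only: build (but do not send) a chat request routed through AI Gateway.
// All IDs, the model name, and buildGatewayRequest itself are illustrative assumptions.

interface GatewayRequest {
  url: string;
  init: { method: string; headers: Record<string, string>; body: string };
}

function buildGatewayRequest(
  accountId: string,   // Cloudflare account ID (placeholder)
  gatewayId: string,   // AI Gateway name (placeholder)
  tenantId: string,    // the Kyvos customer the call is attributed to (hypothetical)
  question: string,    // natural-language question aimed at the semantic layer
  providerKey: string, // model-provider API key
): GatewayRequest {
  // Assumed URL shape: gateway base, then the provider path (OpenAI-compatible here).
  const url =
    `https://gateway.ai.cloudflare.com/v1/${accountId}/${gatewayId}/openai/chat/completions`;

  return {
    url,
    init: {
      method: "POST",
      headers: {
        "Content-Type": "application/json",
        Authorization: `Bearer ${providerKey}`,
        // Custom metadata lets gateway logs be filtered by tenant.
        "cf-aig-metadata": JSON.stringify({ tenant: tenantId }),
      },
      body: JSON.stringify({
        model: "gpt-4o-mini", // placeholder model name
        messages: [{ role: "user", content: question }],
      }),
    },
  };
}
```

Sending it would be a single fetch(req.url, req.init) from a Worker or any backend. The point of the design is that tenant attribution travels with the request, so cost and logging policy can live in the gateway instead of in each product feature.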

One question for the Kyvos team:

For the AI-on-the-semantic-layer story — is the harder problem right now customer-side (giving enterprises a runtime they trust for governance, PII, and cost) or Kyvos-side (the embedded AI features inside the product that need to scale across every customer cloud)? 20 minutes to compare notes.

Deeper Dive

The full architecture, ready when you are

The detailed primitive-by-primitive mapping — Kyvos's semantic layer on Cloudflare primitives, the AI Gateway runtime story for the customer-facing copilot, the multi-cloud anti-lock-in argument, and a 90-day proof plan — runs about 19 KB of dense technical content.

Read the expanded version →
Grab 20 minutes →
Matt Holscher
Cloudflare · Developer Platform