← Products/DataAPI

Your model gets
context,
not fragments.

DataAPI indexes your knowledge base and returns distilled, query-relevant context to your LLM — three retrieval tiers, one endpoint, zero glue code.

Get API key See pricing

95.2%

search accuracy

60%

token savings

<300ms

query latency

Live demo

Ask the knowledge base. Get distilled context, not fragments.

AI App

Ask the Cosavu knowledge base

◆ demo knowledge base

DataAPIawaiting query

awaiting query input

The problem with RAG

RAG returns chunks.
DataAPI returns context.

Traditional RAG hands your model a bag of raw text fragments — similar-looking but unrelated, often redundant — and hopes it figures out the rest.

DataAPI resolves, re-ranks, deduplicates, and optionally distils the evidence into a single coherent block. The LLM gets what it needs. Nothing more.

Traditional RAG

Fragment 1 …

Fragment 2 …

Fragment 2 (duplicate) …

Fragment 3 …

Fragment 4 (off-topic) …

model receives noise

DataAPI CAR-1.5

Distilled context — coherent, deduplicated, targeted.

Direct answer synthesised from evidence.

model receives signal

Retrieval systems

Three tiers.
One endpoint.

Pass system: "car-0" | "car-1" | "car-1.5" per query. Upgrade a single call without changing your integration.

CAR-0 — Semantic search

Cosavu Store-backed vector search with per-tenant namespace isolation. Fast, reliable, and ideal for most document retrieval workloads where semantic similarity is the primary signal.

›Cosavu Store cosine similarity
›Per-tenant namespace isolation
›HNSW index
›Best for: notes, documents, unstructured text

retrieval engine

> QUERY RECEIVED

"how does context distillation work"

SCANNING COSAVU INDEX

████████░░░░░░░░░░░░░░░ 32%

CANDIDATES FOUND ─────── ?

ENGRAM waiting ·········· |

Inside CAR-1

The Engram filter —
seven signals, one score.

Every candidate chunk is scored across semantic similarity, lexical recall, IDF weighting, bigram overlap, exact match, and rank prior. Diversity-aware gating then removes near-duplicate passages before they consume your context budget.

Chunks with zero query-term overlap are penalised 75%. Off-topic passages don't crowd out relevant ones even when the vector space is noisy.

Semantic similarity

35%

Lexical recall

30%

IDF recall

18%

Bigram overlap

12%

Lexical precision

14%

Exact match

Rank prior

Diversity gate

22% novelty min

Capabilities

What DataAPI does

Multi-tenant isolation

Every collection is namespaced to a tenant. Cosavu Index and Cosavu Store are completely isolated — one tenant's data never touches another's.

Ingest anything

Upload PDFs, DOCX, plain text, code files, PPTX, XLSX, and CSV. A universal parser extracts and chunks each file automatically.

Context budget

Returned chunks are trimmed to a configurable token budget using sentence-level extraction — your model never receives a truncated mid-thought.

Safety on every query

A moderation gate runs before retrieval. Harmful queries are blocked at the boundary before any data is accessed.

Get started

Start retrieving with DataAPI.

Get a tenant key and connect your knowledge base in minutes. No infrastructure to manage.

Get API key View pricing

Your model getscontext,not fragments.