CosavuCosavu

← Products/DataAPI

Your model gets
context,
not fragments.

DataAPI indexes your knowledge base and returns distilled, query-relevant context to your LLM — three retrieval tiers, one endpoint, zero glue code.

95.2%
search accuracy
60%
token savings
<300ms
query latency

Live demo

Ask the knowledge base. Get distilled context, not fragments.

AI App

Ask the Cosavu knowledge base

◆ demo knowledge base
DataAPIawaiting query

awaiting query input

The problem with RAG

RAG returns chunks.
DataAPI returns context.

Traditional RAG hands your model a bag of raw text fragments — similar-looking but unrelated, often redundant — and hopes it figures out the rest.

DataAPI resolves, re-ranks, deduplicates, and optionally distils the evidence into a single coherent block. The LLM gets what it needs. Nothing more.

Traditional RAG

Fragment 1 …
Fragment 2 …
Fragment 2 (duplicate) …
Fragment 3 …
Fragment 4 (off-topic) …

model receives noise

DataAPI CAR-1.5

Distilled context — coherent, deduplicated, targeted.
Direct answer synthesised from evidence.

model receives signal

Retrieval systems

Three tiers.
One endpoint.

Pass system: "car-0" | "car-1" | "car-1.5" per query. Upgrade a single call without changing your integration.

CAR-0Semantic search

Cosavu Store-backed vector search with per-tenant namespace isolation. Fast, reliable, and ideal for most document retrieval workloads where semantic similarity is the primary signal.

  • Cosavu Store cosine similarity
  • Per-tenant namespace isolation
  • HNSW index
  • Best for: notes, documents, unstructured text
retrieval engine
> QUERY RECEIVED
"how does context distillation work"
SCANNING COSAVU INDEX
████████░░░░░░░░░░░░░░░ 32%
CANDIDATES FOUND ─────── ?
ENGRAM waiting ·········· |

Inside CAR-1

The Engram filter —
seven signals, one score.

Every candidate chunk is scored across semantic similarity, lexical recall, IDF weighting, bigram overlap, exact match, and rank prior. Diversity-aware gating then removes near-duplicate passages before they consume your context budget.

Chunks with zero query-term overlap are penalised 75%. Off-topic passages don't crowd out relevant ones even when the vector space is noisy.

Semantic similarity

35%

Lexical recall

30%

IDF recall

18%

Bigram overlap

12%

Lexical precision

14%

Exact match

6%

Rank prior

2%

Diversity gate

22% novelty min

Capabilities

What DataAPI does

01

Multi-tenant isolation

Every collection is namespaced to a tenant. Cosavu Index and Cosavu Store are completely isolated — one tenant's data never touches another's.

02

Ingest anything

Upload PDFs, DOCX, plain text, code files, PPTX, XLSX, and CSV. A universal parser extracts and chunks each file automatically.

03

Context budget

Returned chunks are trimmed to a configurable token budget using sentence-level extraction — your model never receives a truncated mid-thought.

04

Safety on every query

A moderation gate runs before retrieval. Harmful queries are blocked at the boundary before any data is accessed.

Get started

Start retrieving with DataAPI.

Get a tenant key and connect your knowledge base in minutes. No infrastructure to manage.