Products
Two products.
One intelligence layer.
Cosavu ships two composable APIs. Use one or both — they're designed to work together as the context layer in front of any LLM deployment.
Every token is a cost. Cut the ones that don't matter.
ContextAPI sits between your application and any LLM. It parses every prompt into typed blocks, runs them through the STAN-1-Mini RL meta-controller, and emits an optimised package — fewer tokens, same intent.
- ›STAN-1-Mini RL optimisation engine
- ›5-stage prompt compilation pipeline
- ›cosavu-small / medium / large model tiers
- ›Code audit + production readiness scoring
- ›PII scrubbing & token budget enforcement
Your model gets context, not fragments.
DataAPI indexes your knowledge base and returns distilled, query-relevant context to your LLM — not a bag of raw fragments. Multi-tenant isolation, S3 persistence, and three retrieval tiers built for production.
- ›CAR-0 · CAR-1 · CAR-1.5 retrieval systems
- ›Engram n-gram + semantic re-ranking filter
- ›STAN-prefilled context distillation
- ›Multi-tenant isolated Cosavu Index + Store
- ›Safety moderation on every query
Better together
ContextAPI + DataAPI —
full stack context intelligence.
Run DataAPI to retrieve distilled, query-relevant context from your knowledge base. Pipe that context through ContextAPI before it reaches your LLM — compressing it further and enforcing token budgets.
The result: your model receives a lean, structured, governance-checked payload. No prompt injection vectors. No runaway token storms. No glue code.
Full stack context intelligence — in two API calls.