Low-latency prompt compression

Save on LLM input tokens before every request.

Meet frontier models with frontier query planning. Vrbose shortens prompts before they reach your model, helping teams cut input token costs with a simple API and an evolving, cutting-edge compression algorithm.
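To show where compression fits in an LLM pipeline, here is a minimal sketch. The function name and the toy whitespace-and-filler compressor are illustrative stand-ins, not Vrbose's actual API or algorithm; a real integration would call the compression endpoint instead.

```python
import re

def compress_prompt(prompt: str) -> str:
    """Toy stand-in for a compression service: drop common filler
    phrases and collapse whitespace before the prompt is sent to
    the model. (Illustrative only -- not Vrbose's algorithm.)"""
    filler = ["I would like you to", "please", "kindly"]
    out = prompt
    for phrase in filler:
        out = re.sub(re.escape(phrase), "", out, flags=re.IGNORECASE)
    return re.sub(r"\s+", " ", out).strip()

prompt = "I would like you to please summarize the report."
short = compress_prompt(prompt)
print(short)  # shorter prompt, same intent
```

The key property is that the compressed prompt preserves the instruction while using fewer input tokens, which is where the per-request savings come from.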

How do you save on token input costs? You Vrbose!
Free plan: 20 queries/day
Paid plan: $0.01/query
Potential savings: up to 50%