Your tailored LLM Routing & Caching solution, powered by clustering insights

Balance performance and cost, increase speed and uptime, and gain enhanced data visibility

OUR SOLUTIONS

Decipher, reuse, and route your prompts for optimal cost-efficiency

Ensure the profitability and scalability of your AI projects

Cluster-Based LLM Router

Unlike other routers that direct your queries based on generic LLM model benchmarks, our cluster-based LLM Router assesses LLM performance and costs tailored to your unique datasets. It categorizes your specific prompts into semantically similar clusters, each representing a distinct user pattern, and evaluates various LLM models against your current LLM model on a cluster-by-cluster basis. Queries are then intelligently routed to the most suitable model based on your predefined balance between performance and cost. This customized approach ensures optimal accuracy and reliability in routing your unique queries.
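
To make the mechanism concrete, here is a minimal sketch of cluster-based routing using sentence-transformers and scikit-learn. The model names, evaluation scores, costs, and the COST_WEIGHT tradeoff below are illustrative placeholders, not Aguru’s internals.

```python
# A minimal sketch of cluster-based routing, assuming sentence-transformers
# and scikit-learn. Model names, scores, and costs are illustrative only.
from sentence_transformers import SentenceTransformer
from sklearn.cluster import KMeans

encoder = SentenceTransformer("all-MiniLM-L6-v2")

# 1. Embed historical prompts and group them into semantic clusters.
historical_prompts = ["refund my order", "cancel my order",
                      "explain my bill", "translate this email"]
embeddings = encoder.encode(historical_prompts)
kmeans = KMeans(n_clusters=2, n_init="auto", random_state=0).fit(embeddings)

# 2. Per cluster, pick the model with the best quality/cost tradeoff.
#    These scores/costs stand in for offline per-cluster evaluation results.
candidates = {"large-model": {"score": 0.95, "cost": 10.0},
              "small-model": {"score": 0.90, "cost": 1.0}}
COST_WEIGHT = 0.005  # your predefined balance between performance and cost

def best_model_for_cluster(cluster_id: int) -> str:
    # In practice each cluster has its own evaluation table; for brevity
    # this toy example reuses one table for every cluster.
    return max(candidates,
               key=lambda m: candidates[m]["score"] - COST_WEIGHT * candidates[m]["cost"])

# 3. Route a new prompt via its nearest cluster.
def route(prompt: str) -> str:
    cluster_id = int(kmeans.predict(encoder.encode([prompt]))[0])
    return best_model_for_cluster(cluster_id)

print(route("please cancel my subscription"))  # -> e.g. "small-model"
```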

Efficient LLM Caching

Alongside our LLM Router, our efficient LLM Caching solution minimizes unnecessary LLM expenses for AI applications that frequently handle repetitive queries. It reuses past LLM outputs for semantically similar new prompts via semantic search, and lets you define the required degree of similarity between new and cached prompts, ensuring an optimal balance of quality and cost. This also reduces computational resources, accelerates response times, and helps manage throughput and requests-per-second (RPS) constraints.
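
As an illustration of the underlying idea, here is a minimal semantic-cache sketch built on sentence-transformers. The SIMILARITY_THRESHOLD constant plays the role of the configurable degree of similarity; a production cache would add a vector index, eviction, and TTLs.

```python
# A minimal semantic-cache sketch, assuming sentence-transformers.
# SIMILARITY_THRESHOLD models the configurable similarity setting.
import numpy as np
from sentence_transformers import SentenceTransformer

encoder = SentenceTransformer("all-MiniLM-L6-v2")
SIMILARITY_THRESHOLD = 0.90  # higher = stricter quality, fewer cache hits

cache: list[tuple[np.ndarray, str]] = []  # (prompt embedding, cached response)

def cosine(a: np.ndarray, b: np.ndarray) -> float:
    return float(np.dot(a, b) / (np.linalg.norm(a) * np.linalg.norm(b)))

def lookup(prompt: str) -> str | None:
    """Return a cached response if a semantically similar prompt exists."""
    query = encoder.encode(prompt)
    hits = [(cosine(query, emb), resp) for emb, resp in cache]
    if hits:
        score, response = max(hits)
        if score >= SIMILARITY_THRESHOLD:
            return response
    return None

def store(prompt: str, response: str) -> None:
    cache.append((encoder.encode(prompt), response))

store("What is your refund policy?", "Refunds are issued within 14 days.")
print(lookup("How do refunds work?"))  # cache hit if similarity >= threshold
```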

Insightful Data Clustering & Visualization

Our Clustering functionality transforms large, unstructured data into semantically meaningful clusters, providing deep insights into your user interaction patterns. It offers a comprehensive view of your data landscape, including a distinct ‘noise’ group for semantically non-conforming data inputs. Each cluster comes with vital metrics such as cost, performance scores of each LLM model, cluster quality, and inter-cluster similarity. These intuitive yet in-depth insights facilitate rapid cost and performance assessments per cluster, and enable swift detection of outliers and anomalies for further app enhancements.
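
As a rough sketch of how such clustering with an explicit noise group can work, the example below uses HDBSCAN from scikit-learn (>= 1.3), which labels non-conforming points as -1. The metrics shown, silhouette as cluster quality and centroid cosine similarity as inter-cluster similarity, are stand-ins for Aguru’s actual metrics.

```python
# A rough sketch of density-based clustering with a 'noise' group, assuming
# scikit-learn >= 1.3 and sentence-transformers; metrics are illustrative.
import numpy as np
from sentence_transformers import SentenceTransformer
from sklearn.cluster import HDBSCAN
from sklearn.metrics import silhouette_score
from sklearn.metrics.pairwise import cosine_similarity

encoder = SentenceTransformer("all-MiniLM-L6-v2")
prompts = [
    "cancel my order", "refund my order", "track my order",
    "what's the weather today", "weather forecast tomorrow", "is it raining",
    "asdf qwerty zxcv",  # likely to land in the noise group
]
embeddings = encoder.encode(prompts)

# HDBSCAN labels semantically non-conforming inputs as -1 ('noise').
labels = HDBSCAN(min_cluster_size=2).fit_predict(embeddings)
mask = labels != -1

if mask.sum() and len(set(labels[mask])) > 1:
    quality = silhouette_score(embeddings[mask], labels[mask])  # cluster quality
    centroids = np.stack([embeddings[labels == c].mean(axis=0)
                          for c in sorted(set(labels[mask]))])
    print(f"cluster quality (silhouette): {quality:.2f}")
    print("inter-cluster similarity:\n", cosine_similarity(centroids))
```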

TRY AGURU

Curious about how our solution will work for your AI applications?

FAQ

Frequently Asked Questions

How do I implement Aguru?

Aguru connects to your LLM model through an API and is compatible with Python and Node.js environments. It’s built for fast, easy implementation. Once you’ve created your trial account, you’ll see a clear user guide that walks you through the integration in just a few clicks.
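
Integration specifics live in the in-app user guide. Purely as an illustration of the common gateway pattern, a Python setup might point an OpenAI-compatible client at a proxy endpoint; the URL and environment variable names below are hypothetical placeholders, not Aguru’s documented API.

```python
# Illustrative only: a generic gateway-style integration, assuming an
# OpenAI-compatible proxy. The env var names are hypothetical placeholders.
import os
from openai import OpenAI

client = OpenAI(
    base_url=os.environ["LLM_GATEWAY_URL"],  # hypothetical proxy endpoint
    api_key=os.environ["LLM_GATEWAY_KEY"],   # hypothetical credential
)

response = client.chat.completions.create(
    model="gpt-4o-mini",
    messages=[{"role": "user", "content": "Hello!"}],
)
print(response.choices[0].message.content)
```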

Can I see how Aguru works before applying it to my live queries?

Of course. You have three options:
1. Set up a demo: We’ll schedule an online demo to give you a thorough walkthrough of Aguru’s LLM Router, Caching, and Clustering, and answer any of your questions.
2. Use observation mode: By simply deactivating LLM Router, Caching, or both, you’ll activate observation mode. You’ll see how your new queries would be answered from the cache, and/or how the answers from other LLM models compare to your original LLM, without the feature(s) affecting your new queries.
3. Use historical data: Instead of applying the functionality to new prompts, you can upload your historical dataset into Aguru to see how Aguru’s LLM Router and Caching work. This feature isn’t activated in trial accounts by default, but can be added quickly. If you prefer this option, contact us.

How do you measure LLM performance?

We use BERTScore to measure the output quality of the response to each query. In our LLM Router, you’ll see how different LLM models score on the same query, with 1 being the highest possible score.
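
For reference, the open-source bert-score package computes the same metric; the candidate and reference strings below are illustrative.

```python
# A minimal BERTScore example using the open-source bert-score package.
from bert_score import score

candidates = ["Refunds are processed within 14 days."]
references = ["We issue refunds within two weeks."]

# P, R, F1 are tensors; F1 is the usual headline number, 1 being the best.
P, R, F1 = score(candidates, references, lang="en")
print(f"BERTScore F1: {F1.item():.3f}")
```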

What happens when LLM Router and Caching are both activated?

When LLM Router and Caching are both activated, Aguru first compares a new prompt to past ones to see if any past response can be reused. If not, it passes the new query to LLM Router, which routes it to a more cost-effective model based on your quality and cost tradeoff threshold. Coupling LLM Router and Caching brings enhanced cost optimization to businesses that constantly receive semantically repetitive queries and a substantial volume of queries overall.
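
Putting the two earlier sketches together, the combined flow might look like the following; lookup, store, and route come from the cache and router sketches above, and call_llm is a hypothetical stand-in for your actual model invocation.

```python
# A minimal sketch of the cache-first, route-second flow, reusing lookup,
# store, and route from the earlier sketches; call_llm is hypothetical.
def call_llm(model: str, prompt: str) -> str:
    raise NotImplementedError("stand-in for your actual LLM API call")

def answer(prompt: str) -> str:
    cached = lookup(prompt)        # 1. reuse a semantically similar response
    if cached is not None:
        return cached
    model = route(prompt)          # 2. route to the best model for this cluster
    response = call_llm(model, prompt)
    store(prompt, response)        # 3. cache for future repetitive queries
    return response
```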

Can I experiment with just one feature?

Of course you can. All three features are included in the trial, and it’s your decision which features to experiment with. It takes only a click to activate or deactivate LLM Router or Caching. Clustering stays active at all times to give you visibility into how users interact with your app.

How long does the trial last?

We’re flexible with the trial length. Our primary goal is to build a solution that truly adds value to your business, so take the time you need to test the product, share your feedback, and tell us which features you’ll need next.

WHO WE ARE

Our Team

Designed and built by passionate AI engineers. Backed by technology business leaders.

Oleg Smirnov

Product & Engineering

Derek O’Carroll

Funding Strategy, Operations & Investment

Nick Shaw

GTM Strategy & Revenue