added

Llama 3.3 70b & Llama Guard 3 are now available through Together AI

Experience the power of the latest Llama 3.3 70b and Llama Guard 3 models on Orq, integrated via Together AI.

improved

New Layout with Project Structure

We’re introducing a new project structure UI to help you organize and manage your resources more effectively. With projects, you can group your work by use case, environment, or any logical structure that suits your needs.

added

Online Guardrails in Live Deployments

You can now configure Guardrails after you have added them to your Library directly in Deployments > Settings for both input and output, giving you full control over live deployments.

added

HTTP and JSON Evaluators and Guardrails

You can now add HTTP and JSON Evaluators and Guardrails under the Evaluator tab and add them to your Deployment or Experiment.

added

Master Your RAG with RAGAS Evals

The RAGAS Evaluators are now available, providing specialized tools to evaluate retrieval-augmented generation (RAG) workflows. These evaluators make it easy to set up quality checks when integrating a Knowledge Base into a RAG system and can be used in Experiments and Deployments to ensure responses are accurate, relevant, and safe.

added

Evaluator Library: 50+ Ready-to-Use and Tailorable Evaluators

Introducing the new Evaluator Library:

improved

Improved LLMs as a Judge

LLM-as-a-Judge Enhancements:
We’ve significantly improved our existing LLM-as-a-Judge feature to provide more robust evaluation capabilities and enforce type-safe outputs.

added

Online Evaluators in Live Deployments

You can now configure Evaluators after you have added them to your Library directly in Deployments > Settings for both input and output, giving you full control over live deployments.

added

Cache your LLM response

Cache and reuse your LLM outputs for near-instant responses, reduced costs, and consistent results.

added

Structured Outputs on Azure

Ensure the model’s responses always stick to your JSON Schema. No more missing fields or incorrect values, making your AI applications more reliable and robust.