Python Evaluators
When building AI features, ensuring high-quality and reliable outputs is crucial. Orq.ai allows you to implement custom evaluators in Python, giving you full control over how AI-generated content is assessed and validated.
Benefits of Using Python for Evaluators
- Flexibility & Customization
Python enables you to define evaluation logic that precisely matches your needs, whether it’s scoring relevance, detecting biases, or enforcing style guidelines.
- Seamless Integration
Orq.ai supports direct integration of Python-based evaluators, allowing you to run checks on AI outputs within your workflow without extra tooling.
- Preloaded with NumPy
Your Python evaluators can leverage NumPy (v1.26.4) for numerical computations, making it easier to apply statistical methods, calculate custom scores, or analyze AI responses efficiently.
- Automated & Scalable
Python evaluators run automatically on AI responses, ensuring continuous quality control and reducing manual review efforts.

Example python eval
Read here how to set up a custom Python Evaluator here: docs