Announcing our $14.5M Series A!
Read the blog post

Improved test diagnosis page, SAML SSO, design refreshes, + more

Improved test diagnosis page, SAML SSO, design refreshes, + more

🔎🩹 Quickly identify issues with the improved test diagnosis page Diagnosing issues is a core part of the eval process, and that’s why we want to make sure our test diagnosis page is as helpful as possible. To make it easier to figure out why a test has been skipped or errored, we’ve added a list view where you can easily scan through test results and view any related error messages. We’ve added an overview so you can see the test result breakdown at a glance, as well as recent issues and failures. We’ve also made each section of the page collapsible for a smoother experience.

🔐 Make your login even more secure by enabling SAML SSO We now support SAML SSO using any of the major providers so that you can make sure your team’s workspace has that extra layer of security.

Features

  • Collaboration
    SAML SSO Support
  • Platform
    List view for results on test diagnosis page (error messages for skipped and errored tests are now visible, ability to filter test results by type, test results overview at the top of the page which lists the total number of results for each status type and recent issues with the test results)

Improvements

  • Documentation
    Groq guide available in docs
  • Integrations
    Support for Azure OpenAI as an LLM evaluator
  • UI/UX
    Login page design refresh
  • UI/UX
    Sections in test diagnosis page are collapsible
  • UI/UX
    More informative tooltips on test cards
  • UI/UX
    Homepage overview polishes
  • Evals
    Additional Ragas metrics (faithfulness, answer correctness)
  • Observability
    Updated cost table for OpenAI models
  • UI/UX
    Notifications for new inference pipelines now list the name of the pipeline
$ openlayer push

Stop guessing. Ship with confidence.

The automated AI evaluation and monitoring platform.