Announcing our $14.5M Series A!
Read the blog post

SAML Directory Sync, new LLM-as-a-judge models, and website refresh

SAML Directory Sync, new LLM-as-a-judge models, and website refresh

We’ve added lots of features and enhancements across our platform, focused on improving performance, expanding functionality, and streamlining workflows. To highlight a few:

🔐 Increased security with SAML SSO directory sync. You can now sync SAML SSO on Openlayer with existing security groups. Openlayer can now more seamlessly fit into your organization’s security policies.

🧑‍⚖️ New LLM-as-a-judge models. We’ve expanded the models available to act as judges for LLM-as-a-judge tests. You can now use models from Cohere and Vertex AI when running these tests.

🎨 Website refresh. We’ve given our website a brand refresh, including lots of fun animations showcasing the Openlayer product in action and case studies from some of our customers.

Features

  • SDKs
    Faster batch uploads with pyarrow support
  • SDKs
    Push commits to the platform via the Python SDK
  • UI/UX
    Tabular view of test results in test modals
  • UI/UX
    Add pie graph for test results in project home
  • Evals
    Add Faithfulness and Answer Correctness metrics for RAG systems
  • Platform
    Use Cohere, Vertex AI models as options for LLM-as-a-judge metrics
  • API
    Add `expand` to inference pipeline GETs so projects and workspaces are included in the response body
  • Platform
    New "Viewer" role in workspaces that doesn’t have write, update or delete permissions on resources
  • SDKs
    Support for async data uploads, and faster upload speeds
  • Platform
    Directory sync with SAML

Improvements

  • API
    Lower latency for data stream endpoint
  • UI/UX
    Update tooltips and rendering of statuses in test cards
  • UI/UX
    Make sections in test modals collapsible
  • API
    Add skipped and failing test counts in project version and inference pipeline objects
  • API
    Better error messages for invalid data configs when streaming data
  • Platform
    More intuitive status messages for skipped tests
  • Documentation
    Add code samples in Java

Fixes

  • Platform
    Generate outputs step was not failing gracefully
  • UI/UX
    Surface user-facing error messages upon SSO login failures
  • UI/UX
    Better failure message when password reset link has expired
  • Platform
    Improved rate limiting
  • Integrations
    Slack notifications for create pipeline now includes name
  • Platform
    Answer Correctness metric was breaking when output was not a string
$ openlayer push

Stop guessing. Ship with confidence.

The automated AI evaluation and monitoring platform.