Announcing our $14.5M Series A!
Read the blog post

SAML Directory Sync, new LLM-as-a-judge models, and website refresh

SAML Directory Sync, new LLM-as-a-judge models, and website refresh

We’ve added lots of features and enhancements across our platform, focused on improving performance, expanding functionality, and streamlining workflows. To highlight a few:

🔐 Increased security with SAML SSO directory sync. You can now sync SAML SSO on Openlayer with existing security groups. Openlayer can now more seamlessly fit into your organization’s security policies.

🧑‍⚖️ New LLM-as-a-judge models. We’ve expanded the models available to act as judges for LLM-as-a-judge tests. You can now use models from Cohere and Vertex AI when running these tests.

🎨 Website refresh. We’ve given our website a brand refresh, including lots of fun animations showcasing the Openlayer product in action and case studies from some of our customers.

Features

  • SDKs
    Faster batch uploads with pyarrow support
  • SDKs
    Push commits to the platform via the Python SDK
  • UI/UX
    Tabular view of test results in test modals
  • UI/UX
    Add pie graph for test results in project home
  • Evals
    Add Faithfulness and Answer Correctness metrics for RAG systems
  • Platform
    Use Cohere, Vertex AI models as options for LLM-as-a-judge metrics
  • API
    Add `expand` to inference pipeline GETs so projects and workspaces are included in the response body
  • Platform
    New "Viewer" role in workspaces that doesn’t have write, update or delete permissions on resources
  • SDKs
    Support for async data uploads, and faster upload speeds
  • Platform
    Directory sync with SAML

Improvements

  • API
    Lower latency for data stream endpoint
  • UI/UX
    Update tooltips and rendering of statuses in test cards
  • UI/UX
    Make sections in test modals collapsible
  • API
    Add skipped and failing test counts in project version and inference pipeline objects
  • API
    Better error messages for invalid data configs when streaming data
  • Platform
    More intuitive status messages for skipped tests
  • Documentation
    Add code samples in Java

Fixes

  • Platform
    Generate outputs step was not failing gracefully
  • UI/UX
    Surface user-facing error messages upon SSO login failures
  • UI/UX
    Better failure message when password reset link has expired
  • Platform
    Improved rate limiting
  • Integrations
    Slack notifications for create pipeline now includes name
  • Platform
    Answer Correctness metric was breaking when output was not a string
$ openlayer push

Stop guessing. Ship with confidence.

The automated AI evaluation and monitoring platform.

We value your privacy

We use cookies to enhance your browsing experience, serve personalized content, and analyze our traffic.