A log of all the changes and improvements made to our app

Subscribe to the changelog

December 21st, 2023

Log multi-turn interactions, sort and filter production requests, and token usage and latency graphs

Row-by-row observability over your production requests 🔍

Introducing support for multi-turn interactions. You can now log and refer back to the full chat history of each of your production requests in Openlayer. Sort by timestamp, token usage, or latency to dig deeper into your AI’s usage. And view graphs of these metrics over time.

There’s more: we now support Google’s new Gemini model. Try out the new model and compare its performance against others.

⬇️ Read the full changelog below for all the tweaks and improvements we’ve shipped over the last few weeks and, as always, stay closer to our development journey by joining our Discord!

New features

  • Log multi-turn interactions in monitoring mode, and inspect individual production requests to view the full chat history alongside other meta like token usage and latency
  • Sort and filter through your production requests
  • View a graph of the token usage and latency across all your requests over time
  • Support for Gemini is now available in-platform: experiment with Google’s new model and see how it performs on your tests
  • View row-by-row explanations for tests using GPT evaluation


  • Expanded the Openlayer TypeScript/JavaScript library to support all methods of logging requests, including those using other providers or workflows than OpenAI
  • Improved commit selector shows the message and date published for each commit
  • New notifications for uploading reference datasets and data limits exceeded in monitoring mode
  • Only send email notifications when test statuses have changed from the previous evaluation in monitoring
  • Added sample projects for monitoring
  • Enhancements to the onboarding, including a way to quickstart a monitoring project by sending a sample request through the UI
  • No longer navigate away from the current page when toggling between development and monitoring, unless the mode does not apply to the page
  • Allow reading and setting project descriptions from the UI
  • Update style of selected state for project mode toggles in the navigation pane for clarity
  • Clarify that thresholds involving percentages currently require inputting floats
  • Allow computing PPS tests for columns other than the features
  • Test results automatically update without having to refresh the page in monitoring mode
  • Add dates of last/next evaluation to monitoring projects and a loading indication when they recompute
  • Surface error messages when tests fail to compute
  • Add callouts for setting up notifications and viewing current usage against plan limits in the navigation
  • Graphs with only a single data point have a clearer representation now
  • Improvements to the experience of creating tests with lots of parameters/configuration
  • Improvements to the experience of creating Great Expectations tests
  • Add alert when using Openlayer on mobile
  • Default request volume, token usage, and latency graphs to monthly view

Bug fixes

  • Title suggestions for certain tests during creation were unavailable or inaccurate
  • Fixes to test parameters, including incorrectly labeled and invalid options
  • Certain LLM tests would not allow selecting target columns that are not input variables
  • Code in development onboarding modals was not syntax highlighted
  • Create test card content would overflow improperly
  • Sample projects would not show button for creating suggested tests after some were created
  • Graphs in monitoring test cards were cut off
  • Requests table would break when rows were missing columns
  • Full-screen onboarding pages would not allow scrolling when overflowed
  • Options were sometimes duplicated in heatmap dropdowns
  • Thresholds would not faithfully appear in test result graphs
  • Skipped evaluations would not appear in test result graphs