We understand that you may have metrics that are highly specific to your use case, and that you want to use them alongside standard metrics to evaluate your AI systems. That’s why we built custom metrics. You can now upload any custom metric to Openlayer simply by specifying it in your openlayer.json file. These metrics can then be used in a number of ways on the platform: they’ll show up as metric tests that you can run, or as project-wide metrics that are computed on all of your data. Now, your evals on Openlayer are more comprehensive than ever.
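To make the idea concrete, here is a minimal sketch of the kind of use-case-specific metric you might want to track. It is plain Python and does not use the Openlayer SDK or the openlayer.json schema; the function name `concise_answer_rate`, the `output` column, and the 50-word budget are all hypothetical choices made for illustration.

```python
from typing import Iterable, Mapping


def concise_answer_rate(rows: Iterable[Mapping[str, str]], max_words: int = 50) -> float:
    """Return the fraction of rows whose `output` has at most `max_words` words.

    The `output` column name and the default word budget are illustrative
    assumptions, not part of any Openlayer API.
    """
    rows = list(rows)
    if not rows:
        # No data: report 0.0 rather than dividing by zero.
        return 0.0
    concise = sum(1 for row in rows if len(row["output"].split()) <= max_words)
    return concise / len(rows)


# Two sample rows: one short answer and one 80-word answer.
sample = [
    {"output": "Paris is the capital of France."},
    {"output": "word " * 80},
]
print(concise_answer_rate(sample))  # 0.5
```

A metric like this returns a single number per dataset, which is the shape that lends itself both to a threshold-based test and to a value tracked across all of your data.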
We’ve also shipped more features and improvements this month, including the ability to create multiple API keys and a number of new models available for direct-to-API calls, so be sure to read below for the full list of updates!
Features
• Evals: Custom metrics (upload your own custom metrics to Openlayer, which can be used as project-wide metrics or as tests)
• API: Create multiple Openlayer API keys (create new personal Openlayer API keys so that you can rotate, rename, and delete keys)
• API: Specify desired metrics in openlayer.json
• API: New models available for direct-to-API calls (GPT-4o, GPT-4 Turbo, Claude 3.5 Sonnet, Claude 3 Haiku, Claude 3 Opus, Claude 3 Sonnet, Command R, Command R Plus, Gemini 1.0 Flash, Gemini 1.5 Flash, Gemini 1.5 Pro)
Improvements
• UI/UX: Test creation page design improvements
• Integrations: Link to git repository and organization in git settings pages
• UI/UX: Add button to view status of commit during processing in project loading state
• UI/UX: Updated solid danger buttons’ shade of red
• UI/UX: Modal background overlay opacity is no longer too light
• UI/UX: Toast messages no longer overflow the page
• UI/UX: Improved text sizing in various places
• UI/UX: Added error toast when Assistant requests fail
• UI/UX: Navigation polish
• UI/UX: Different icons for different commit sources
• UI/UX: Suggested titles for GPT evaluation tests now reference the criteria name
• SDKs: Improvements to docs (updated Python code snippets with the new SDK syntax, tracing for Anthropic models, updated example notebook links)
Fixes
• UI/UX: Commit log processing time did not use relative time
• Collaboration: New users who were invited to a workspace did not auto-navigate to the invites page
• UI/UX: Help breadcrumb was hidden and showed in place of the user dropdown options
• UI/UX: Cost values close to 0 were rendered as $0.00
• UI/UX: Progress bars did not render in Chrome
• UI/UX: Test metadata disappeared entirely when collapsed
• UI/UX: Navigating to a project from the breadcrumb prevented back navigation
• UI/UX: Switching projects prevented back navigation
• UI/UX: Creating a commit from the UI did not generate outputs
• UI/UX: Commit processing icon was broken in navigation dropdowns
• UI/UX: Activity log overflowed the screen