Automatic deactivation for inactive deployments
Deployments that haven't received any traffic and have been scaled to zero for over two weeks will now be automatically deactivated. However, production deployments will remain unaffected by this...
See our latest feature releases, product improvements and bug fixes
Aug 21, 2024
Deployments that haven't received any traffic and have been scaled to zero for over two weeks will now be automatically deactivated. However, production deployments will remain unaffected by this...
Aug 21, 2024
The model overview page now includes new filtering and sorting options to help you find models more easily. You can filter by running models, scaled-to-zero, inactive, failed, and chains....
Aug 20, 2024
We're excited to announce the launch of the Playground —an enhanced and rebranded version of the "Call Model" dialog. The Playground is now accessible from anywhere within a model, allowing you to...
Aug 19, 2024
We've made it easier to zoom in on specific time periods in your metrics. Now, if you spot an anomaly, simply drag over the period, and the time range will automatically update to focus on that...
Aug 18, 2024
To make debugging and log analysis easier, you can now expand logs to full screen, giving you more space to work and breathe while providing a clearer, more comprehensive view of the logs when you...
Aug 17, 2024
We've shipped several small quality-of-life improvements for managing API keys when creating workspace keys for specific models. You can now filter the list, making it easier to find the right key,...
Aug 16, 2024
We've heard your feedback! You can now easily see which chainlet you're filtering by when viewing logs and metrics. We've also improved the filter popover, making it simpler to navigate and filter in...
Aug 15, 2024
Interested in running async inference ? We've made it easier to get started by adding a code example in the new API endpoint modal. Just click API in the top right of the page and toggle to Async API...
Aug 14, 2024
To handle massive bursts of traffic, we've helped user scale to hundreds of replicas on model deployments. To make it easier to set minimums and maximums for large numbers of replicas, we've added...
Jul 9, 2024
By popular demand: you can now pin models and chains to the top of your workspace! To pin an item, click into the ... menu on any model or chain and select Pin . You can pin up to 6 items for quick...