Baseten Blog | Page 11

News

Baseten achieves SOC 2 Type II certification

Baseten, an MLOps platform for model deployment & fine-tuning, now boasts SOC 2 type 2 certification, ensuring data security, privacy, and confidentiality.

Product

Technical deep dive: Truss live reload

Truss' live reload feature revolutionizes iterative development, turning the lengthy 3-30 minute model deployment process into an almost instant task.

Product

New in January 2023

Deploy multiple model versions, model resource management, a cleaner Truss DX, and more.

GPU guides

Choosing the right horizontal scaling setup for high-traffic models

Horizontal scaling via replicas with load balancing is an important technique for handling high traffic to an ML model.

GPU guides

How to choose the right instance size for your ML models

This post simplifies instance sizing with heuristics to choose an optimal size for your model, balancing performance and compute cost.

Product

New in December 2022

2022's rapid ML advancements felt like a decade. Excited for 2023, we anticipate foundational models will further empower scientists and developers in ML apps.

Hacks & projects

Serving four million Riffusion requests in two days

Riffusion is a fine-tuned version of Stable Diffusion. Baseten served Riffusion over four million times in a couple of days, serving top-of-hacker-news traffic.

Product

Accelerating model deployment: 100X faster dev loops with development deployments

Baseten's development deployments speed up ML model dev loops, replacing slow workflows with a live reload system for quick, seconds-long testing updates.

Product

New in October: Find community with The DSC

October was a big month for the ML industry, with more momentum than ever behind spooky-good models and novel applications