Baseten Blog | Page 11
Baseten achieves SOC 2 Type II certification
Baseten, an MLOps platform for model deployment & fine-tuning, now boasts SOC 2 type 2 certification, ensuring data security, privacy, and confidentiality.
Technical deep dive: Truss live reload
Truss' live reload feature revolutionizes iterative development, turning the lengthy 3-30 minute model deployment process into an almost instant task.
New in January 2023
Deploy multiple model versions, model resource management, a cleaner Truss DX, and more.
Choosing the right horizontal scaling setup for high-traffic models
Horizontal scaling via replicas with load balancing is an important technique for handling high traffic to an ML model.
How to choose the right instance size for your ML models
This post simplifies instance sizing with heuristics to choose an optimal size for your model, balancing performance and compute cost.
New in December 2022
2022's rapid ML advancements felt like a decade. Excited for 2023, we anticipate foundational models will further empower scientists and developers in ML apps.
Serving four million Riffusion requests in two days
Riffusion is a fine-tuned version of Stable Diffusion. Baseten served Riffusion over four million times in a couple of days, serving top-of-hacker-news traffic.
Accelerating model deployment: 100X faster dev loops with development deployments
Baseten's development deployments speed up ML model dev loops, replacing slow workflows with a live reload system for quick, seconds-long testing updates.
New in October: Find community with The DSC
October was a big month for the ML industry, with more momentum than ever behind spooky-good models and novel applications