Baseten Chains is now GA: Deploy ultra-low-latency compound AI systems

We’re thrilled to announce the general availability of Baseten Chains for production compound AI! Since our beta launch, Chains has gained improved performance, greater robustness, and an even more delightful developer experience.

Chains enables you to:

  • Call a sequence of models and processing steps without incurring excess latency

  • Modularize complex workflows, with custom hardware and autoscaling allocated to each step, while keeping them cohesive

  • Abstract away complex model orchestration

Deploy any compound AI system with Chains and gain the optimized model performance and elastic horizontal scaling we specialize in. Building complex, multi-model workflows is as simple as calling local, type-safe Python functions.
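To give a feel for what "calling local, type-safe Python functions" can look like, here is a minimal sketch in plain Python. It does not use the Chains SDK; the names `transcribe`, `summarize`, `pipeline`, `Transcript`, and `Summary` are hypothetical stand-ins, and the two "models" are stubs. It only illustrates the general idea of composing typed steps and running independent inputs concurrently.

```python
import asyncio
from dataclasses import dataclass


@dataclass
class Transcript:
    text: str


@dataclass
class Summary:
    text: str
    word_count: int


async def transcribe(audio_id: str) -> Transcript:
    # Hypothetical stand-in for a speech-to-text model call.
    return Transcript(text=f"transcript of {audio_id}")


async def summarize(transcript: Transcript) -> Summary:
    # Hypothetical stand-in for an LLM call: keep only the first two words.
    words = transcript.text.split()
    return Summary(text=" ".join(words[:2]), word_count=len(words))


async def pipeline(audio_ids: list[str]) -> list[Summary]:
    # Each input flows through both steps; separate inputs run concurrently.
    async def process_one(audio_id: str) -> Summary:
        return await summarize(await transcribe(audio_id))

    return list(await asyncio.gather(*(process_one(a) for a in audio_ids)))


if __name__ == "__main__":
    for summary in asyncio.run(pipeline(["a1", "a2"])):
        print(summary)
```

Because every step has typed inputs and outputs, a type checker can catch a mismatch between steps before anything is deployed. In a real compound AI system, each step would call a separately deployed and scaled model rather than a local stub.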

Check out our launch blog to learn more, and join us live on March 6th to see Chains in action!
