Lead Developer Advocate

Philip Kiely

How to serve your ComfyUI model behind an API endpoint

This guide details deploying ComfyUI image generation pipelines via API for app integration, using Truss for packaging & production deployment.

Het Trivedi

1 other

Model: SDXL + ControlNet, Prompt: A top down view of a river through the woods

GPU guides

NVIDIA A10 vs A10G for ML model inference

The A10, an Ampere-series GPU, excels in tasks like running 7B parameter LLMs. AWS's A10G variant, similar in GPU memory & bandwidth, is mostly interchangeable.

Philip Kiely

Hacks & projects

GPT vs Llama: Migrate to open source LLMs seamlessly

Use ChatCompletions API to test open-source LLMs like Llama in your AI app with just three minor code modifications.

Sid Shanker

1 other

Prompt: A sturdy stone bridge under a full moon, warm colors

ML models

Open source alternatives for machine learning models

Building on top of open source models gives you access to a wide range of capabilities that you would otherwise lack from a black box endpoint provider.

Varun Shenoy

1 other

Prompt: An open door leading to a beautiful garden

ML models

A checklist for switching to open source ML models

Transitioning from using ML models via closed source APIs to open source ML models? This checklist provides all necessary resources for the shift.

Philip Kiely

Prompt: Luggage on a trolley in a historic train station

Model performance

A guide to LLM inference and performance

Learn if LLM inference is compute or memory bound to fully utilize GPU power. Get insights on better GPU resource utilization.

Varun Shenoy

1 other

Prompt: A glowing floating book of runes

ML models

Pinning ML model revisions for compatibility and security

Pin versions of open source packages like PyPi's transformers to avoid breaking changes or security issues; similarly, pin model revisions for stability.

Philip Kiely

Prompt: A green pushpin in an old-fashioned map

ML models

Deployment and inference for open source text embedding models

Text embedding models convert text into semantic vectors. Numerous open source models cater to search, recommendation, classification & LLM-augmented retrieval.

Philip Kiely

Prompt: a typewriter embedded in a motherboard

1…5 6 7

‌

‌
‌
‌

‌

‌
‌
‌

‌

‌
‌
‌

‌

‌
‌
‌

‌

Machine learning infrastructure that just works

Baseten provides all the infrastructure you need to deploy and serve ML models performantly, scalable, and cost-efficiently.