Baseten Blog | Page 9

Topics

Latest Model performance Hacks & projects GPU guides ML models Glossary Community Product News

Open source alternatives for machine learning models

Building on top of open source models gives you access to a wide range of capabilities that you would otherwise lack from a black box endpoint provider.

Varun Shenoy

1 other

Prompt: An open door leading to a beautiful garden

ML models

A checklist for switching to open source ML models

Transitioning from using ML models via closed source APIs to open source ML models? This checklist provides all necessary resources for the shift.

Philip Kiely

Prompt: Luggage on a trolley in a historic train station

Model performance

A guide to LLM inference and performance

Learn if LLM inference is compute or memory bound to fully utilize GPU power. Get insights on better GPU resource utilization.

Varun Shenoy

1 other

Prompt: A glowing floating book of runes

ML models

Pinning ML model revisions for compatibility and security

Pin versions of open source packages like PyPi's transformers to avoid breaking changes or security issues; similarly, pin model revisions for stability.

Philip Kiely

Prompt: A green pushpin in an old-fashioned map

ML models

Deployment and inference for open source text embedding models

Text embedding models convert text into semantic vectors. Numerous open source models cater to search, recommendation, classification & LLM-augmented retrieval.

Philip Kiely

Prompt: a typewriter embedded in a motherboard

Product

New in October 2023

All-new model management, a text embedding model that matches OpenAI, and misgif, the most fun you’ll have with AI all week.

Baseten

A glowing cyberpunk car racing through an enchanted forest

Product

New in September 2023

Mistral 7B LLM, GPU comparisons, model observability features, and an open source AI event series

Baseten

GPU guides

NVIDIA A10 vs A100 GPUs for LLM and Stable Diffusion inference

This article compares two popular GPUs—the NVIDIA A10 and A100—for model inference and discusses the option of using multi-GPU instances for larger models.

Philip Kiely

Product

New in August 2023

Truss' latest update addresses key ML model serving issues. Discover how to speed up SDXL inference to 3s and build ChatGPT-like apps with Llama 2 & Chainlit.

Baseten

Prompt: a heavily constructed solarpunk bridge over a chasm

1…8 9 10…14