23 large language models

NVIDIA logoLlama 3.1 Nemotron 70B

LLM
3.1NemotronA100

Meta logoLlama 3.1 405B Instruct

LLM
3.1InstructH100

Fixie LogoUltravox v0.4

LLM
0.4vLLMH100 MIG 40GB

Meta logoLlama 3.1 8B Instruct

LLM
3.1InstructH100

Meta logoLlama 3.1 70B Instruct

LLM
3.1InstructH100

Meta logoLlama 3 8B Instruct

LLM
3InstructA100

Mistral AI logoPixtral 12B

LLM
PixtralvLLMA100

Microsoft LogoPhi 3.5 Mini Instruct

LLM
3.5128kvLLMA10G

Meta logoLlama 3 70B Instruct

LLM
3InstructH100

google logoGemma 2 9B

LLM
vLLMA100

google logoGemma 2 27B

LLM
vLLMA100

Meta logoLlama 3 8B Instruct TRT-LLM

LLM
3InstructTRT-LLMA100

Mistral AI logoMistral 7B Chat TRT-LLM

LLM
v1ChatTRT-LLMA100

Mistral AI logoMixtral 8x7B

LLM
v1ChatTRT-LLMA100

Mistral AI logoMistral 7B Instruct

LLM
v2ChatA10G

Hugging Face logoZephyr 7B Alpha

LLM
AlphaA10G

Mistral AI logoMistral 7B Chat

LLM
v1ChatA10G

Deploy any model in just a few commands

Avoid getting tangled in complex deployment processes. Deploy best-in-class open-source models and take advantage of optimized serving for your own models.

$

truss init -- example stable-diffusion-2-1-base ./my-sd-truss

$

cd ./my-sd-truss

$

export BASETEN_API_KEY=MdNmOCXc.YBtEZD0WFOYKso2A6NEQkRqTe

$

truss push

INFO

Serializing Stable Diffusion 2.1 truss.

INFO

Making contact with Baseten 👋 👽

INFO

🚀 Uploading model to Baseten 🚀

Upload progress: 0% | | 0.00G/2.39G