Improve performance and reduce cost with fractional H100 GPUs
Baseten now offers model inference on NVIDIA H100mig GPUs, available for all customers starting at $0.08250/minute. The H100mig family of instances runs on a fractional share of an H100 GPU using...