baseten/Qwen2.5-32B-Instruct-128k
Text Generation
• 33B • Updated • 14
baseten/btest-Llama-3.1-8B-Instruct-NVIDIA-H100-80GB-HBM3-v0.18.1-TP1
baseten/btest-Qwen-0.5B-NVIDIA-H100-80GB-HBM3-v0.18.1-TP1
baseten/Llama-4-Scout-17B-16E-fp4
62B • Updated • 4
• 1
baseten/Llama-4-Scout-17B-16E-fp8
108B • Updated • 7
baseten/btest-TinyLlama-1.1B-Chat-v1.0-NVIDIA-A10G-v0.20.0r-TP1
baseten/orpheus-3b-0.1-ft
Text-to-Speech
• 4B • Updated • 240
• 2
baseten/whisper_trt_large_v3_turbo_test_NVIDIA_L4_0_18_2
Updated
397B • Updated • 3.62k
• 1
baseten/Qwen2.5-Coder-32B-Instruct-128k
Text Generation
• 33B • Updated • 6
baseten/whisper_trt_large_v3_turbo_test_NVIDIA_H100_80GB_HBM3_MIG_3g_40gb_0_18_2
Updated
baseten/whisper_trt_large_v3_test_NVIDIA_H100_80GB_HBM3_MIG_3g_40gb_0_18_2
Updated
baseten/whisper_trt_large_v3_test_NVIDIA_L4_0_18_2
Updated
baseten/whisper_trt_large_v3_test04152025_NVIDIA_H100_80GB_HBM3_MIG_3g_40gb_0_13_0
Updated
baseten/btest-Llama-3.1-8B-Instruct-NVIDIA-A10G-v0.18.1-TP2
baseten/btest-Llama-3.1-8B-Instruct-NVIDIA-A10G-v0.18.1-TP1
baseten/btest-TinyLlama-1.1B-Chat-v1.0-NVIDIA-A10G-v0.18.1-TP1
baseten/llama-3.3-70B-smoothquant-tllm
Text Generation
• Updated • 4
baseten/btest-Llama-3.3-70B-Instruct-NVIDIA-H100-80GB-HBM3-v0.17.0-TP2
Updated
baseten/btest_Qwen2.5-Coder-1.5B-Instruct-fp8-tp1-lade-h100-hbm3
baseten/whisper_trt_large_v3_test03132025_NVIDIA_H100_80GB_HBM3_MIG_3g_40gb_0_13_0
Updated
baseten/btest-Qwen2.5-7B-Instruct-NVIDIA-H100-80GB-HBM3-v0.17.0-TP2
baseten/btest-Qwen2.5-7B-Instruct-NVIDIA-H100-80GB-HBM3-v0.17.0-TP1
baseten/btest-engine-builder-tllm-llama-1b
Text Generation
• 1B • Updated • 9
• baseten/whisper_trt_large_v3_turbo_test20250307_NVIDIA_L4_0_13_0
Updated
baseten/btest-Llama-3.1-8B-Instruct-NVIDIA-H100-80GB-HBM3-v0.17.0-TP2
baseten/btest-Llama-3.1-8B-Instruct-NVIDIA-H100-80GB-HBM3-v0.17.0-TP1
baseten/btest-TinyLlama-1.1B-Chat-v1.0-NVIDIA-H100-80GB-HBM3-v0.17.0-TP2
baseten/btest-TinyLlama-1.1B-Chat-v1.0-NVIDIA-H100-80GB-HBM3-v0.17.0-TP1
baseten/btest-Llama-3.1-70B-Instruct-NVIDIA-H100-80GB-HBM3-v0.17.0-TP4