Active filters: pruned
5456es/last_layer_prune_Llama-3.1-8B-Instruct_prune_0.6-sigmoid
8B • Updated • 8
5456es/last_layer_prune_Qwen2.5-7B-Instruct_prune_0.6-sigmoid
8B • Updated • 6
5456es/last_layer_prune_Qwen2.5-7B-Instruct_prune_0.4-sigmoid
8B • Updated • 6
5456es/last_layer_prune_Llama-3.1-8B-Instruct_prune_0.8-sigmoid
8B • Updated • 8
5456es/last_layer_prune_Llama-3.1-8B-Instruct_prune_0.4-sigmoid
8B • Updated • 11
rinarina0429/pruned-llama2-7b
Text Generation
• Updated
5456es/random_prune_Llama-3.1-8B-Instruct_prune_0.4-sigmoid
8B • Updated • 7
5456es/last_layer_prune_Llama-3.2-3B-Instruct_prune_0.3-sigmoid
3B • Updated • 1
5456es/random_prune_Llama-3.1-8B-Instruct_prune_0.2-sigmoid
8B • Updated • 15
5456es/random_prune_Llama-3.2-3B-Instruct_prune_0.2-sigmoid
3B • Updated • 1
5456es/last_layer_prune_Llama-3.1-8B-Instruct_prune_0.3-sigmoid
8B • Updated • 7
5456es/last_layer_prune_Llama-3.2-3B-Instruct_prune_0.5-sigmoid
3B • Updated
5456es/random_prune_Llama-3.2-3B-Instruct_prune_0.4-sigmoid
3B • Updated • 1
5456es/last_layer_prune_Llama-3.1-8B-Instruct_prune_0.5-sigmoid
8B • Updated • 7
5456es/last_layer_prune_Llama-3.2-3B-Instruct_prune_0.7-sigmoid
3B • Updated
5456es/random_prune_Llama-3.1-8B-Instruct_prune_0.6-sigmoid
8B • Updated • 7
5456es/random_prune_Llama-3.2-3B-Instruct_prune_0.6-sigmoid
3B • Updated • 1
5456es/last_layer_prune_Llama-3.1-8B-Instruct_prune_0.7-sigmoid
8B • Updated • 7
CarlosRCDev/spanish-snowflake-arctic-embed-l-v2.0
Sentence Similarity
• 0.4B • Updated • 3
abhinavv3/SmolLM-135M-Instruct-layer-width-pruned-90000000M-raw
Text Generation
• 95M • Updated • 1
codemichaeld/FramePainter_UnetQunatizedFP8
codemichaeld/Wan_vae_upscale2x_fp8
VAGOsolutions/SauerkrautLM-ColQwen3-1.7b-Turbo-v0.1
Image-Text-to-Text
• Updated • 16
• 3
ArslanRobo/llama-3-8b-pruned-mixed-precision-gguf
6B • Updated
ArslanRobo/llama-3.1-8b-pruned-taylor30-padded-mixed-precision-quantization-gguf
6B • Updated • 1
annus-lums/llama-3.1-8b-pruned-taylor30-smooth-quant-mpq-gguf
6B • Updated • 10
annus-lums/llama-3.1-8b-pruned-taylor30-padded-mixed-precision-quantization-gguf
6B • Updated
Pinkstackorg/Qwen3-Coder-pruned-20B-A3B
20B • Updated • 4
• 4
mlx-community/GLM-4.7-REAP-50-mxfp4
Text Generation
• Updated • 2.62k
• 29
0xSero/INTELLECT-3-REAP-50
Text Generation
• 57B • Updated • 21
• 4