Qwen3-30B-A3B-Thinking-2507 GPTQ

Requirements

  • vLLM : v0.11.1
  • gptqmodel : v4.0.0

Performance

id Dataset Metric Samples bf16 w4a16
1 aime25 AveragePass@1 30 0.7666 0.8334
2 gpqa_diamond AveragePass@1 198 0.6869 0.6313
3 mmlu_pro AverageAccuracy 1196 0.7784 0.7617
4 ifeval prompt_level_strict_acc 541 0.8521 0.8503
5 live_code_bench Pass@1 1055 0.7922 0.7687
  • temperature 0.6
  • top_p 0.95
  • max_tokens 81920
Downloads last month
49
Safetensors
Model size
31B params
Tensor type
I32
·
BF16
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support