Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
yourbench 's Collections
MMLU-Pro by TIGER-Lab
MMLU-Pro by Qwen235B-A22B
MMLU-Pro by DeepSeek R1-0528
MMLU-Pro by OpenAI o4-mini
MMLU-Pro by Grok-3-Mini

MMLU-Pro by OpenAI o4-mini

updated May 31, 2025
Upvote
-

  • yourbench/yourbench_reproduction_o4mini_biology

    Viewer • Updated Jun 1, 2025 • 1.83k • 36

  • yourbench/yourbench_reproduction_o4mini_business

    Viewer • Updated Jun 1, 2025 • 829 • 15

  • yourbench/yourbench_reproduction_o4mini_chemistry

    Viewer • Updated Jun 1, 2025 • 805 • 32

  • yourbench/yourbench_reproduction_o4mini_computerscience

    Viewer • Updated Jun 1, 2025 • 1.81k • 12

  • yourbench/yourbench_reproduction_o4mini_economics

    Viewer • Updated Jun 1, 2025 • 874 • 6

  • yourbench/yourbench_reproduction_o4mini_health

    Viewer • Updated Jun 1, 2025 • 1.48k • 5

  • yourbench/yourbench_reproduction_o4mini_history

    Viewer • Updated Jun 1, 2025 • 2.71k • 6

  • yourbench/yourbench_reproduction_o4mini_law

    Viewer • Updated Jun 1, 2025 • 778 • 5

  • yourbench/yourbench_reproduction_o4mini_philosophy

    Viewer • Updated Jun 1, 2025 • 2.05k • 5

  • yourbench/yourbench_reproduction_o4mini_physics

    Viewer • Updated Jun 1, 2025 • 1.03k • 3

  • yourbench/yourbench_reproduction_o4mini_psychology

    Viewer • Updated Jun 1, 2025 • 1.24k • 7
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs