Richard Zhuang's picture

Richard Zhuang PRO

RZ412

·

https://richardzhuang0412.github.io

AI & ML interests

LLM Routing, LLM + Games, Post-Training, Agents

Recent Activity

updated a dataset about 5 hours ago

DCAgent2/dev_set_v2_GLM_4_6_stackexchange_overflow_sandboxes_32eps_65k_reasoning_num_traff7c40f1

published a dataset about 5 hours ago

DCAgent2/dev_set_v2_GLM_4_6_stackexchange_overflow_sandboxes_32eps_65k_reasoning_num_traff7c40f1

updated a dataset about 6 hours ago

DCAgent2/swebench_verified_random_100_folders_nemotron_100000_opt100k__Qwen3_8B_20260330_225138

View all activity

Organizations

Papers 2

arxiv:2501.08328

arxiv:2410.02223

models 57

RZ412/Qwen2.5-3B-Instruct-inferredbugs-sandboxes-traces-terminus-2

Updated Dec 4, 2025

RZ412/Qwen2.5-3B-Instruct-OT3-8K-QwQ-Min-R1-Min-MLR

Text Generation • 3B • Updated Nov 30, 2025 • 1

RZ412/Qwen2.5-3B-Instruct-OT3-8K-R1-Only-Seed-42

Text Generation • 3B • Updated Nov 3, 2025 • 1

RZ412/Qwen2.5-3B-Instruct-OT3-8K-QwQ-R1-RM-50-50-SS-42-AS-42

Text Generation • 3B • Updated Nov 3, 2025 • 4

RZ412/Qwen2.5-3B-Instruct-OT3-8K-QwQ-Only-Seed-42

Text Generation • 3B • Updated Nov 3, 2025 • 32

RZ412/Qwen2.5-3B-Instruct-OT3-8K-R1-MeL

Text Generation • 3B • Updated Oct 28, 2025 • 1

RZ412/Qwen2.5-3B-Instruct-OT3-8K-R1-ML

Text Generation • 3B • Updated Oct 27, 2025 • 2

RZ412/Qwen2.5-3B-Instruct-OT3-8K-QwQ-MaL-misstore

Text Generation • 3B • Updated Oct 27, 2025 • 2

RZ412/Qwen2.5-3B-Instruct-OT3-8K-QwQ-R1-DB

Text Generation • 3B • Updated Oct 26, 2025 • 4

RZ412/Qwen2.5-3B-Instruct-OT3-8K-QwQ-R1-RES

Text Generation • 3B • Updated Oct 26, 2025 • 4

datasets 20

RZ412/PokerBench

Viewer • Updated Jan 8 • 574k • 1.3k • 34

RZ412/db-test-traces

Viewer • Updated Dec 10, 2025 • 210 • 5

RZ412/test-parquet2

Viewer • Updated Dec 6, 2025 • 728 • 4

RZ412/test-parquet

Viewer • Updated Dec 6, 2025 • 728 • 4

RZ412/inferredbugs-traces-sft

Viewer • Updated Dec 5, 2025 • 4

RZ412/inferredbugs-tasks

Viewer • Updated Dec 5, 2025 • 100 • 5

RZ412/inferredbugs-10

Viewer • Updated Dec 5, 2025 • 10 • 3

RZ412/inferredbugs-traces-10

Viewer • Updated Dec 5, 2025 • 7

RZ412/inferredbugs-sandboxes-10

Viewer • Updated Dec 5, 2025 • 10 • 4

RZ412/inferredbugs-10-traces

Viewer • Updated Dec 5, 2025 • 5

View 20 datasets