3 9 2

Perry the Platypus PRO

AgPerry

AI & ML interests

None yet

Recent Activity

upvoted a paper about 1 hour ago

ClawBench: Can AI Agents Complete Everyday Online Tasks?

submitted a paper about 2 hours ago

ClawBench: Can AI Agents Complete Everyday Online Tasks?

upvoted a paper 1 day ago

Watch Before You Answer: Learning from Visually Grounded Post-Training

View all activity

Organizations

upvoted a paper about 1 hour ago

ClawBench: Can AI Agents Complete Everyday Online Tasks?

Paper • 2604.08523 • Published 1 day ago • 5

submitted a paper to Daily Papers about 2 hours ago

ClawBench: Can AI Agents Complete Everyday Online Tasks?

Paper • 2604.08523 • Published 1 day ago • 5

upvoted 2 papers 1 day ago

Watch Before You Answer: Learning from Visually Grounded Post-Training

Paper • 2604.05117 • Published 4 days ago • 28

Video-MME-v2: Towards the Next Stage in Benchmarks for Comprehensive Video Understanding

Paper • 2604.05015 • Published 4 days ago • 220

upvoted a collection 1 day ago

AI Paper of the Day

Collection

A collection of papers that I think are interesting, one added each day • 635 items • Updated 1 day ago • 94

upvoted 2 papers 1 day ago

VerlTool: Towards Holistic Agentic Reinforcement Learning with Tool Use

Paper • 2509.01055 • Published Sep 1, 2025 • 80

EvolveCoder: Evolving Test Cases via Adversarial Verification for Code Reinforcement Learning

Paper • 2603.12698 • Published 28 days ago • 1

submitted a paper to Daily Papers 2 days ago

Watch Before You Answer: Learning from Visually Grounded Post-Training

Paper • 2604.05117 • Published 4 days ago • 28

published a model 2 days ago

AgPerry/Qwen3-8B-fim-v2v3pt-swe-lego-posttrain

Updated 2 days ago

upvoted a collection 3 days ago

IQuest-Coder

Collection

14 items • Updated Mar 3 • 108

liked a dataset 8 days ago

TIGER-Lab/SWE-QA-Pro-Bench

Viewer • Updated 17 days ago • 260 • 108 • 5

upvoted a paper 10 days ago

ImagenWorld: Stress-Testing Image Generation Models with Explainable Human Evaluation on Open-ended Real-World Tasks

Paper • 2603.27862 • Published 11 days ago • 30

New activity in TIGER-Lab/MMLU-Pro 10 days ago

how to download responses of specific models

#45 opened 26 days ago by

Roman1111111

updated a model 23 days ago

AgPerry/Qwen2.5-Coder-7B-Instruct-num05-accumulate_16

Text Generation • 333k • Updated 23 days ago • 14

published a model 23 days ago

AgPerry/Qwen2.5-Coder-7B-Instruct-num05-accumulate_16

Text Generation • 333k • Updated 23 days ago • 14

updated a model 24 days ago

AgPerry/Qwen2.5-Coder-7B-Instruct-num06-accumulate_16

Text Generation • 333k • Updated 24 days ago • 14

published a model 24 days ago

AgPerry/Qwen2.5-Coder-7B-Instruct-num06-accumulate_16

Text Generation • 333k • Updated 24 days ago • 14

updated a model 25 days ago

AgPerry/Qwen2.5-Coder-7B-Instruct-num07

Text Generation • 333k • Updated 25 days ago • 523

published a model 25 days ago

AgPerry/Qwen2.5-Coder-7B-Instruct-num07

Text Generation • 333k • Updated 25 days ago • 523

updated a model 27 days ago

AgPerry/Qwen2.5-Coder-7B-Instruct-num06

Text Generation • 333k • Updated 27 days ago • 15

Perry the Platypus PRO

AI & ML interests

Recent Activity

Organizations

AgPerry's activity

how to download responses of specific models