4 24 41

Chi PRO

ChilleD

AI & ML interests

Natural Language Processing.

Recent Activity

upvoted a paper about 16 hours ago

Retrospective Harness Optimization: Improving LLM Agents via Self-Preference over Trajectory Rollouts

upvoted a paper 21 days ago

You Only Need Minimal RLVR Training: Extrapolating LLMs via Rank-1 Trajectories

upvoted a paper 21 days ago

AutoResearchClaw: Self-Reinforcing Autonomous Research with Human-AI Collaboration

View all activity

Organizations

Collections 1

Papers 5

spaces 1

Agent World Model Environment Server

🤖

Step through and monitor an OpenEnv environment via web UI

models 3

datasets 8

ChilleD/WebHarbor

Updated about 1 month ago • 346

ChilleD/SynthAgent

Viewer • Updated Dec 18, 2025 • 2.5k • 24 • 1

ChilleD/pop1k7

Updated Jun 28, 2024 • 17

ChilleD/SVAMP

Viewer • Updated Jun 5, 2024 • 1k • 12.7k • 23

ChilleD/CommonsenseQA

Viewer • Updated Jun 4, 2024 • 12.1k • 43 • 1

ChilleD/StrategyQA

Viewer • Updated Aug 26, 2023 • 2.29k • 11.1k • 6

ChilleD/LastLetterConcat

Viewer • Updated May 11, 2023 • 500 • 189 • 4

ChilleD/MultiArith

Viewer • Updated May 2, 2023 • 600 • 1.76k • 17

Chi PRO

AI & ML interests

Recent Activity

Organizations

Collections 1

ChilleD/SynthAgent

Adapting Web Agents with Synthetic Supervision

ChilleD/SynthAgent-SFT-Qwen2.5-VL-7B

ChilleD/SynthAgent-SFT-UI-TARS-1.5-7B

ChilleD/SynthAgent

Adapting Web Agents with Synthetic Supervision

ChilleD/SynthAgent-SFT-Qwen2.5-VL-7B

ChilleD/SynthAgent-SFT-UI-TARS-1.5-7B

Papers 5

spaces 1

Agent World Model Environment Server

models 3

ChilleD/SynthAgent-SFT-Qwen2.5-VL-7B

ChilleD/SynthAgent-SFT-UI-TARS-1.5-7B

ChilleD/SynthAgent

datasets 8

ChilleD/WebHarbor

ChilleD/SynthAgent

ChilleD/pop1k7

ChilleD/SVAMP

ChilleD/CommonsenseQA

ChilleD/StrategyQA

ChilleD/LastLetterConcat

ChilleD/MultiArith

Chi PRO

AI & ML interests

Recent Activity

Organizations

Collections 1

Papers 5

spaces 1

Agent World Model Environment Server

models 3 Sort: Recently updated

datasets 8 Sort: Recently updated

models 3

datasets 8