Chi PRO
ChilleD
AI & ML interests
Natural Language Processing.
Recent Activity
upvoted a paper about 16 hours ago
Retrospective Harness Optimization: Improving LLM Agents via Self-Preference over Trajectory Rollouts upvoted a paper 21 days ago
You Only Need Minimal RLVR Training: Extrapolating LLMs via Rank-1 Trajectories upvoted a paper 21 days ago
AutoResearchClaw: Self-Reinforcing Autonomous Research with Human-AI Collaboration