32 1

Chanuk Lee

tally0818

https://tally0818.github.io

AI & ML interests

LLM post-training

Recent Activity

upvoted a paper 8 days ago

Zone of Proximal Policy Optimization: Teacher in Prompts, Not Gradients

upvoted a paper 14 days ago

On the Geometry of On-Policy Distillation

upvoted a paper 14 days ago

SpatialClaw: Rethinking Action Interface for Agentic Spatial Reasoning

View all activity

Organizations

None yet

upvoted a paper 8 days ago

Zone of Proximal Policy Optimization: Teacher in Prompts, Not Gradients

Paper • 2606.18216 • Published 10 days ago • 61

upvoted 2 papers 14 days ago

On the Geometry of On-Policy Distillation

Paper • 2606.07082 • Published 21 days ago • 73

SpatialClaw: Rethinking Action Interface for Agentic Spatial Reasoning

Paper • 2606.13673 • Published 15 days ago • 106

upvoted a paper 21 days ago

TIDE: Proactive Multi-Problem Discovery via Template-Guided Iteration

Paper • 2606.04743 • Published 23 days ago • 46

upvoted a paper 23 days ago

Trust Region On-Policy Distillation

Paper • 2606.01249 • Published 26 days ago • 44

upvoted a paper 28 days ago

OmniRetrieval: Unified Retrieval across Heterogeneous Knowledge Sources

Paper • 2605.29250 • Published 29 days ago • 78

upvoted 2 papers 29 days ago

Learn from Weaknesses: Automated Domain Specialization for Small Computer-Use Agents

Paper • 2605.28775 • Published about 1 month ago • 38

Agent Explorative Policy Optimization for Multimodal Agentic Reasoning

Paper • 2605.28774 • Published about 1 month ago • 93

upvoted 8 papers about 1 month ago

HINT-SD: Targeted Hindsight Self-Distillation for Long-Horizon Agents

Paper • 2605.17873 • Published May 18 • 12

Rebellious Student: Reversing Teacher Signals for Reasoning Exploration with Self-Distilled RLVR

Paper • 2605.10781 • Published May 11 • 17

upvoted a paper about 2 months ago

Heterogeneous Scientific Foundation Model Collaboration

Paper • 2604.27351 • Published Apr 30 • 222

upvoted a paper 2 months ago

Memory Transfer Learning: How Memories are Transferred Across Domains in Coding Agents

Paper • 2604.14004 • Published Apr 15 • 30

upvoted 2 papers 3 months ago

Self-Distilled RLVR

Paper • 2604.03128 • Published Apr 3 • 179

T-MAP: Red-Teaming LLM Agents with Trajectory-aware Evolutionary Search

Paper • 2603.22341 • Published Mar 21 • 37

Chanuk Lee

AI & ML interests

Recent Activity

Organizations

tally0818's activity