DavidDeng

ZiHDeng

AI & ML interests

None yet

Organizations

None yet

upvoted a paper 2 months ago

BayesianVLA: Bayesian Decomposition of Vision Language Action Models via Latent Action Queries

Paper • 2601.15197 • Published Jan 21 • 55

upvoted an article 3 months ago

We Got Claude to Fine-Tune an Open Source LLM

Article • Dec 4, 2025 • 616

upvoted a paper 3 months ago

PhysBrain: Human Egocentric Data as a Bridge from Vision Language Models to Physical Intelligence

Paper • 2512.16793 • Published Dec 18, 2025 • 76

upvoted an article 11 months ago

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

Article • Feb 7, 2025 • 282

upvoted a paper about 1 year ago

SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training

Paper • 2501.17161 • Published Jan 28, 2025 • 125