Xiao

Yang1213112131

AI & ML interests

None yet

Recent Activity

Organizations

None yet

upvoted a paper 23 days ago

Generated Reality: Human-centric World Simulation using Interactive Video Generation with Hand and Camera Control

Paper • 2602.18422 • Published 26 days ago • 30

upvoted a paper about 1 month ago

VideoWorld 2: Learning Transferable Knowledge from Real-world Videos

Paper • 2602.10102 • Published Feb 10 • 14

upvoted a paper about 2 months ago

Cosmos Policy: Fine-Tuning Video Models for Visuomotor Control and Planning

Paper • 2601.16163 • Published Jan 22 • 14

commented on a paper 2 months ago

Inference-time Physics Alignment of Video Generative Models with Latent World Models

Paper • 2601.10553 • Published Jan 15 • 12

upvoted a paper 2 months ago

ThinkGen: Generalized Thinking for Visual Generation

Paper • 2512.23568 • Published Dec 29, 2025 • 1

upvoted 3 papers 3 months ago

WorldPlay: Towards Long-Term Geometric Consistency for Real-Time Interactive World Modeling

Paper • 2512.14614 • Published Dec 16, 2025 • 72

SpatialTree: How Spatial Abilities Branch Out in MLLMs

Paper • 2512.20617 • Published Dec 23, 2025 • 43

StereoWorld: Geometry-Aware Monocular-to-Stereo Video Generation

Paper • 2512.09363 • Published Dec 10, 2025 • 74

liked a Space 4 months ago

Paper2Poster

updated a dataset 5 months ago

Yang1213112131/PreFM

Viewer • Updated Oct 29, 2025 • 1.69M • 44 • 1

authored a paper 5 months ago

PreFM: Online Audio-Visual Event Parsing via Predictive Future Modeling

Paper • 2505.23155 • Published May 29, 2025 • 2

upvoted a paper 5 months ago

PreFM: Online Audio-Visual Event Parsing via Predictive Future Modeling

Paper • 2505.23155 • Published May 29, 2025 • 2

upvoted an article 5 months ago

Cosmos Predict 2.5 & Transfer 2.5: Evolving the World Foundation Models for Physical AI

Article • Published Oct 28, 2025

liked a dataset 5 months ago

Yang1213112131/PreFM

Viewer • Updated Oct 29, 2025 • 1.69M • 44 • 1

published a dataset 5 months ago

Yang1213112131/PreFM

Viewer • Updated Oct 29, 2025 • 1.69M • 44 • 1

upvoted a paper 5 months ago

UniVid: Unifying Vision Tasks with Pre-trained Video Generation Models

Paper • 2509.21760 • Published Sep 26, 2025 • 15

upvoted 4 papers 6 months ago

SimpleVLA-RL: Scaling VLA Training via Reinforcement Learning

Paper • 2509.09674 • Published Sep 11, 2025 • 80

EchoX: Towards Mitigating Acoustic-Semantic Gap via Echo Training for Speech-to-Speech LLMs

Paper • 2509.09174 • Published Sep 11, 2025 • 62

HuMo: Human-Centric Video Generation via Collaborative Multi-Modal Conditioning

Paper • 2509.08519 • Published Sep 10, 2025 • 130

From Editor to Dense Geometry Estimator

Paper • 2509.04338 • Published Sep 4, 2025 • 96
