K-BrowseComp: A Web Browsing Agent Benchmark Grounded in Korean Contexts Paper • 2606.02404 • Published 3 days ago • 51
Harness Updating Is Not Harness Benefit: Disentangling Evolution Capabilities in Self-Evolving LLM Agents Paper • 2605.30621 • Published 7 days ago • 17
COLLEAGUE.SKILL: Automated AI Skill Generation via Expert Knowledge Distillation Paper • 2605.31264 • Published 6 days ago • 102
GrepSeek: Training Search Agents for Direct Corpus Interaction Paper • 2605.29307 • Published 7 days ago • 97
OmniRetrieval: Unified Retrieval across Heterogeneous Knowledge Sources Paper • 2605.29250 • Published 7 days ago • 75
Qwen-VLA: Unifying Vision-Language-Action Modeling across Tasks, Environments, and Robot Embodiments Paper • 2605.30280 • Published 7 days ago • 134
SkillGrad: Optimizing Agent Skills Like Gradient Descent Paper • 2605.27760 • Published 9 days ago • 27
Your Agents Are Aging Too: Agent Lifespan Engineering for Deployed Systems Paper • 2605.26302 • Published 10 days ago • 31
Gamma-World: Generative Multi-Agent World Modeling Beyond Two Players Paper • 2605.28816 • Published 8 days ago • 419
Gemini Embedding 2: A Native Multimodal Embedding Model from Gemini Paper • 2605.27295 • Published 9 days ago • 23
MUSE-Autoskill: Self-Evolving Agents via Skill Creation, Memory, Management, and Evaluation Paper • 2605.27366 • Published 9 days ago • 25
LLaVA-OneVision-2: Towards Next-Generation Perceptual Intelligence Paper • 2605.25979 • Published 10 days ago • 27
HeavySkill: Heavy Thinking as the Inner Skill in Agentic Harness Paper • 2605.02396 • Published about 1 month ago • 24
MemForest: An Efficient Agent Memory System with Hierarchical Temporal Indexing Paper • 2605.23986 • Published 19 days ago • 17
Claw-Anything: Benchmarking Always-On Personal Assistants with Broader Access to User's Digital World Paper • 2605.26086 • Published 10 days ago • 23
Foundation Protocol: A Coordination Layer for Agentic Society Paper • 2605.23218 • Published 13 days ago • 79
SciAtlas: A Large-Scale Knowledge Graph for Automated Scientific Research Paper • 2605.22878 • Published 15 days ago • 58
From Raw Experience to Skill Consumption: A Systematic Study of Model-Generated Agent Skills Paper • 2605.23899 • Published 13 days ago • 29
SkillOpt: Executive Strategy for Self-Evolving Agent Skills Paper • 2605.23904 • Published 13 days ago • 220