
Collections

Discover the best community collections!

Collections including paper arxiv:2601.20834

about 1 hour ago

Endless Terminals: Scaling RL Environments for Terminal Agents

Paper • 2601.16443 • Published 16 days ago • 16
Linear representations in language models can change dramatically over a conversation

Paper • 2601.20834 • Published 10 days ago • 21
Scaling Embeddings Outperforms Scaling Experts in Language Models

Paper • 2601.21204 • Published 10 days ago • 97
Teaching Models to Teach Themselves: Reasoning at the Edge of Learnability

Paper • 2601.18778 • Published 12 days ago • 40

LongCat-Flash-Thinking-2601 Technical Report

Paper • 2601.16725 • Published 15 days ago • 175
DeepSeek-OCR 2: Visual Causal Flow

Paper • 2601.20552 • Published 10 days ago • 54
Linear representations in language models can change dramatically over a conversation

Paper • 2601.20834 • Published 10 days ago • 21
BMAM: Brain-inspired Multi-Agent Memory Framework

Paper • 2601.20465 • Published 11 days ago • 4

Interpretability

I Have Covered All the Bases Here: Interpreting Reasoning Features in Large Language Models via Sparse Autoencoders

Paper • 2503.18878 • Published Mar 24, 2025 • 119
Truth Neurons

Paper • 2505.12182 • Published May 18, 2025 • 8
Resa: Transparent Reasoning Models via SAEs

Paper • 2506.09967 • Published Jun 11, 2025 • 21
Why Can't Transformers Learn Multiplication? Reverse-Engineering Reveals Long-Range Dependency Pitfalls

Paper • 2510.00184 • Published Sep 30, 2025 • 17

Linear representations in language models can change dramatically over a conversation

Paper • 2601.20834 • Published 10 days ago • 21
daVinci-Agency: Unlocking Long-Horizon Agency Data-Efficiently

Paper • 2602.02619 • Published 5 days ago • 47
OmniSIFT: Modality-Asymmetric Token Compression for Efficient Omni-modal Large Language Models

Paper • 2602.04804 • Published 3 days ago • 44

about 10 hours ago

Low-probability Tokens Sustain Exploration in Reinforcement Learning with Verifiable Reward

Paper • 2510.03222 • Published Oct 3, 2025 • 75
In-the-Flow Agentic System Optimization for Effective Planning and Tool Use

Paper • 2510.05592 • Published Oct 7, 2025 • 107
Less is More: Recursive Reasoning with Tiny Networks

Paper • 2510.04871 • Published Oct 6, 2025 • 506
Multi-Agent Tool-Integrated Policy Optimization

Paper • 2510.04678 • Published Oct 6, 2025 • 31

Representation & Optimization

Understanding representation sheds light on optimization

Nuclear Norm Regularization for Deep Learning

Paper • 2405.14544 • Published May 23, 2024 • 1
Token embeddings violate the manifold hypothesis

Paper • 2504.01002 • Published Apr 1, 2025 • 1
Approximate Nullspace Augmented Finetuning for Robust Vision Transformers

Paper • 2403.10476 • Published Mar 15, 2024 • 1
ElaLoRA: Elastic & Learnable Low-Rank Adaptation for Efficient Model Fine-Tuning

Paper • 2504.00254 • Published Mar 31, 2025 • 1
