RLMs (Reasoning Language Models)
updated
LADDER: Self-Improving LLMs Through Recursive Problem Decomposition
Paper
•
2503.00735
•
Published
•
23
START: Self-taught Reasoner with Tools
Paper
•
2503.04625
•
Published
•
113
R1-Searcher: Incentivizing the Search Capability in LLMs via
Reinforcement Learning
Paper
•
2503.05592
•
Published
•
27
R1-Omni: Explainable Omni-Multimodal Emotion Recognition with
Reinforcing Learning
Paper
•
2503.05379
•
Published
•
38
21B
•
Updated
•
17
•
388
Viewer
•
Updated
•
269
•
1.04k
•
47
Text Generation
•
33B
•
Updated
•
52.8k
•
•
2.89k
Text Generation
•
8B
•
Updated
•
368
•
•
182
Text Generation
•
33B
•
Updated
•
11
•
•
154
Reinforcement Learning for Reasoning in Small LLMs: What Works and What
Doesn't
Paper
•
2503.16219
•
Published
•
52
predibase/Predibase-T2T-32B-RFT
33B
•
Updated
•
3
•
20
agentica-org/DeepCoder-1.5B-Preview
Text Generation
•
2B
•
Updated
•
80
•
74
agentica-org/DeepCoder-14B-Preview
Text Generation
•
15B
•
Updated
•
296
•
•
680
Feature Extraction
•
8B
•
Updated
•
989
•
53
deepseek-ai/DeepSeek-R1-0528
Text Generation
•
685B
•
Updated
•
505k
•
•
2.4k
nvidia/Nemotron-Research-Reasoning-Qwen-1.5B
Text Generation
•
2B
•
Updated
•
1.1k
•
236
Video-Text-to-Text
•
9B
•
Updated
•
75
•
23
mistralai/Magistral-Small-2506
24B
•
Updated
•
28.7k
•
609
microsoft/Phi-4-mini-reasoning
Text Generation
•
4B
•
Updated
•
12.4k
•
215
microsoft/Phi-4-mini-flash-reasoning
Text Generation
•
4B
•
Updated
•
27.7k
•
269
microsoft/Phi-4-reasoning
Text Generation
•
15B
•
Updated
•
6.78k
•
215
osmosis-ai/Osmosis-Apply-1.7B
Text Generation
•
2B
•
Updated
•
28
•
91
33B
•
Updated
•
16
•
192
numind/NuMarkdown-8B-Thinking
Image-to-Text
•
8B
•
Updated
•
1.14M
•
436
moonshotai/Kimi-K2-Thinking
Text Generation
•
170B
•
Updated
•
371k
•
•
1.66k
Text Generation
•
2B
•
Updated
•
2k
•
512
MaziyarPanahi/VibeThinker-1.5B-GGUF
Text Generation
•
2B
•
Updated
•
522
•
35
ServiceNow-AI/Apriel-1.5-15b-Thinker
Image-Text-to-Text
•
15B
•
Updated
•
468
•
464
Image-Text-to-Text
•
10B
•
Updated
•
36.6k
•
•
573