Composition-RL Collection Datasets and trained checkpoints of Composition-RL: https://github.com/XinXU-USTC/Composition-RL • 13 items • Updated about 22 hours ago • 1
Progressive Residual Warmup for Language Model Pretraining Paper • 2603.05369 • Published 15 days ago • 36
Progressive Residual Warmup for Language Model Pretraining Paper • 2603.05369 • Published 15 days ago • 36
Composition-RL: Compose Your Verifiable Prompts for Reinforcement Learning of Large Language Models Paper • 2602.12036 • Published Feb 12 • 93
Composition-RL: Compose Your Verifiable Prompts for Reinforcement Learning of Large Language Models Paper • 2602.12036 • Published Feb 12 • 93
Composition-RL: Compose Your Verifiable Prompts for Reinforcement Learning of Large Language Models Paper • 2602.12036 • Published Feb 12 • 93
Composition-RL Collection Datasets and trained checkpoints of Composition-RL: https://github.com/XinXU-USTC/Composition-RL • 13 items • Updated about 22 hours ago • 1