LARK Lab@HKUST (GZ)

non-profit

https://lark-lab-hkustgz.github.io/

AI & ML interests

Large Language Models

Recent Activity

FYYDCC updated a collection 10 days ago

Trainee2Trainer

FYYDCC updated a dataset 10 days ago

LARK-Lab/MAPF-FrozenLake-Benchmark

FYYDCC updated a model 10 days ago

LARK-Lab/Trainee2Trainer

View all activity

Papers

From Trainee to Trainer: LLM-Designed Training Environment for RL with Multi-Agent Reasoning

Attention Amnesia in Hybrid LLMs: When CoT Fine-Tuning Breaks Long-Range Recall, and How to Fix It

View all Papers

updated a collection 10 days ago

Trainee2Trainer

This is the checkpoints and dataset for: From Trainee to Trainer: LLM-Designed Training Environment for RL with Multi-Agent Reasoning • 3 items • Updated 10 days ago

updated a dataset 10 days ago

LARK-Lab/MAPF-FrozenLake-Benchmark

Viewer • Updated 10 days ago • 3.15k • 72 • 1

updated a model 10 days ago

LARK-Lab/Trainee2Trainer

Text Generation • 4B • Updated 10 days ago • 16 • 1

updated a collection 10 days ago

Trainee2Trainer

This is the checkpoints and dataset for: From Trainee to Trainer: LLM-Designed Training Environment for RL with Multi-Agent Reasoning • 3 items • Updated 10 days ago

in LARK-Lab/MAPF-FrozenLake-Benchmark 10 days ago

Rename configs from wr* to benchmark_wr* to match codebase convention

#2 opened 10 days ago by

Initial upload: MAPF-FrozenLake eval benchmark

#1 opened 10 days ago by

published a dataset 10 days ago

LARK-Lab/MAPF-FrozenLake-Benchmark

Viewer • Updated 10 days ago • 3.15k • 72 • 1

updated a collection 10 days ago

Trainee2Trainer

This is the checkpoints and dataset for: From Trainee to Trainer: LLM-Designed Training Environment for RL with Multi-Agent Reasoning • 3 items • Updated 10 days ago

submitted a paper to Daily Papers 10 days ago

From Trainee to Trainer: LLM-Designed Training Environment for RL with Multi-Agent Reasoning

Paper • 2606.17682 • Published 12 days ago • 26

published a model 10 days ago

LARK-Lab/Trainee2Trainer

Text Generation • 4B • Updated 10 days ago • 16 • 1

updated a dataset 11 days ago

LARK-Lab/EnvFactory-SFT-DeepSeekV4Flash-OpenAI

Viewer • Updated 11 days ago • 3.27k • 53

published a dataset 16 days ago

LARK-Lab/EnvFactory-SFT-DeepSeekV4Flash-OpenAI

Viewer • Updated 11 days ago • 3.27k • 53

updated a dataset 16 days ago

LARK-Lab/EnvFactory-SFT-DeepSeekV4Flash

Updated 16 days ago • 39

published a dataset 16 days ago

LARK-Lab/EnvFactory-SFT-DeepSeekV4Flash

Updated 16 days ago • 39

published a dataset 16 days ago

LARK-Lab/SWITCH-Math-Train

Viewer • Updated 16 days ago • 45.8k • 53

published a model 16 days ago

LARK-Lab/SWITCH-Phase3-GRPO-LoRA-Qwen3-8B

Text Generation • Updated 16 days ago • 16

updated a dataset 16 days ago

LARK-Lab/SWITCH-Math-Train

Viewer • Updated 16 days ago • 45.8k • 53

updated a model 16 days ago

LARK-Lab/SWITCH-Phase3-GRPO-LoRA-Qwen3-8B

Text Generation • Updated 16 days ago • 16

submitted a paper to Daily Papers 18 days ago

Attention Amnesia in Hybrid LLMs: When CoT Fine-Tuning Breaks Long-Range Recall, and How to Fix It

Paper • 2606.11052 • Published 19 days ago • 16

authored a paper about 1 month ago

FIRST: Teach A Reliable Large Language Model Through Efficient Trustworthy Distillation

Paper • 2408.12168 • Published Aug 22, 2024