Jeongjae Park

jjp97

AI & ML interests

I’m interested in the latest NLP and AI technologies, such as uncertainty, retrieval, agentic approaches, and long-context models!

Recent Activity

upvoted a paper 4 days ago

FASA: Frequency-aware Sparse Attention

upvoted a paper 6 days ago

Weak-Driven Learning: How Weak Agents make Strong Agents Stronger

liked a model 8 days ago

allenai/OLMoE-1B-7B-0125

View all activity

Organizations

upvoted a paper 4 days ago

FASA: Frequency-aware Sparse Attention

Paper • 2602.03152 • Published 20 days ago • 148

upvoted a paper 6 days ago

Weak-Driven Learning: How Weak Agents make Strong Agents Stronger

Paper • 2602.08222 • Published 14 days ago • 263

upvoted a paper 11 days ago

Shaping capabilities with token-level data filtering

Paper • 2601.21571 • Published 25 days ago • 27

upvoted 2 papers 18 days ago

A-RAG: Scaling Agentic Retrieval-Augmented Generation via Hierarchical Retrieval Interfaces

Paper • 2602.03442 • Published 20 days ago • 19

PaperBanana: Automating Academic Illustration for AI Scientists

Paper • 2601.23265 • Published 24 days ago • 195

upvoted 3 papers 19 days ago

Why Language Models Hallucinate

Paper • 2509.04664 • Published Sep 4, 2025 • 196

Native Parallel Reasoner: Reasoning in Parallelism via Self-Distilled Reinforcement Learning

Paper • 2512.07461 • Published Dec 8, 2025 • 78

Stabilizing Reinforcement Learning with LLMs: Formulation and Practices

Paper • 2512.01374 • Published Dec 1, 2025 • 105

upvoted a paper 21 days ago

THINKSAFE: Self-Generated Safety Alignment for Reasoning Models

Paper • 2601.23143 • Published 24 days ago • 38

upvoted a paper 22 days ago

Self-Improving Pretraining: using post-trained models to pretrain better models

Paper • 2601.21343 • Published 25 days ago • 17

upvoted a paper 24 days ago

Agentic Reasoning for Large Language Models

Paper • 2601.12538 • Published Jan 18 • 197

upvoted a paper 25 days ago

Entropy-Adaptive Fine-Tuning: Resolving Confident Conflicts to Mitigate Forgetting

Paper • 2601.02151 • Published Jan 5 • 109

upvoted 3 papers 26 days ago

upvoted 5 papers about 1 month ago

Chaining the Evidence: Robust Reinforcement Learning for Deep Search Agents with Citation-Aware Rubric Rewards

Paper • 2601.06021 • Published Jan 9 • 47

Your Group-Relative Advantage Is Biased

Paper • 2601.08521 • Published Jan 13 • 154

Rewarding the Rare: Uniqueness-Aware RL for Creative Problem Solving in LLMs

Paper • 2601.08763 • Published Jan 13 • 148

mHC: Manifold-Constrained Hyper-Connections

Paper • 2512.24880 • Published Dec 31, 2025 • 309

Multiplex Thinking: Reasoning via Token-wise Branch-and-Merge

Paper • 2601.08808 • Published Jan 13 • 39

Jeongjae Park

AI & ML interests

Recent Activity

Organizations

jjp97's activity