nw

NightwingNg

AI & ML interests

None yet

Recent Activity

liked a model 6 days ago

unsloth/Qwen3.5-35B-A3B-GGUF

liked a model 14 days ago

unsloth/Qwen3.5-397B-A17B-GGUF

liked a model 14 days ago

Qwen/Qwen3.5-397B-A17B

Organizations

None yet

upvoted an article 6 months ago

Article

ZebraLogic: Benchmarking the Logical Reasoning Ability of Language Models

Jul 27, 2024 • 35

upvoted a paper 7 months ago

Group Sequence Policy Optimization

Paper • 2507.18071 • Published Jul 24, 2025 • 318

upvoted a collection 8 months ago

Seed-X

A powerful open-source multilingual translation language model series, including instruction and reasoning models. • 8 items • Updated Aug 22, 2025 • 67

upvoted an article 8 months ago

Article

OpenReasoning-Nemotron: A Family of State-of-the-Art Distilled Reasoning Models

Jul 18, 2025 • 50

upvoted a paper 9 months ago

MiniMax-M1: Scaling Test-Time Compute Efficiently with Lightning Attention

Paper • 2506.13585 • Published Jun 16, 2025 • 273

upvoted 3 collections 9 months ago

MiniMax-M1

MiniMax-M1, the world's first open-weight, large-scale hybrid-attention reasoning model. • 6 items • Updated 17 days ago • 118

V-JEPA 2

A frontier video understanding model developed by FAIR, Meta, which extends the pretraining objectives of https://ai.meta.com/blog/v-jepa-yann • 8 items • Updated Jun 13, 2025 • 192

GRMR V3 Models

An improved set of models for grammar correction. (Chat template should work, no "responding as an LLM" anymore, that kind of stuff). • 6 items • Updated Jun 4, 2025 • 10

upvoted a paper 9 months ago

QwenLong-L1: Towards Long-Context Large Reasoning Models with Reinforcement Learning

Paper • 2505.17667 • Published May 23, 2025 • 88

upvoted a collection 9 months ago

RpR Models

RpR (RolePlay with Reasoning) models which are built on RPMax datasets with properly trained multi-turn reasoning. • 8 items • Updated Jun 25, 2025 • 18

upvoted 2 collections 10 months ago

Qwen3

Qwen's new Qwen3 models. In Unsloth Dynamic 2.0, GGUF, 4-bit and 16-bit Safetensor formats. Includes 128K Context Length variants. • 70 items • Updated about 9 hours ago • 261

Unsloth Dynamic 2.0 Quants

New 2.0 version of our Dynamic GGUF + Quants. Dynamic 2.0 achieves superior accuracy & SOTA quantization performance. • 79 items • Updated about 6 hours ago • 410

upvoted 2 collections 11 months ago

Qwen3

84 items • Updated Dec 31, 2025 • 1.69k

GLM-4-0414

GLM-4-0414 series model • 6 items • Updated about 9 hours ago • 134

upvoted a paper 11 months ago

DeepSeek-R1 Thoughtology: Let's <think> about LLM Reasoning

Paper • 2504.07128 • Published Apr 2, 2025 • 87

upvoted a collection about 1 year ago

Deepseek Papers

Deepseek papers collection • 31 items • Updated about 16 hours ago • 328