Gemma 4 Collection Gemma 4 is Google's new model family including including E2B, E4B, 26B-A4B, and 31B. • 36 items • Updated about 14 hours ago • 219
Unsloth Dynamic 2.0 Quants Collection New 2.0 version of our Dynamic GGUF + Quants. Dynamic 2.0 achieves superior accuracy & SOTA quantization performance. • 105 items • Updated about 14 hours ago • 709
view article Article Introducing North Mini Code: Cohere’s First Model For Developers CohereLabs • 6 days ago • 64
Multi-Faceted Interactivity Alignment in Full-Duplex Speech Models Paper • 2606.11167 • Published 6 days ago • 3
Interactivity Alignment Collection Full-duplex speech models post-trained with reinforcement learning for improved conversational interactivity. • 4 items • Updated 6 days ago • 5
Stateful Conformer with Cache-based Inference for Streaming Automatic Speech Recognition Paper • 2312.17279 • Published Dec 27, 2023 • 4
view article Article How to Fine-Tune Nemotron 3.5 ASR for Your Language, Domain, or Accent nvidia • 11 days ago • 57
CoreML Speech Models Collection Speech AI models for Apple Neural Engine via CoreML. iOS/macOS ready. ASR, TTS, VAD, diarization. • 25 items • Updated 1 day ago • 4
MLX Speech Models Collection Speech AI models for Apple Silicon via MLX. ASR, TTS, VAD, diarization, speaker embedding. • 56 items • Updated 1 day ago • 5
Unified Panoramic Geometry Estimation via Multi-View Foundation Models Paper • 2605.26368 • Published 22 days ago • 4
CubePart: An Open-Vocabulary Part-Controllable 3D Generator Paper • 2605.28763 • Published 20 days ago • 14
From Pixels to Words -- Towards Native One-Vision Models at Scale Paper • 2605.28820 • Published 20 days ago • 73
Gamma-World: Generative Multi-Agent World Modeling Beyond Two Players Paper • 2605.28816 • Published 20 days ago • 423
EdgeSAM: Prompt-In-the-Loop Distillation for On-Device Deployment of SAM Paper • 2312.06660 • Published Dec 11, 2023 • 2