NEW
Articles from
Team
or
Enterprise organizations will get promoted to the main section.
Announcing Moonshine Voice
•
1
What superpower does Kimi-K2.5 bring to the table?
•
1
Forge: Scalable Agent RL Framework and Algorithm
•
101
Compute and Competition in AI: Different FlOPs for Different Folks
•
1
LateOn-Code & ColGrep: LightOn unveils state-of-the-art code retrieval models and code search tooling
•
37
Microgpt
•
1
How to Use Multiple GPUs in Hugging Face Transformers: Device Map vs Tensor Parallelism
•
12
🚀 DTS: A Candidate for the Best Parallel Reasoning in LLMs
•
13
SeedVR2 and FlashVSR+ Studio Level Image and Video Upscaler Pro Released
Enabling Large Scale RLHF of GPTOSS with Megatron backend in VeRL
•
3
2026 Agentic Coding Trends - Implementation Guide (Technical)
•
1
Training Qwen3 VL to label bbox : synthetic data, environment and training analysis
•
5
The Death of the Generalist and Rise of the Swarm
•
1
Scaling Mixture of Experts: Architecture Search for Billion-Parameter Language Models
•
1
Memory vs Storage: Understanding Trade-offs in Cloud-Based Caching
Setting Up a Stable GPU Environment for PyTorch and TensorFlow
2. Attention Optimizations: From Standard Attention to FlashAttention
•
1
2.2c: FlashAttention — IO Analysis and Evolution
Building a Mood-Based Movie Recommendation Engine with Voyage-4-nano, Hugging Face, and MongoDB Atlas Vector Search
•
3