nvidia/NVIDIA-Nemotron-3-Super-120B-A12B-BF16 Text Generation • 124B • Updated 2 days ago • 139k • 300
Running Featured 69 QED-Nano: Teaching a Tiny Model to Prove Hard Theorems 📝 69 Who needs 1T parameters? Olympiad proofs with a 4B model