nvidia/NVIDIA-Nemotron-3-Super-120B-A12B-FP8 Text Generation • 124B • Updated 4 days ago • 649k • 189
Helios Collection Helios: 14B Real-Time Long Video Generation Model can be Cheaper, Faster but Keep Stronger than 1.3B ones • 7 items • Updated 8 days ago • 24
EricRollei/HunyuanImage-3.0-Instruct-INT8-v2 Text-to-Image • 83B • Updated 30 days ago • 265 • 1
view post Post 1268 From Golden Gate Bridge to Broken JSON: Why Anthropic's SAE Steering Fails for Structured OutputI ran 6 experiments trying to use Anthropic's SAE steering for JSON generation.- Base model: 86.8% valid JSON- Steering only: 24.4%- Fine-tuned: 96.6%- FSM constrained: 100%Steering is for semantics, not syntax.https://huggingface.co/blog/MaziyarPanahi/sae-steering-json See translation 👀 2 2 🚀 1 1 🤯 1 1 + Reply