04-16 Lightning OPD: Efficient Post-Training for Large Reasoning Models with Offline On-Policy Distillation
04-16 Nemotron 3 Super: Open, Efficient Mixture-of-Experts Hybrid Mamba-Transformer Model for Agentic Reasoning