04-16 Lightning OPD: Efficient Post-Training for Large Reasoning Models with Offline On-Policy Distillation
04-16 Nemotron 3 Super: Open, Efficient Mixture-of-Experts Hybrid Mamba-Transformer Model for Agentic Reasoning
12-25 Read Like Humans: Autonomous, Bidirectional and Iterative LanguageModeling for Scene Text Recognition