NERV

与其感慨路难行，不如马上出发

0%

Um..! 30 posts in total. Keep on posting.

2026

04-16

Lightning OPD: Efficient Post-Training for Large Reasoning Models with Offline On-Policy Distillation

04-16

Nemotron 3 Super: Open, Efficient Mixture-of-Experts Hybrid Mamba-Transformer Model for Agentic Reasoning

04-01

Attention Residuals

2023

10-27

Nougat: Neural Optical Understanding for Academic DocumentsA Survey of Large Language Models

06-05

A Survey of Large Language Models

2022

12-25

Read Like Humans: Autonomous, Bidirectional and Iterative LanguageModeling for Scene Text Recognition

12-18

Scaling Language-Image Pre-training via Masking

12-11

Few Could Be Better Than All:Feature Sampling and Grouping for Scene Text Detection

12-04

论文笔记 - DeepSolo: Let Transformer Decoder with Explicit Points Solo for Text Spotting

11-27

论文笔记 - Pure Transformer with Integrated Experts forScene Text Recognition