10 17 1

Yonggan Fu PRO

YongganFu

https://www.yongganfu.com/

AI & ML interests

None yet

Recent Activity

upvoted a collection about 20 hours ago

Efficient-DLM

upvoted a paper 2 days ago

Agent Explorative Policy Optimization for Multimodal Agentic Reasoning

upvoted an article 6 days ago

Towards Speed-of-Light Text Generation with Nemotron-Labs Diffusion Language Models

View all activity

Organizations

upvoted a collection about 20 hours ago

Efficient-DLM

Collection

2 items • Updated about 9 hours ago • 4

upvoted a paper 2 days ago

Agent Explorative Policy Optimization for Multimodal Agentic Reasoning

Paper • 2605.28774 • Published 3 days ago • 77

upvoted an article 6 days ago

Article

Towards Speed-of-Light Text Generation with Nemotron-Labs Diffusion Language Models

nvidia

•

7 days ago

• 27

upvoted a collection 10 days ago

Nemotron-Labs-Diffusion

Collection

Set of models of internal diffusion models • 7 items • Updated about 9 hours ago • 42

upvoted a paper 4 months ago

PACEvolve: Enabling Long-Horizon Progress-Aware Consistent Evolution

Paper • 2601.10657 • Published Jan 15 • 20

upvoted a paper 5 months ago

GDPO: Group reward-Decoupled Normalization Policy Optimization for Multi-reward RL Optimization

Paper • 2601.05242 • Published Jan 8 • 232

upvoted 2 papers 6 months ago

ToolOrchestra: Elevating Intelligence via Efficient Model and Tool Orchestration

Paper • 2511.21689 • Published Nov 26, 2025 • 128

Nemotron-Flash: Towards Latency-Optimal Hybrid Small Language Models

Paper • 2511.18890 • Published Nov 24, 2025 • 36

upvoted an article 11 months ago

Article

SmolLM3: smol, multilingual, long-context reasoner

eliebak, cmpatino, anton-l, edbeeching, m-ric, nouamanetazi, akseljoonas, guipenedo, hynky, clefourrier, SaylorTwift, kashif, qgallouedec, hlarcher, glutamatt, Xenova, reach-vb, ngxson, craffel, lewtun, loubnabnl, lvwerra, thomwolf

•

Jul 8, 2025

• 777

upvoted 2 papers about 1 year ago

CLIMB: CLustering-based Iterative Data Mixture Bootstrapping for Language Model Pre-training

Paper • 2504.13161 • Published Apr 17, 2025 • 98

Scaling Vision Pre-Training to 4K Resolution

Paper • 2503.19903 • Published Mar 25, 2025 • 42

upvoted 2 papers over 1 year ago

SANA 1.5: Efficient Scaling of Training-Time and Inference-Time Compute in Linear Diffusion Transformer

Paper • 2501.18427 • Published Jan 30, 2025 • 26

NVILA: Efficient Frontier Visual Language Models

Paper • 2412.04468 • Published Dec 5, 2024 • 61

upvoted a collection over 1 year ago

Hymba

Collection

A series of Hybrid Small Language Models. • 3 items • Updated about 9 hours ago • 34

upvoted a paper over 1 year ago

Hymba: A Hybrid-head Architecture for Small Language Models

Paper • 2411.13676 • Published Nov 20, 2024 • 49

Yonggan Fu PRO

AI & ML interests

Recent Activity

Organizations

YongganFu's activity

Towards Speed-of-Light Text Generation with Nemotron-Labs Diffusion Language Models

SmolLM3: smol, multilingual, long-context reasoner