Sirui Zhang

zsr200901

AI & ML interests

None yet

Recent Activity

upvoted a paper 8 days ago

Semantic Routing: Exploring Multi-Layer LLM Feature Weighting for Diffusion Transformers

liked a dataset 10 days ago

cais/mmlu

upvoted a paper 11 days ago

Stabilizing Reinforcement Learning with LLMs: Formulation and Practices

View all activity

Organizations

upvoted a paper 8 days ago

Semantic Routing: Exploring Multi-Layer LLM Feature Weighting for Diffusion Transformers

Paper • 2602.03510 • Published 10 days ago • 27

liked a dataset 10 days ago

cais/mmlu

Viewer • Updated Mar 8, 2024 • 231k • 310k • 672

upvoted a paper 11 days ago

Stabilizing Reinforcement Learning with LLMs: Formulation and Practices

Paper • 2512.01374 • Published Dec 1, 2025 • 105

upvoted a paper 29 days ago

GDPO: Group reward-Decoupled Normalization Policy Optimization for Multi-reward RL Optimization

Paper • 2601.05242 • Published Jan 8 • 225

upvoted a paper about 1 month ago

PaCoRe: Learning to Scale Test-Time Compute with Parallel Coordinated Reasoning

Paper • 2601.05593 • Published Jan 9 • 84

liked a model about 1 month ago

lmsys/gpt-oss-20b-bf16

21B • Updated Aug 18, 2025 • 46.8k • 11

liked 2 datasets about 2 months ago

k-mktr/improved-flux-prompts-photoreal-portrait

Viewer • Updated Oct 3, 2024 • 20k • 414 • 116

conorcl/portraits-512

Viewer • Updated Dec 30, 2022 • 2.92k • 70 • 8

upvoted 2 papers 2 months ago

Distribution Matching Variational AutoEncoder

Paper • 2512.07778 • Published Dec 8, 2025 • 29

Semantics Lead the Way: Harmonizing Semantic and Texture Modeling with Asynchronous Latent Diffusion

Paper • 2512.04926 • Published Dec 4, 2025 • 42

upvoted 2 papers 3 months ago

Back to Basics: Let Denoising Generative Models Denoise

Paper • 2511.13720 • Published Nov 17, 2025 • 69

Thinking with Video: Video Generation as a Promising Multimodal Reasoning Paradigm

Paper • 2511.04570 • Published Nov 6, 2025 • 216

upvoted 2 papers 4 months ago

DeepSeek-OCR: Contexts Optical Compression

Paper • 2510.18234 • Published Oct 21, 2025 • 92

RL makes MLLMs see better than SFT

Paper • 2510.16333 • Published Oct 18, 2025 • 49

updated a collection 5 months ago

VLA

Collection

2 items • Updated Sep 15, 2025

upvoted a paper 5 months ago

R-4B: Incentivizing General-Purpose Auto-Thinking Capability in MLLMs via Bi-Mode Annealing and Reinforce Learning

Paper • 2508.21113 • Published Aug 28, 2025 • 110

upvoted 4 papers 6 months ago

The Promise of RL for Autoregressive Image Editing

Paper • 2508.01119 • Published Aug 1, 2025 • 11

Is Chain-of-Thought Reasoning of LLMs a Mirage? A Data Distribution Lens

Paper • 2508.01191 • Published Aug 2, 2025 • 238

EDGE-GRPO: Entropy-Driven GRPO with Guided Error Correction for Advantage Diversity

Paper • 2507.21848 • Published Jul 29, 2025 • 9

Gaussian Variation Field Diffusion for High-fidelity Video-to-4D Synthesis

Paper • 2507.23785 • Published Jul 31, 2025 • 18

Sirui Zhang

AI & ML interests

Recent Activity

Organizations

zsr200901's activity