16 18 16

Alex Jinpeng Wang

Awiny

https://fingerrec.github.io

FingerRec

AI & ML interests

Multi-Modality Pre-training, Data-Centric AI, Video Self-supervised Learning

Recent Activity

liked a model about 1 month ago

CSU-JPG/Glance

upvoted a paper about 1 month ago

Glance: Accelerating Diffusion Models with 1 Sample

upvoted a paper about 2 months ago

WEAVE: Unleashing and Benchmarking the In-context Interleaved Comprehension and Generation

View all activity

Organizations

liked a model about 1 month ago

CSU-JPG/Glance

Text-to-Image • Updated 18 days ago • 401 • • 14

upvoted a paper about 1 month ago

Glance: Accelerating Diffusion Models with 1 Sample

Paper • 2512.02899 • Published about 1 month ago • 28

upvoted a paper about 2 months ago

WEAVE: Unleashing and Benchmarking the In-context Interleaved Comprehension and Generation

Paper • 2511.11434 • Published Nov 14, 2025 • 44

liked a Space about 2 months ago

VCode

🐨

Convert images to SVG code

updated a Space about 2 months ago

README

📈

liked a dataset about 2 months ago

CSU-JPG/Chart2Code

Updated 20 days ago • 288 • 4

updated a collection about 2 months ago

🔱 Sailor2 Language Models

Collection

Sailing in South-East Asia with Inclusive Multilingual LLMs • 34 items • Updated Nov 19, 2025 • 30

upvoted 2 papers about 2 months ago

VCode: a Multimodal Coding Benchmark with SVG as Symbolic Visual Representation

Paper • 2511.02778 • Published Nov 4, 2025 • 101

UniLumos: Fast and Unified Image and Video Relighting with Physics-Plausible Feedback

Paper • 2511.01678 • Published Nov 3, 2025 • 35

upvoted a paper 2 months ago

From Charts to Code: A Hierarchical Benchmark for Multimodal Models

Paper • 2510.17932 • Published Oct 20, 2025 • 7

New activity in deepseek-ai/DeepSeek-OCR 2 months ago

Clarifying Prior Research on Visual Compression of Textual Contexts

❤️ 👍 14

#18 opened 2 months ago by

Awiny

upvoted a paper 3 months ago

Paper2Video: Automatic Video Generation from Scientific Papers

Paper • 2510.05096 • Published Oct 6, 2025 • 118

liked a dataset 6 months ago

CSU-JPG/MVPBench

Viewer • Updated May 15, 2025 • 4.7k • 36 • 1

liked a model 6 months ago

showlab/show-o2-1.5B-HQ

Any-to-Any • Updated Sep 5, 2025 • 69 • 3

authored a paper 9 months ago

V-MAGE: A Game Evaluation Framework for Assessing Visual-Centric Capabilities in Multimodal Large Language Models

Paper • 2504.06148 • Published Apr 8, 2025 • 13

upvoted a paper 9 months ago

V-MAGE: A Game Evaluation Framework for Assessing Visual-Centric Capabilities in Multimodal Large Language Models

Paper • 2504.06148 • Published Apr 8, 2025 • 13

commented a paper 9 months ago

V-MAGE: A Game Evaluation Framework for Assessing Visual-Centric Capabilities in Multimodal Large Language Models

Paper • 2504.06148 • Published Apr 8, 2025 • 13 •

authored a paper 9 months ago

Beyond Words: Advancing Long-Text Image Generation via Multimodal Autoregressive Models

Paper • 2503.20198 • Published Mar 26, 2025 • 4

upvoted a paper 9 months ago

Beyond Words: Advancing Long-Text Image Generation via Multimodal Autoregressive Models

Paper • 2503.20198 • Published Mar 26, 2025 • 4

commented a paper 9 months ago

Beyond Words: Advancing Long-Text Image Generation via Multimodal Autoregressive Models

Paper • 2503.20198 • Published Mar 26, 2025 • 4 •

Alex Jinpeng Wang

AI & ML interests

Recent Activity

Organizations

Awiny's activity

VCode

README

Clarifying Prior Research on Visual Compression of Textual Contexts