merve's picture

Building on HF

merve PRO

merve

huggingface

·

https://github.com/merveenoyan/smol-vision

AI & ML interests

I love this website VLMs, vision & co

Recent Activity

new activity 2 days ago

mistral-hackaton-2026/README:Jobs/TRL Community Feedback

upvoted an article 4 days ago

Mixture of Experts (MoEs) in Transformers

published an article 4 days ago

Mixture of Experts (MoEs) in Transformers

View all activity

Organizations

upvoted an article 4 days ago

Article

Mixture of Experts (MoEs) in Transformers

+5

4 days ago

•

99

upvoted an article 7 days ago

Article

IBM and UC Berkeley Diagnose Why Enterprise Agents Fail Using IT-Bench and MAST

12 days ago

•

16

upvoted a collection 13 days ago

Tiny Aya

Bridging Scale and Multilingual Depth • 10 items • Updated 13 days ago • 62

upvoted an article 22 days ago

Article

From Golden Gate Bridge to Broken JSON: Why Anthropic's SAE Steering Fails for Structured Output

22 days ago

•

21

upvoted an article 24 days ago

Article

Community Evals: Because we're done trusting black-box leaderboards over the community

+5

26 days ago

•

82

upvoted a changelog 24 days ago

Changelog

Community Evals and Benchmark Repositories

25 days ago

• 65

upvoted 2 articles 24 days ago

Article

🚀 SyGra V2.0.0

25 days ago

•

8

Article

Introducing SyGra Studio

25 days ago

•

25

upvoted 3 articles 26 days ago

Article

Nemotron ColEmbed V2: Raising the Bar for Multimodal Retrieval with ViDoRe V3’s Top Model

26 days ago

•

28

Article

Training Design for Text-to-Image Models: Lessons from Ablations

27 days ago

•

65

Article

H Company's new Holo2 model takes the lead in UI Localization

27 days ago

•

5

upvoted a paper 28 days ago

C-RADIOv4 (Tech Report)

Paper • 2601.17237 • Published Jan 24 • 10

upvoted a collection 28 days ago

Open Coding Agents

12 items • Updated 20 days ago • 49

upvoted an article about 1 month ago

Article

Llasa Goes RL: Training LLaSA with GRPO for Improved Prosody and Expressiveness

Nov 5, 2025

•

12

upvoted a collection about 1 month ago

Nemotron ColEmbed V2

State-of-the-Art Late Interaction Vision-Language Embedding Models • 3 items • Updated 6 days ago • 10

upvoted 3 articles about 1 month ago

Article

Security, Governance and Performance for Dell On-Prem AI Builders

Jan 21

•

7

Article

AssetOpsBench: Bridging the Gap Between AI Agent Benchmarks and Industrial Reality

Jan 21

•

31

Article

LightOnOCR-2-1B: a lightweight high-performance end-to-end OCR model family

Jan 19

•

85

upvoted 2 collections about 1 month ago

LightOnOCR-2 🦉

LightOnOCR-2-1B: a lightweight high-performance end-to-end OCR model family • 12 items • Updated 11 days ago • 22

Kanana-2

Open Source Kanana-2 • 30 items • Updated Jan 27 • 36