Sladwell (Slad)

upvoted 6 papers 3 months ago

MANZANO: A Simple and Scalable Unified Multimodal Model with a Hybrid Vision Tokenizer

Paper • 2509.16197 • Published Sep 19, 2025 • 56

Latent Zoning Network: A Unified Principle for Generative Modeling, Representation Learning, and Classification

Paper • 2509.15591 • Published Sep 19, 2025 • 45

When Does Reasoning Matter? A Controlled Study of Reasoning's Contribution to Model Performance

Paper • 2509.22193 • Published Sep 26, 2025 • 37

upvoted 4 articles 4 months ago

Article

Small Language Models (SLM): A Comprehensive Overview

Feb 22, 2025

•

115

Article

From Zero to GPU: A Guide to Building and Scaling Production-Ready CUDA Kernels

Aug 18, 2025

•

88

Article

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

Feb 7, 2025

•

267

Article

`LeRobotDataset:v3.0`: Bringing large-scale datasets to `lerobot`

+9

Sep 16, 2025

•

47

upvoted 6 papers 4 months ago

Easy Dataset: A Unified and Extensible Framework for Synthesizing LLM Fine-Tuning Data from Unstructured Documents

Paper • 2507.04009 • Published Jul 5, 2025 • 51

AgentScope 1.0: A Developer-Centric Framework for Building Agentic Applications

Paper • 2508.16279 • Published Aug 22, 2025 • 53

LatticeWorld: A Multimodal Large Language Model-Empowered Framework for Interactive Complex World Generation

Paper • 2509.05263 • Published Sep 5, 2025 • 10

Symbolic Graphics Programming with Large Language Models

Paper • 2509.05208 • Published Sep 5, 2025 • 46

R-Zero: Self-Evolving Reasoning LLM from Zero Data

Paper • 2508.05004 • Published Aug 7, 2025 • 129

DeepSeek-R1 Thoughtology: Let's <think> about LLM Reasoning

Paper • 2504.07128 • Published Apr 2, 2025 • 87

Slad

AI & ML interests

Organizations

MANZANO: A Simple and Scalable Unified Multimodal Model with a Hybrid Vision Tokenizer

Latent Zoning Network: A Unified Principle for Generative Modeling, Representation Learning, and Classification

When Does Reasoning Matter? A Controlled Study of Reasoning's Contribution to Model Performance

PromptCoT 2.0: Scaling Prompt Synthesis for Large Language Model Reasoning

Qwen3-Omni Technical Report

ReSum: Unlocking Long-Horizon Search Intelligence via Context Summarization

Small Language Models (SLM): A Comprehensive Overview

From Zero to GPU: A Guide to Building and Scaling Production-Ready CUDA Kernels

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

`LeRobotDataset:v3.0`: Bringing large-scale datasets to `lerobot`

Easy Dataset: A Unified and Extensible Framework for Synthesizing LLM Fine-Tuning Data from Unstructured Documents

AgentScope 1.0: A Developer-Centric Framework for Building Agentic Applications

LatticeWorld: A Multimodal Large Language Model-Empowered Framework for Interactive Complex World Generation

Symbolic Graphics Programming with Large Language Models

R-Zero: Self-Evolving Reasoning LLM from Zero Data

DeepSeek-R1 Thoughtology: Let's <think> about LLM Reasoning

Slad

AI & ML interests

Organizations

Sladwell's activity

Small Language Models (SLM): A Comprehensive Overview

From Zero to GPU: A Guide to Building and Scaling Production-Ready CUDA Kernels

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

`LeRobotDataset:v3.0`: Bringing large-scale datasets to `lerobot`