StepFun

company

https://www.stepfun.com/

AI & ML interests

None defined yet.

Recent Activity

frankzeng authored a paper 2 days ago

StyleMe3D: Stylization with Disentangled Priors by Multiple Encoders on 3D Gaussians

frankzeng authored a paper 2 days ago

OmniSVG: A Unified Scalable Vector Graphics Generation Model

frankzeng authored a paper 2 days ago

FAVOR-Bench: A Comprehensive Benchmark for Fine-Grained Video Motion Understanding

View all activity

Papers

Step-Audio-R1.5 Technical Report

GEditBench v2: A Human-Aligned Benchmark for General Image Editing

View all Papers

authored 20 papers 2 days ago

StyleMe3D: Stylization with Disentangled Priors by Multiple Encoders on 3D Gaussians

Paper • 2504.15281 • Published Apr 21, 2025 • 23

OmniSVG: A Unified Scalable Vector Graphics Generation Model

Paper • 2504.06263 • Published Apr 8, 2025 • 186

FAVOR-Bench: A Comprehensive Benchmark for Fine-Grained Video Motion Understanding

Paper • 2503.14935 • Published Mar 19, 2025

Step-Video-TI2V Technical Report: A State-of-the-Art Text-Driven Image-to-Video Generation Model

Paper • 2503.11251 • Published Mar 14, 2025 • 1

MotionAgent: Fine-grained Controllable Video Generation via Motion Field Agent

Paper • 2502.03207 • Published Feb 5, 2025 • 1

Step-Video-T2V Technical Report: The Practice, Challenges, and Future of Video Foundation Model

Paper • 2502.10248 • Published Feb 14, 2025 • 57

MikuDance: Animating Character Art with Mixed Motion Dynamics

Paper • 2411.08656 • Published Nov 13, 2024

MVPaint: Synchronized Multi-View Diffusion for Painting Anything 3D

Paper • 2411.02336 • Published Nov 4, 2024 • 24

MeshXL: Neural Coordinate Field for Generative 3D Foundation Models

Paper • 2405.20853 • Published May 31, 2024 • 1

Michelangelo: Conditional 3D Shape Generation based on Shape-Image-Text Aligned Latent Representation

Paper • 2306.17115 • Published Jun 29, 2023 • 12

Paint3D: Paint Anything 3D with Lighting-Less Texture Diffusion Models

Paper • 2312.13913 • Published Dec 21, 2023 • 24

Step1X-Edit: A Practical Framework for General Image Editing

Paper • 2504.17761 • Published Apr 24, 2025 • 92

KRIS-Bench: Benchmarking Next-Level Intelligent Image Editing Models

Paper • 2505.16707 • Published May 22, 2025 • 44

ViStoryBench: Comprehensive Benchmark Suite for Story Visualization

Paper • 2505.24862 • Published May 30, 2025 • 30

Sparse-vDiT: Unleashing the Power of Sparse Attention to Accelerate Video Diffusion Transformers

Paper • 2506.03065 • Published Jun 3, 2025 • 27

OneIG-Bench: Omni-dimensional Nuanced Evaluation for Image Generation

Paper • 2506.07977 • Published Jun 9, 2025 • 40

NextStep-1: Toward Autoregressive Image Generation with Continuous Tokens at Scale

Paper • 2508.10711 • Published Aug 14, 2025 • 146

SC-Captioner: Improving Image Captioning with Self-Correction by Reinforcement Learning

Paper • 2508.06125 • Published Aug 8, 2025

LazyDrag: Enabling Stable Drag-Based Editing on Multi-Modal Diffusion Transformers via Explicit Correspondence

Paper • 2509.12203 • Published Sep 15, 2025 • 20

WithAnyone: Towards Controllable and ID Consistent Image Generation

Paper • 2510.14975 • Published Oct 16, 2025 • 86