Edit Models filters

Apps

Docker Model Runner

Inference Providers

OVHcloud AI Endpoints

HF Inference API

Misc

Inference Endpoints

text-generation-inference

4-bit precision

8-bit precision

text-embeddings-inference

Mixture of Experts

Carbon Emissions

Models

93

Full-text search

Active filters: modelopt

DataSnake/Wayfarer-2-12B-NVFP4

Text Generation • 7B • Updated 10 days ago • 69

wangqia0309/Captain-Eris_Violet-V0.420-12B-FP8-KV-modelopt

12B • Updated Nov 29, 2025 • 10

rahtml/Qwen3-Coder-30B-A3B-Instruct-NVFP4

16B • Updated 26 days ago • 118

nvidia/Kimi-K2-Thinking-NVFP4

Text Generation • Updated 23 days ago • 10k • 15

eousphoros/DeepSeek-V3.2-NVFP4

Text Generation • 387B • Updated Dec 3, 2025 • 348 • 5

zhuyksir/qwen3_30b_a3b_nvfp4_baseline

16B • Updated about 1 month ago • 3

zhuyksir/qwen3_30b_a3b_nvfp4_qat

16B • Updated 25 days ago • 25

alphatozeta/sglang_glm_4_6_fp4_modelopt

177B • Updated 30 days ago • 307

ericlewis/Nemotron-Orchestrator-8B-NVFP4

Text Generation • 5B • Updated 26 days ago • 85

trithemius/Velvet-14B-nvfp4

8B • Updated 24 days ago • 17

OPENZEKA/Qwen3-4B-Instruct-2507-NVFP4

2B • Updated 10 days ago • 75

Z841973620/Qwen3-30B-A3B-NVFP4

Text Generation • 16B • Updated 20 days ago • 7

Z841973620/Qwen3-30B-A3B-FP8

Text Generation • 31B • Updated 20 days ago • 5

OPENZEKA/Qwen3-Coder-30B-A3B-Instruct-NVFP4

Text Generation • 16B • Updated 10 days ago • 52

josephdowling10/Mixtral-8x7B-Instruct-v0.1-NVFP4

Text Generation • 23B • Updated 19 days ago • 4

taharmasmaliyev07/Llama-2-7b-hf-fp8

7B • Updated 18 days ago • 49

OPENZEKA/Qwen3-Coder-480B-A35B-Instruct-NVFP4

241B • Updated 10 days ago • 130

Shifusen/Llama-3.3-70B-Instruct-abliterated-NVFP4-modelopt

36B • Updated 17 days ago • 29

taharmasmaliyev07/Mistral-7B-v0.1-fp8

7B • Updated 17 days ago • 8

taharmasmaliyev07/Llama-3.1-8B-fp8

8B • Updated 17 days ago • 10

taharmasmaliyev07/gemma-2-9b-it-fp8

9B • Updated 17 days ago • 7

cybermotaz/qwen3-vl-2b-thinking-nvfp4-w4a16

Image-Text-to-Text • 2B • Updated 17 days ago • 128 • 1

cybermotaz/qwen3-vl-4b-thinking-nvfp4-w4a16

Image-Text-to-Text • 3B • Updated 17 days ago • 6

cybermotaz/qwen3-vl-8b-thinking-nvfp4-w4a16

Image-Text-to-Text • 5B • Updated 17 days ago • 29 • 1

CedricHwang/qwen2.5-0.5b-modelopt-fp8-pc-pt

Text Generation • 0.5B • Updated 16 days ago • 46

CedricHwang/qwen2.5-0.5b-modelopt-fp8-pb-wo

0.5B • Updated 16 days ago • 45

stepnoy/gpt-oss-120b-NVFP4

117B • Updated 15 days ago • 12

baseten-admin/glm-4.7-fp4

183B • Updated 11 days ago • 185

ericlewis/functiongemma-270m-it-nvfp4

0.2B • Updated 12 days ago • 9

cybermotaz/Qwen3-VL-32B-Instruct-NVFP4

Image-Text-to-Text • 18B • Updated 11 days ago • 40