Active filter: 4-bit

| Model | Task | Params | Downloads | Likes |
| --- | --- | --- | --- | --- |
| 0xSero/GLM-4.7-REAP-50-W4A16 | Text Generation | 2B | 1.84k | 50 |
| 0xSero/MiniMax-M2.1-REAP-50-W4A16-REPAIR-IN-PROGRESS | Text Generation | 17B | 1.96k | 27 |
| unsloth/gemma-3-12b-it-bnb-4bit | Any-to-Any | 13B | 6.92k | 23 |
| mlx-community/GLM-4.7-REAP-50-mxfp4 | Text Generation | 185B | 1.47k | 22 |
| mlx-community/IQuest-Coder-V1-40B-Loop-Instruct-4bit | Text Generation | 40B | 1.38k | 10 |
| — | Text Generation | 358B | 20.1k | 18 |
| Intel/GLM-4.7-int4-mixed-AutoRound | Text Generation | 2B | 193 | 24 |
| LiquidAI/LFM2.5-1.2B-Instruct-MLX-4bit | Text Generation | 0.2B | 220 | 5 |
| LiquidAI/LFM2.5-1.2B-JP-MLX-4bit | Text Generation | 0.2B | 104 | 4 |
| Disty0/LTX-2-SDNQ-4bit-dynamic | — | — | 92 | 4 |
| Disty0/Z-Image-Turbo-SDNQ-uint4-svd-r32 | Text-to-Image | — | 56.7k | 52 |
| mlx-community/Youtu-LLM-2B-mlx-4bit | Text Generation | 0.3B | 111 | 3 |
| mlx-community/Falcon-H1R-7B-4bit | Text Generation | 1B | 190 | 3 |
| MaziyarPanahi/gemma-7b-GGUF | Text Generation | 9B | 1.33k | 15 |
| MaziyarPanahi/Mistral-7B-Instruct-v0.3-GGUF | Text Generation | 7B | 137k | 131 |
| hugging-quants/Meta-Llama-3.1-8B-Instruct-AWQ-INT4 | Text Generation | 8B | 143k | 84 |
| MaziyarPanahi/gemma-3-4b-it-GGUF | Text Generation | 4B | 166k | 16 |
| gaunernst/gemma-3-4b-it-int4-awq | Image-Text-to-Text | — | 40.3k | 5 |
| unsloth/gemma-3-12b-it-qat-bnb-4bit | Image-Text-to-Text | 13B | 773 | 4 |
| unsloth/Qwen3-14B-unsloth-bnb-4bit | Text Generation | 15B | 69k | 14 |
| unsloth/Qwen3-8B-unsloth-bnb-4bit | — | 8B | 169k | 14 |
| unsloth/Qwen3-VL-8B-Instruct-unsloth-bnb-4bit | Image-Text-to-Text | 9B | 52.3k | 15 |
| MaziyarPanahi/Nemotron-Orchestrator-8B-GGUF | Text Generation | 8B | 58.3k | 4 |
| — | Text-to-Image | — | — | 4 |
| mbakgun/Qwen2.5-Coder-14B-n8n-Workflow-Generator | Text Generation | 15B | 941 | 4 |
| QuantTrio/MiniMax-M2.1-AWQ | Text Generation | 229B | 6.45k | 8 |
| tencent/HY-MT1.5-1.8B-GPTQ-Int4 | Translation | 2B | 820 | 11 |
| tencent/HY-MT1.5-7B-GPTQ-Int4 | Translation | 8B | 583 | 7 |
| mlx-community/Youtu-LLM-2B-4bit | Text Generation | 0.3B | 200 | 3 |
| mlx-community/IQuest-Coder-V1-40B-Instruct-4bit | Text Generation | 40B | 925 | 2 |

Cells marked "—" were missing from the source listing; "Updated" timestamps were not captured and are omitted.