⚡ Hit your SLA, cut costs. Download the Friendli Guide to Inference Performance Optimization ➜
Product
Solutions
Developers
Company
Featured models
All models
580,138 results found
Trending
Model Name
Input
Output
Type
moonshotai
Kimi-K2.7-Code
Base
Deploy
MiniMaxAI
MiniMax-M3
nvidia
Orchestrator-8B
Fine-tuned
microsoft
FastContext-1.0-4B-SFT
lordx64
Qwable-v1
nex-agi
Nex-N2-Pro
empero-ai
Qwythos-9B-Claude-Mythos-5-1M
zai-org
GLM-5.2
google
gemma-4-12B-it
GLM-5.1
gemma-4-31B-it
GLM-5
datalab-to
lift
Qwen
Qwen3.6-35B-A3B
huihui-ai
Huihui-gemma-4-12B-coder-fable5-composer2.5-v1-abliterated
Qwen3.6-27B
GLM-4.6
FastContext-1.0-4B-RL
meta-llama
Llama-3.1-8B-Instruct
black-forest-labs
FLUX.1-dev
OBLITERATUS
Gemma-4-12B-OBLITERATED
Quantized
Nex-N2-mini
mistralai
Magistral-Small-2506
sakamakismile
gemma-4-12B-coder-fable5-composer2.5-MTP-NVFP4
gemma-4-12B
skt
A.X-3.1
prefeitura-rio
Rio-3.5-Open-397B
TeichAI
Qwen3.6-27B-Fable-5-Experimental
gemma-4-E2B-it
Qwen3-235B-A22B-Thinking-2507
Qwen3-235B-A22B-Instruct-2507
gemma-4-E4B-it
gemma-4-26B-A4B-it
THUDM
GLM-4.1V-9B-Thinking
deepseek-ai
DeepSeek-R1
DJLougen
Qwable-5-27B-Coder
0xSero
MiniMax-M2.1-REAP-50-W4A16
Qwen3-0.6B
openai
whisper-large-v3
yuxinlu1
gemma-4-12B-coder-fable5-composer2.5-v1
NVIDIA-Nemotron-3-Ultra-550B-A55B-NVFP4
Qwen3.5-4B