⚡ Hit your SLA, cut costs. Download the Friendli Guide to Inference Performance Optimization ➜

Product

Model APIs Dedicated Endpoints Container Why FriendliAI

Solutions

Coding Agents Chatbots Semantic Search Visual Understanding Audio & Voice Analysis

Developers

Docs Blog Research

Company

About Us Partners News Careers Patents Brand Resources Trust Center Contact Us

HIPAA Compliance

AICPA SOC 2® Type II

SOC 2® Type II

Contact us:

contact@friendli.ai

FriendliAI Corp:

San Francisco, CA

Hub:

Seoul, Korea

SOC 2® Type II

Privacy Policy Service Level Agreement Terms of Service CA Notice

Copyright © 2026 FriendliAI Corp. All rights reserved

Models
Customers
Pricing

Open Models, Ready for Production

Run 580,138 Open Models on the Frontier Inference Cloud.

Featured models

All models

580,138 results found

Model Name

Input

Output

Type

moonshotai

Kimi-K2.7-Code

Base

Deploy

MiniMaxAI

MiniMax-M3

Base

Deploy

nvidia

Orchestrator-8B

Fine-tuned

Deploy

microsoft

FastContext-1.0-4B-SFT

Fine-tuned

Deploy

lordx64

Qwable-v1

Fine-tuned

Deploy

nex-agi

Nex-N2-Pro

Base

Deploy

empero-ai

Qwythos-9B-Claude-Mythos-5-1M

Fine-tuned

Deploy

zai-org

GLM-5.2

Base

Deploy

google

gemma-4-12B-it

Fine-tuned

Deploy

zai-org

GLM-5.1

Base

Deploy

google

gemma-4-31B-it

Fine-tuned

Deploy

zai-org

GLM-5

Base

Deploy

datalab-to

lift

Base

Deploy

Qwen

Qwen3.6-35B-A3B

Base

Deploy

huihui-ai

Huihui-gemma-4-12B-coder-fable5-composer2.5-v1-abliterated

Fine-tuned

Deploy

Qwen

Qwen3.6-27B

Base

Deploy

zai-org

GLM-4.6

Base

Deploy

microsoft

FastContext-1.0-4B-RL

Fine-tuned

Deploy

meta-llama

Llama-3.1-8B-Instruct

Fine-tuned

Deploy

black-forest-labs

FLUX.1-dev

Base

Deploy

OBLITERATUS

Gemma-4-12B-OBLITERATED

Quantized

Deploy

nex-agi

Nex-N2-mini

Base

Deploy

mistralai

Magistral-Small-2506

Fine-tuned

Deploy

sakamakismile

gemma-4-12B-coder-fable5-composer2.5-MTP-NVFP4

Quantized

Deploy

google

gemma-4-12B

Base

Deploy

skt

A.X-3.1

Base

Deploy

prefeitura-rio

Rio-3.5-Open-397B

Fine-tuned

Deploy

TeichAI

Qwen3.6-27B-Fable-5-Experimental

Fine-tuned

Deploy

google

gemma-4-E2B-it

Fine-tuned

Deploy

Qwen

Qwen3-235B-A22B-Thinking-2507

Base

Deploy

Qwen

Qwen3-235B-A22B-Instruct-2507

Base

Deploy

google

gemma-4-E4B-it

Fine-tuned

Deploy

google

gemma-4-26B-A4B-it

Fine-tuned

Deploy

THUDM

GLM-4.1V-9B-Thinking

Fine-tuned

Deploy

deepseek-ai

DeepSeek-R1

Base

Deploy

DJLougen

Qwable-5-27B-Coder

Fine-tuned

Deploy

0xSero

MiniMax-M2.1-REAP-50-W4A16

Base

Deploy

Qwen

Qwen3-0.6B

Fine-tuned

Deploy

openai

whisper-large-v3

Base

Deploy

yuxinlu1

gemma-4-12B-coder-fable5-composer2.5-v1

Fine-tuned

Deploy

nvidia

NVIDIA-Nemotron-3-Ultra-550B-A55B-NVFP4

Base

Deploy

Qwen

Qwen3.5-4B

Fine-tuned

Deploy

Load more models