LogoLMSpeed
  • Home
  • Free
  • Models
  • Providers
  • Docs
LogoLMSpeed
LogoLMSpeed

The best API speed test tool

GitHubGitHubTwitterX (Twitter)Email
Product
  • Features
  • Pricing
  • FAQ
Leaderboard
  • Overview
  • Speed Ranking
  • Latency Ranking
  • Health Ranking
  • Model Pricing
  • Model Speed
  • Reasoning
  • Coding
Models
  • All Models
  • GPT
  • Claude
  • Gemini
  • DeepSeek
  • Llama
  • Qwen
Free Models
  • All Free Models
  • Free GPT
  • Free Claude
  • Free Gemini
  • Free DeepSeek
  • Free Llama
  • Free Qwen
Resources
  • Speed Test
  • Provider Directory
  • Documentation
Legal
  • Cookie Policy
  • Privacy Policy
  • Terms of Service
© 2026 LMSpeed All Rights Reserved.Made by Nexmoe with ❤️

Model Library

Browse canonical models across providers with performance and coverage highlights.

Visible models
42
Active models
304
Providers covered
296
Model variants
4113
Showing 1-24 of 42 models
42 modelsClear

GeminiGemini 2.5 Pro DeepSearch

Google Gemini 2.5 Pro DeepSearch is a search-augmented language model in the Gemini series, integrating web retrieval to provide up-to-date answers.

Input price

From $0.137/M

Avg speed

—

First token

—

Providers

8

GeminiGemini 3.0 Pro Image

Google Gemini 3.0 Pro Image is an image generation model, capable of producing images from text prompts.

Input price

From $0.100/M

Avg speed

—

First token

—

Providers

5

GeminiGemini 1.5 Pro 002

Google Gemini 1.5 Pro 002 is a high-capability language model in the Gemini series, offering enhanced reasoning, code generation, and multimodal capabilities.

Input price

From $0.548/M

Avg speed

—

First token

—

Providers

8

GeminiGemini 2.5 Flash DeepSearch

Google Gemini 2.5 Flash DeepSearch is a search-augmented language model in the Gemini series, integrating web retrieval to provide up-to-date answers.

Input price

From $0.068/M

Avg speed

—

First token

—

Providers

7

GeminiGemini 3.0 Pro

Google Gemini 3.0 Pro is a high-capability language model in the Gemini series, offering enhanced reasoning, code generation, and multimodal capabilities.

Input price

From $29.20/M

Avg speed

—

First token

—

Providers

7

GeminiGemini Robotics ER 1.5

Google Gemini Robotics ER 1.5 is a robotics-focused multimodal model in the Gemini series, designed for embodied reasoning and physical task planning.

Input price

From $0.0001/M

Avg speed

—

First token

—

Providers

20

GeminiGemini 3 Pro DeepSearch

Google Gemini 3 Pro DeepSearch is a search-augmented language model in the Gemini series, integrating web retrieval to provide up-to-date answers.

Input price

From $0.205/M

Avg speed

—

First token

—

Providers

7

GeminiGemini 2.5 Flash Live

Google Gemini 2.5 Flash Live is a realtime audio model in the Gemini series, supporting low-latency speech and conversational interactions.

Input price

From $0.100/M

Avg speed

—

First token

—

Providers

4

GeminiGemini 2.5 Pro 1m

Google Gemini 2.5 Pro 1m is a high-capability language model in the Gemini series, offering enhanced reasoning, code generation, and multimodal capabilities.

Input price

From $75.00/M

Avg speed

—

First token

—

Providers

5

GeminiGemini 3.1 Flash

Google Gemini 3.1 Flash is a fast and efficient language model in the Gemini series, optimized for quick responses and high throughput.

Input price

From $0.021/M

Avg speed

—

First token

—

Providers

17

GeminiGemini Embedding

Google Gemini Embedding is an embedding model, designed for generating vector representations of text for retrieval and semantic search.

Input price

From $0.490/M

Avg speed

—

First token

—

Providers

15

GeminiGemini 2.0 Flash Live 001

Google Gemini 2.0 Flash Live 001 is a realtime audio model in the Gemini series, supporting low-latency speech and conversational interactions.

Input price

From $0.200/M

Avg speed

—

First token

—

Providers

5

GeminiGemini Pro Vision

Google Gemini Pro Vision is a multimodal vision-language model in the Gemini series, supporting both text and image understanding.

Input price

From $0.548/M

Avg speed

—

First token

—

Providers

5

GeminiGemini Live 2.5 Flash

Google Gemini Live 2.5 Flash is a realtime audio model in the Gemini series, supporting low-latency speech and conversational interactions.

Input price

From $1.47/M

Avg speed

—

First token

—

Providers

4

GeminiGemini Imagen

Google Gemini Imagen is an image generation model, capable of producing images from text prompts.

Input price

From $0.014/M

Avg speed

—

First token

—

Providers

3

GeminiGemini 3.1 Pro

Google Gemini 3.1 Pro is a Gemini 3 series model with advanced multimodal reasoning, long-context support, and strong performance on coding and analytical tasks.

Input price+2 free

From $0.0001/M

Avg speed

93 t/s

First token

15.42s

Providers

203

GeminiGemini Embedding 2

Google Gemini Embedding 2 is an embedding model, designed for generating vector representations of text for retrieval and semantic search.

Input price+1 free

From $0.0082/M

Avg speed

—

First token

—

Providers

36

GeminiGemini 1.0 Pro Vision

Google Gemini 1.0 Pro Vision is a multimodal vision-language model in the Gemini series, supporting both text and image understanding.

Input price

From $0.049/M

Avg speed

—

First token

—

Providers

3

GeminiGemini 1.5 Flash 002

Google Gemini 1.5 Flash 002 is a fast and efficient language model in the Gemini series, optimized for quick responses and high throughput.

Input price

From $0.150/M

Avg speed

155 t/s

First token

1.62s

Providers

5

GeminiGemini 3.0 Flash

Google Gemini 3.0 Flash is a fast and efficient language model in the Gemini series, optimized for quick responses and high throughput.

Input price

From $0.274/M

Avg speed

28 t/s

First token

20.33s

Providers

10

GeminiGemini 1.5 Flash

Google Gemini 1.5 Flash is a fast and efficient language model in the Gemini series, optimized for quick responses and high throughput.

Input price

From $0.0005/M

Avg speed

200 t/s

First token

1.17s

Providers

24

GeminiGemini Pro

Google Gemini Pro is a high-capability language model in the Gemini series, offering enhanced reasoning, code generation, and multimodal capabilities.

Input price

From $0.014/M

Avg speed

—

First token

—

Providers

33

GeminiGemini 2.0 Pro

Google Gemini 2.0 Pro is a high-capability language model in the Gemini series, offering enhanced reasoning, code generation, and multimodal capabilities.

Input price

From $0.014/M

Avg speed

67 t/s

First token

8.47s

Providers

19

GeminiGemini 2.0 Flash Lite

Google Gemini 2.0 Flash Lite is a lightweight and cost-efficient language model in the Gemini series, optimized for fast responses at reduced cost.

Input price

From $0.0090/M

Avg speed

182 t/s

First token

1.49s

Providers

45

  • 1
  • 2
+11 more
+9 more
+5 more
+145 more
May 24
+22 more
Feb 9
+1 more
Feb 16
+14 more
Feb 20
+23 more
+9 more
May 4
+27 more
Jan 1