AI Model Comparison

Highlights

Charts (not reproduced here):

Quality: Artificial Analysis Quality Index; higher is better
Speed: Output Tokens per Second; higher is better
Price: USD per 1M Tokens; lower is better

Summary

Quality:

o1-preview and o1-mini are the highest quality models, followed by Gemini 1.5 Pro (Sep '24) & GPT-4o.

Output Speed (tokens/s):

Gemma 7B (917 t/s) and Llama 3.2 1B (485 t/s) are the fastest models, followed by Gemini 1.5 Flash (May '24) & Llama 3.1 8B.

Latency (seconds):

Sonar Large (0.00s) and Sonar Small (0.00s) are the lowest-latency models, followed by Reka Edge and Sonar 3.1 Small.

Price ($ per M tokens):

OpenChat 3.5 ($0.06) and Gemma 7B ($0.07) are the cheapest models, followed by Llama 3.2 3B & Llama 3.2 1B.

Context Window:

Gemini 1.5 Pro (Sep '24) and Gemini 1.5 Pro (May '24) have the largest context windows (2M tokens each), followed by Gemini 1.5 Flash (Sep '24) and Gemini 1.5 Flash (May '24).

Model | Creator | Quality Index | Context Length (tokens) | Input Price (USD per 1M tokens) | Output Price (USD per 1M tokens)
--- | --- | --- | --- | --- | ---
o1-preview | OpenAI | 85 | 128000 | 15.00 | 60.00
o1-mini | OpenAI | 82 | 128000 | 3.00 | 12.00
Gemini 1.5 Pro (Sep '24) | Google | 80 | 2000000 | 1.25 | 5.00
GPT-4o | OpenAI | 77 | 128000 | 2.50 | 10.00
GPT-4o (May '24) | OpenAI | 77 | 128000 | 5.00 | 15.00
Claude 3.5 Sonnet | Anthropic | 77 | 200000 | 3.00 | 15.00
Qwen2.5 72B | Alibaba | 75 | 131000 | 0.38 | 0.40
GPT-4 Turbo | OpenAI | 74 | 128000 | 10.00 | 30.00
Gemini 1.5 Flash (Sep '24) | Google | 73 | 1000000 | 0.07 | 0.30
Mistral Large 2 | Mistral | 73 | 128000 | 3.00 | 9.00
Llama 3.1 405B | Meta | 72 | 128000 | 4.00 | 9.00
GPT-4o mini | OpenAI | 71 | 128000 | 0.15 | 0.60
Claude 3 Opus | Anthropic | 70 | 200000 | 15.00 | 75.00
Qwen2 72B | Alibaba | 69 | 128000 | 0.63 | 0.65
DeepSeek-Coder-V2 | DeepSeek | 67 | 128000 | 0.14 | 0.28
DeepSeek-V2.5 | DeepSeek | 66 | 128000 | 1.07 | 1.14
Llama 3.2 90B (Vision) | Meta | 66 | 128000 | 0.90 | 0.90
DeepSeek-V2 | DeepSeek | 66 | 128000 | 0.14 | 0.28
Llama 3.1 70B | Meta | 65 | 128000 | 0.88 | 0.90
Jamba 1.5 Large | AI21 | 64 | 256000 | 2.00 | 8.00
Llama 3 70B | Meta | 62 | 8000 | 0.88 | 0.90
Sonar Large | Perplexity | 62 | 33000 | 1.00 | 1.00
Gemma 2 27B | Google | 61 | 8000 | 0.80 | 0.80
Mixtral 8x22B | Mistral | 61 | 65000 | 1.20 | 1.20
Mistral Small (Sep '24) | Mistral | 60 | 128000 | 0.20 | 0.60
Yi-Large | 01.AI | 58 | 32000 | 3.00 | 3.00
Claude 3 Sonnet | Anthropic | 57 | 200000 | 3.00 | 15.00
Reka Core | Reka | 57 | 128000 | 2.00 | 10.00
Mistral Large | Mistral | 56 | 33000 | 4.00 | 12.00
Pixtral 12B | Mistral | 56 | 128000 | 0.13 | 0.13
Command-R+ | Cohere | 56 | 128000 | 2.75 | 12.50
Claude 3 Haiku | Anthropic | 54 | 200000 | 0.25 | 1.25
Llama 3.2 11B (Vision) | Meta | 53 | 128000 | 0.19 | 0.19
Llama 3.1 8B | Meta | 53 | 128000 | 0.13 | 0.16
GPT-3.5 Turbo | OpenAI | 52 | 16000 | 0.50 | 1.50
Mistral NeMo | Mistral | 52 | 128000 | 0.15 | 0.15
Command-R | Cohere | 51 | 128000 | 0.33 | 1.05
Mistral Small (Feb '24) | Mistral | 50 | 33000 | 1.00 | 3.00
DBRX | Databricks | 49 | 33000 | 0.97 | 1.73
Llama 3 8B | Meta | 46 | 8000 | 0.14 | 0.20
Llama 3.2 3B | Meta | 46 | 128000 | 0.09 | 0.10
Gemma 2 9B | Google | 46 | 8000 | 0.20 | 0.20
Command-R+ (Apr '24) | Cohere | 46 | 128000 | 3.00 | 15.00
Jamba 1.5 Mini | AI21 | 46 | 256000 | 0.20 | 0.40
Reka Flash | Reka | 46 | 128000 | 0.80 | 2.00
OpenChat 3.5 | OpenChat | 43 | 8000 | 0.06 | 0.06
Mixtral 8x7B | Mistral | 42 | 33000 | 0.47 | 0.55
Sonar Small | Perplexity | 41 | 33000 | 0.20 | 0.20
Llama 2 Chat 70B | Meta | 39 | 4000 | 1.22 | 2.17
Llama 2 Chat 13B | Meta | 36 | 4000 | 0.30 | 0.30
Codestral-Mamba | Mistral | 36 | 256000 | 0.25 | 0.25
Command-R (Mar '24) | Cohere | 36 | 128000 | 0.50 | 1.50
Reka Edge | Reka | 30 | 64000 | 0.40 | 1.00
Jamba Instruct | AI21 | 28 | 256000 | 0.50 | 0.70
Llama 3.2 1B | Meta | 28 | 128000 | 0.07 | 0.09
Mistral 7B | Mistral | 24 | 33000 | 0.15 | 0.20
GPT-4 | OpenAI | N/A | 8000 | 30.00 | 60.00
Llama 2 Chat 7B | Meta | N/A | 4000 | 0.29 | 0.46
Codestral | Mistral | N/A | 33000 | 0.20 | 0.60
GPT-3.5 Turbo Instruct | OpenAI | N/A | 4000 | 1.50 | 2.00
Gemini 1.0 Pro | Google | N/A | 33000 | 0.50 | 1.50
Mistral Medium | Mistral | N/A | 33000 | 2.75 | 8.10
Gemini 1.5 Flash (May '24) | Google | N/A | 1000000 | 0.07 | 0.30
Gemini 1.5 Pro (May '24) | Google | N/A | 2000000 | 3.50 | 10.50
Sonar 3.1 Small | Perplexity | N/A | 131000 | 0.20 | 0.20
Phi-3 Medium 14B | Microsoft | N/A | 128000 | 0.20 | 0.70
Sonar 3.1 Large | Perplexity | N/A | 131000 | 1.00 | 1.00
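
Because input and output tokens are priced separately per 1M tokens, the cost of a single request depends on the mix of prompt and completion tokens. The following is a minimal Python sketch of that arithmetic, using a handful of prices copied from the listing above. The function names (`request_cost_usd`, `compare_costs`) and the example workload of 2,000 input tokens and 500 output tokens are hypothetical, chosen only for illustration; actual provider billing may differ (for example through minimum charges or cached-token discounts).

```python
def request_cost_usd(input_tokens, output_tokens,
                     input_price_per_m, output_price_per_m):
    """Return the USD cost of one request, given prices in USD per 1M tokens."""
    total = input_tokens * input_price_per_m + output_tokens * output_price_per_m
    return total / 1_000_000


# (input price, output price) in USD per 1M tokens, copied from the listing above.
PRICES = {
    "o1-preview": (15.00, 60.00),
    "GPT-4o": (2.50, 10.00),
    "Claude 3.5 Sonnet": (3.00, 15.00),
    "GPT-4o mini": (0.15, 0.60),
}


def compare_costs(input_tokens, output_tokens, prices=PRICES):
    """Return (model, cost) pairs sorted from cheapest to most expensive."""
    costs = {
        model: request_cost_usd(input_tokens, output_tokens, in_price, out_price)
        for model, (in_price, out_price) in prices.items()
    }
    return sorted(costs.items(), key=lambda item: item[1])


if __name__ == "__main__":
    # Hypothetical workload: 2,000 prompt tokens and 500 completion tokens.
    for model, cost in compare_costs(2_000, 500):
        print(f"{model}: ${cost:.4f} per request")
```

For workloads dominated by long prompts, the input price matters most; for generation-heavy workloads, the output price does, so the ranking between two models can change with the token mix.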