Models in the plot:

| Provider | Model Name | Model ID | Input Price ($/1K tokens) | Output Price ($/1K tokens) | Total Price ($/1K tokens) | Context Length |
|---|---|---|---|---|---|---|
| OpenAI | GPT-4 | gpt-4 | 0.03 | 0.06 | 0 | 8192 |
| OpenAI | GPT-3.5 Turbo | gpt-3.5-turbo-0125 | 0.0005 | 0.0015 | 0 | 16385 |
| Mistral | Mixtral 8x7B | open-mixtral-8x7b | 0.0007 | 0.0007 | 0 | 32000 |
| AnyScale | Llama-3-8b-chat-hf | llama-3-8b-chat-hf | 0 | 0 | 0.00015 | 8000 |
| AWS | Llama 2 Chat (13B) | llama-2-13b-chat-hf | 0.00075 | 0.001 | 0 | 4000 |

More models available to add to the plot:

| Provider | Model Name | Model ID | Input Price ($/1K tokens) | Output Price ($/1K tokens) | Total Price ($/1K tokens) | Context Length |
|---|---|---|---|---|---|---|
| OpenAI | GPT-4 Turbo | gpt-4-turbo-2024-04-09 | 0.01 | 0.03 | 0 | 128000 |
| OpenAI | GPT-4 32K | gpt-4-32k | 0.06 | 0.12 | 0 | 32000 |
| OpenAI | GPT-3.5 Turbo Instruct | gpt-3.5-turbo-instruct | 0.0015 | 0.002 | 0 | 4096 |
| Mistral | Mistral 7B | open-mistral-7b | 0.00025 | 0.00025 | 0 | 32000 |
| Mistral | Mixtral 8x22B | open-mixtral-8x22b | 0.002 | 0.006 | 0 | 64000 |
| Mistral | Mistral Small | mistral-small | 0.002 | 0.006 | 0 | 32000 |
| Mistral | Mistral Large | mistral-large | 0.0027 | 0.0081 | 0 | 32000 |
| AnyScale | Mistral-7B-Instruct-v0.1 | mistral-7b-instruct-v0.1 | 0 | 0 | 0.00015 | 32000 |
| AnyScale | Llama-2-7b-chat-hf | llama-2-7b-chat-hf | 0 | 0 | 0.00015 | 4000 |
| AnyScale | Llama-2-13b-chat-hf | llama-2-13b-chat-hf | 0 | 0 | 0.00025 | 4000 |
| AnyScale | Mixtral-8x7B-Instruct-v0.1 | mixtral-8x7b-instruct-v0.1 | 0 | 0 | 0.0005 | 32000 |
| AnyScale | Mixtral-8x22B-Instruct-v0.1 | mixtral-8x22b-instruct-v0.1 | 0 | 0 | 0.0009 | 64000 |
| AnyScale | Llama-2-70b-chat-hf | llama-2-70b-chat-hf | 0 | 0 | 0.001 | 4000 |
| AnyScale | Llama-3-70b-chat-hf | llama-3-70b-chat-hf | 0 | 0 | 0.001 | 8000 |
| AWS | Llama 2 Chat (70B) | llama-2-70b-chat-hf | 0.00195 | 0.00256 | 0 | 4000 |
| AWS | Mistral 7B | open-mistral-7b | 0.00015 | 0.0002 | 0 | 32000 |
| AWS | Mixtral 8x7B | open-mixtral-8x7b | 0.00045 | 0.0007 | 0 | 32000 |
| AWS | Mistral Large | mistral-large | 0.008 | 0.024 | 0 | 32000 |
| AWS | Claude Instant | claude-instant | 0.0008 | 0.0024 | 0 | 100000 |
| AWS | Claude 2.0/2.1 | claude-2.1 | 0.008 | 0.024 | 0 | 200000 |
| AWS | Claude 3 Opus | claude-3-opus | 0.015 | 0.075 | 0 | 200000 |
| AWS | Claude 3 Sonnet | claude-3-sonnet | 0.003 | 0.015 | 0 | 200000 |
| AWS | Claude 3 Haiku | claude-3-haiku | 0.00025 | 0.00125 | 0 | 200000 |
| Moonshot AI | Moonshot-v1 8K | moonshot-v1-8k | 0 | 0 | 0.08688 | 8000 |
| Moonshot AI | Moonshot-v1 32K | moonshot-v1-32k | 0 | 0 | 0.17376 | 32000 |
| Moonshot AI | Moonshot-v1 128K | moonshot-v1-128k | 0 | 0 | 0.4344 | 128000 |
| ZhiPu AI | GLM-4 | glm-4 | 0 | 0 | 0.724 | 128000 |
| ZhiPu AI | GLM-4V | glm-4v | 0 | 0 | 0.724 | 2000 |
| ZhiPu AI | GLM-3-turbo | glm-3-turbo | 0 | 0 | 0.0362 | 128000 |
| Baidu | ERNIE-4.0-8K | ernie-4.0-8k | 0.8688 | 0.8688 | 0 | 8000 |
| Baidu | ERNIE-3.5-8K | ernie-3.5-8k | 0.08688 | 0.08688 | 0 | 8000 |
| Baidu | ERNIE-Lite-8K | ernie-lite-8k | 0.02172 | 0.04344 | 0 | 8000 |
| AliCloud | qwen-turbo | qwen-turbo | 0 | 0 | 0.05792 | 8000 |
| AliCloud | qwen-plus | qwen-plus | 0 | 0 | 0.1448 | 8000 |
| AliCloud | qwen-max | qwen-max | 0 | 0 | 0.8688 | 8000 |
| DeepSeek | DeepSeek-V2 | deepseek-v2 | 0.00014 | 0.00028 | 0 | 128000 |
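
The price columns above can be turned into a per-request cost estimate. The following is a minimal sketch, not the page's own calculation. It assumes that a `0` in a price column means "not applicable", that providers listing only a Total Price charge that single rate on input and output tokens combined, and that all other providers charge the input and output rates separately. The `Pricing` class and `estimate_cost` function are hypothetical names introduced for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Pricing:
    """Prices in $ per 1K tokens, copied from one table row."""
    input_per_1k: float
    output_per_1k: float
    total_per_1k: float


def estimate_cost(pricing: Pricing, input_tokens: int, output_tokens: int) -> float:
    """Estimate the cost in $ of one request.

    Assumption: a non-zero total price is a flat rate applied to all
    tokens (input + output); otherwise input and output are billed
    at their own rates.
    """
    if pricing.total_per_1k > 0:
        return (input_tokens + output_tokens) / 1000 * pricing.total_per_1k
    return (
        input_tokens / 1000 * pricing.input_per_1k
        + output_tokens / 1000 * pricing.output_per_1k
    )


# Rows taken from the first table.
gpt35_turbo = Pricing(input_per_1k=0.0005, output_per_1k=0.0015, total_per_1k=0.0)
llama3_8b = Pricing(input_per_1k=0.0, output_per_1k=0.0, total_per_1k=0.00015)

if __name__ == "__main__":
    # A request with 2,000 prompt tokens and 500 completion tokens.
    print(estimate_cost(gpt35_turbo, 2000, 500))
    print(estimate_cost(llama3_8b, 2000, 500))
```

Because the two pricing schemes are folded into one function, rows from either scheme can be compared on the same workload, which is roughly what plotting them on a shared axis requires.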