Skip to content

Latest commit

 

History

History
830 lines (775 loc) · 129 KB

README.md

File metadata and controls

830 lines (775 loc) · 129 KB

Tokencost

Clientside token counting + price estimation for LLM apps and AI agents.

Python Version

🐦 Twitter   •   📢 Discord   •   🖇️ AgentOps

TokenCost

License: MIT PyPI - Version X (formerly Twitter) Follow

Tokencost helps calculate the USD cost of using major Large Language Model (LLMs) APIs by calculating the estimated cost of prompts and completions.

Building AI agents? Check out AgentOps

Features

  • LLM Price Tracking Major LLM providers frequently add new models and update pricing. This repo helps track the latest price changes
  • Token counting Accurately count prompt tokens before sending OpenAI requests
  • Easy integration Get the cost of a prompt or completion with a single function

Example usage:

from tokencost import calculate_prompt_cost, calculate_completion_cost

model = "gpt-3.5-turbo"
prompt = [{ "role": "user", "content": "Hello world"}]
completion = "How may I assist you today?"

prompt_cost = calculate_prompt_cost(prompt, model)
completion_cost = calculate_completion_cost(completion, model)

print(f"{prompt_cost} + {completion_cost} = {prompt_cost + completion_cost}")
# 0.0000135 + 0.000014 = 0.0000275

Installation

Recommended: PyPI:

pip install tokencost

Usage

Cost estimates

Calculating the cost of prompts and completions from OpenAI requests

from openai import OpenAI

client = OpenAI()
model = "gpt-3.5-turbo"
prompt = [{ "role": "user", "content": "Say this is a test"}]

chat_completion = client.chat.completions.create(
    messages=prompt, model=model
)

completion = chat_completion.choices[0].message.content
# "This is a test."

prompt_cost = calculate_prompt_cost(prompt, model)
completion_cost = calculate_completion_cost(completion, model)
print(f"{prompt_cost} + {completion_cost} = {prompt_cost + completion_cost}")
# 0.0000180 + 0.000010 = 0.0000280

Calculating cost using string prompts instead of messages:

from tokencost import calculate_prompt_cost

prompt_string = "Hello world" 
response = "How may I assist you today?"
model= "gpt-3.5-turbo"

prompt_cost = calculate_prompt_cost(prompt_string, model)
print(f"Cost: ${prompt_cost}")
# Cost: $3e-06

Counting tokens

from tokencost import count_message_tokens, count_string_tokens

message_prompt = [{ "role": "user", "content": "Hello world"}]
# Counting tokens in prompts formatted as message lists
print(count_message_tokens(message_prompt, model="gpt-3.5-turbo"))
# 9

# Alternatively, counting tokens in string prompts
print(count_string_tokens(prompt="Hello world", model="gpt-3.5-turbo"))
# 2

How tokens are counted

Under the hood, strings and ChatML messages are tokenized using Tiktoken, OpenAI's official tokenizer. Tiktoken splits text into tokens (which can be parts of words or individual characters) and handles both raw strings and message formats with additional tokens for message formatting and roles.

For Anthropic models above version 3 (i.e. Sonnet 3.5, Haiku 3.5, and Opus 3), we use the Anthropic beta token counting API to ensure accurate token counts. For older Claude models, we approximate using Tiktoken with the cl100k_base encoding.

Cost table

Units denominated in USD. All prices can be located in model_prices.json.

Model Name Prompt Cost (USD) per 1M tokens Completion Cost (USD) per 1M tokens Max Prompt Tokens Max Output Tokens
gpt-4 $30.00 $60.00 8192 4096
gpt-4o $ 5.00 $15.00 128,000 4096
gpt-4o-audio-preview $2.5 $10.00 128,000 16384
gpt-4o-audio-preview-2024-10-01 $2.5 $10.00 128,000 16384
gpt-4o-mini $0.15 $0.6 128,000 16384
gpt-4o-mini-2024-07-18 $0.15 $0.6 128,000 16384
o1-mini $ 3.00 $12.00 128,000 65536
o1-mini-2024-09-12 $ 3.00 $12.00 128,000 65536
o1-preview $15.00 $60.00 128,000 32768
o1-preview-2024-09-12 $15.00 $60.00 128,000 32768
chatgpt-4o-latest $ 5.00 $15.00 128,000 4096
gpt-4o-2024-05-13 $ 5.00 $15.00 128,000 4096
gpt-4o-2024-08-06 $2.5 $10.00 128,000 16384
gpt-4-turbo-preview $10.00 $30.00 128,000 4096
gpt-4-0314 $30.00 $60.00 8,192 4096
gpt-4-0613 $30.00 $60.00 8,192 4096
gpt-4-32k $60.00 $120.00 32,768 4096
gpt-4-32k-0314 $60.00 $120.00 32,768 4096
gpt-4-32k-0613 $60.00 $120.00 32,768 4096
gpt-4-turbo $10.00 $30.00 128,000 4096
gpt-4-turbo-2024-04-09 $10.00 $30.00 128,000 4096
gpt-4-1106-preview $10.00 $30.00 128,000 4096
gpt-4-0125-preview $10.00 $30.00 128,000 4096
gpt-4-vision-preview $10.00 $30.00 128,000 4096
gpt-4-1106-vision-preview $10.00 $30.00 128,000 4096
gpt-3.5-turbo $1.5 $ 2.00 16,385 4096
gpt-3.5-turbo-0301 $1.5 $ 2.00 4,097 4096
gpt-3.5-turbo-0613 $1.5 $ 2.00 4,097 4096
gpt-3.5-turbo-1106 $ 1.00 $ 2.00 16,385 4096
gpt-3.5-turbo-0125 $0.5 $1.5 16,385 4096
gpt-3.5-turbo-16k $ 3.00 $ 4.00 16,385 4096
gpt-3.5-turbo-16k-0613 $ 3.00 $ 4.00 16,385 4096
ft:gpt-3.5-turbo $ 3.00 $ 6.00 16,385 4096
ft:gpt-3.5-turbo-0125 $ 3.00 $ 6.00 16,385 4096
ft:gpt-3.5-turbo-1106 $ 3.00 $ 6.00 16,385 4096
ft:gpt-3.5-turbo-0613 $ 3.00 $ 6.00 4,096 4096
ft:gpt-4-0613 $30.00 $60.00 8,192 4096
ft:gpt-4o-2024-08-06 $3.75 $15.00 128,000 16384
ft:gpt-4o-mini-2024-07-18 $0.3 $1.2 128,000 16384
ft:davinci-002 $ 2.00 $ 2.00 16,384 4096
ft:babbage-002 $0.4 $0.4 16,384 4096
text-embedding-3-large $0.13 $ 0.00 8,191 nan
text-embedding-3-small $0.02 $ 0.00 8,191 nan
text-embedding-ada-002 $0.1 $ 0.00 8,191 nan
text-embedding-ada-002-v2 $0.1 $ 0.00 8,191 nan
text-moderation-stable $ 0.00 $ 0.00 32,768 0
text-moderation-007 $ 0.00 $ 0.00 32,768 0
text-moderation-latest $ 0.00 $ 0.00 32,768 0
256-x-256/dall-e-2 -- -- nan nan
512-x-512/dall-e-2 -- -- nan nan
1024-x-1024/dall-e-2 -- -- nan nan
hd/1024-x-1792/dall-e-3 -- -- nan nan
hd/1792-x-1024/dall-e-3 -- -- nan nan
hd/1024-x-1024/dall-e-3 -- -- nan nan
standard/1024-x-1792/dall-e-3 -- -- nan nan
standard/1792-x-1024/dall-e-3 -- -- nan nan
standard/1024-x-1024/dall-e-3 -- -- nan nan
whisper-1 -- -- nan nan
tts-1 -- -- nan nan
tts-1-hd -- -- nan nan
azure/tts-1 -- -- nan nan
azure/tts-1-hd -- -- nan nan
azure/whisper-1 -- -- nan nan
azure/o1-mini $ 3.00 $12.00 128,000 65536
azure/o1-mini-2024-09-12 $ 3.00 $12.00 128,000 65536
azure/o1-preview $15.00 $60.00 128,000 32768
azure/o1-preview-2024-09-12 $15.00 $60.00 128,000 32768
azure/gpt-4o $ 5.00 $15.00 128,000 4096
azure/gpt-4o-2024-08-06 $2.75 $11.00 128,000 16384
azure/gpt-4o-2024-05-13 $ 5.00 $15.00 128,000 4096
azure/global-standard/gpt-4o-2024-08-06 $2.5 $10.00 128,000 16384
azure/global-standard/gpt-4o-mini $0.15 $0.6 128,000 16384
azure/gpt-4o-mini $0.165 $0.66 128,000 16384
azure/gpt-4-turbo-2024-04-09 $10.00 $30.00 128,000 4096
azure/gpt-4-0125-preview $10.00 $30.00 128,000 4096
azure/gpt-4-1106-preview $10.00 $30.00 128,000 4096
azure/gpt-4-0613 $30.00 $60.00 8,192 4096
azure/gpt-4-32k-0613 $60.00 $120.00 32,768 4096
azure/gpt-4-32k $60.00 $120.00 32,768 4096
azure/gpt-4 $30.00 $60.00 8,192 4096
azure/gpt-4-turbo $10.00 $30.00 128,000 4096
azure/gpt-4-turbo-vision-preview $10.00 $30.00 128,000 4096
azure/gpt-35-turbo-16k-0613 $ 3.00 $ 4.00 16,385 4096
azure/gpt-35-turbo-1106 $ 1.00 $ 2.00 16,384 4096
azure/gpt-35-turbo-0613 $1.5 $ 2.00 4,097 4096
azure/gpt-35-turbo-0301 $0.2 $ 2.00 4,097 4096
azure/gpt-35-turbo-0125 $0.5 $1.5 16,384 4096
azure/gpt-35-turbo-16k $ 3.00 $ 4.00 16,385 4096
azure/gpt-35-turbo $0.5 $1.5 4,097 4096
azure/gpt-3.5-turbo-instruct-0914 $1.5 $ 2.00 4,097 nan
azure/gpt-35-turbo-instruct $1.5 $ 2.00 4,097 nan
azure/gpt-35-turbo-instruct-0914 $1.5 $ 2.00 4,097 nan
azure/mistral-large-latest $ 8.00 $24.00 32,000 nan
azure/mistral-large-2402 $ 8.00 $24.00 32,000 nan
azure/command-r-plus $ 3.00 $15.00 128,000 4096
azure/ada $0.1 $ 0.00 8,191 nan
azure/text-embedding-ada-002 $0.1 $ 0.00 8,191 nan
azure/text-embedding-3-large $0.13 $ 0.00 8,191 nan
azure/text-embedding-3-small $0.02 $ 0.00 8,191 nan
azure/standard/1024-x-1024/dall-e-3 -- $ 0.00 nan nan
azure/hd/1024-x-1024/dall-e-3 -- $ 0.00 nan nan
azure/standard/1024-x-1792/dall-e-3 -- $ 0.00 nan nan
azure/standard/1792-x-1024/dall-e-3 -- $ 0.00 nan nan
azure/hd/1024-x-1792/dall-e-3 -- $ 0.00 nan nan
azure/hd/1792-x-1024/dall-e-3 -- $ 0.00 nan nan
azure/standard/1024-x-1024/dall-e-2 -- $ 0.00 nan nan
azure_ai/jamba-instruct $0.5 $0.7 70,000 4096
azure_ai/mistral-large $ 4.00 $12.00 32,000 8191
azure_ai/mistral-small $ 1.00 $ 3.00 32,000 8191
azure_ai/Meta-Llama-3-70B-Instruct $1.1 $0.37 8,192 8192
azure_ai/Meta-Llama-3.1-8B-Instruct $0.3 $0.61 128,000 128000
azure_ai/Meta-Llama-3.1-70B-Instruct $2.68 $3.54 128,000 128000
azure_ai/Meta-Llama-3.1-405B-Instruct $5.33 $16.00 128,000 128000
azure_ai/cohere-rerank-v3-multilingual $ 0.00 $ 0.00 4,096 4096
azure_ai/cohere-rerank-v3-english $ 0.00 $ 0.00 4,096 4096
azure_ai/Cohere-embed-v3-english $0.1 $ 0.00 512 nan
azure_ai/Cohere-embed-v3-multilingual $0.1 $ 0.00 512 nan
babbage-002 $0.4 $0.4 16,384 4096
davinci-002 $ 2.00 $ 2.00 16,384 4096
gpt-3.5-turbo-instruct $1.5 $ 2.00 8,192 4096
gpt-3.5-turbo-instruct-0914 $1.5 $ 2.00 8,192 4097
claude-instant-1 $1.63 $5.51 100,000 8191
mistral/mistral-tiny $0.25 $0.25 32,000 8191
mistral/mistral-small $ 1.00 $ 3.00 32,000 8191
mistral/mistral-small-latest $ 1.00 $ 3.00 32,000 8191
mistral/mistral-medium $2.7 $8.1 32,000 8191
mistral/mistral-medium-latest $2.7 $8.1 32,000 8191
mistral/mistral-medium-2312 $2.7 $8.1 32,000 8191
mistral/mistral-large-latest $ 3.00 $ 9.00 128,000 128000
mistral/mistral-large-2402 $ 4.00 $12.00 32,000 8191
mistral/mistral-large-2407 $ 3.00 $ 9.00 128,000 128000
mistral/pixtral-12b-2409 $0.15 $0.15 128,000 128000
mistral/open-mistral-7b $0.25 $0.25 32,000 8191
mistral/open-mixtral-8x7b $0.7 $0.7 32,000 8191
mistral/open-mixtral-8x22b $ 2.00 $ 6.00 64,000 8191
mistral/codestral-latest $ 1.00 $ 3.00 32,000 8191
mistral/codestral-2405 $ 1.00 $ 3.00 32,000 8191
mistral/open-mistral-nemo $0.3 $0.3 128,000 128000
mistral/open-mistral-nemo-2407 $0.3 $0.3 128,000 128000
mistral/open-codestral-mamba $0.25 $0.25 256,000 256000
mistral/codestral-mamba-latest $0.25 $0.25 256,000 256000
mistral/mistral-embed $0.1 -- 8,192 nan
deepseek-chat $0.14 $0.28 128,000 4096
codestral/codestral-latest $ 0.00 $ 0.00 32,000 8191
codestral/codestral-2405 $ 0.00 $ 0.00 32,000 8191
text-completion-codestral/codestral-latest $ 0.00 $ 0.00 32,000 8191
text-completion-codestral/codestral-2405 $ 0.00 $ 0.00 32,000 8191
deepseek-coder $0.14 $0.28 128,000 4096
groq/llama2-70b-4096 $0.7 $0.8 4,096 4096
groq/llama3-8b-8192 $0.05 $0.08 8,192 8192
groq/llama3-70b-8192 $0.59 $0.79 8,192 8192
groq/llama-3.1-8b-instant $0.05 $0.08 8,192 8192
groq/llama-3.1-70b-versatile $0.59 $0.79 8,192 8192
groq/llama-3.1-405b-reasoning $0.59 $0.79 8,192 8192
groq/mixtral-8x7b-32768 $0.24 $0.24 32,768 32768
groq/gemma-7b-it $0.07 $0.07 8,192 8192
groq/gemma2-9b-it $0.2 $0.2 8,192 8192
groq/llama3-groq-70b-8192-tool-use-preview $0.89 $0.89 8,192 8192
groq/llama3-groq-8b-8192-tool-use-preview $0.19 $0.19 8,192 8192
cerebras/llama3.1-8b $0.1 $0.1 128,000 128000
cerebras/llama3.1-70b $0.6 $0.6 128,000 128000
friendliai/mixtral-8x7b-instruct-v0-1 $0.4 $0.4 32,768 32768
friendliai/meta-llama-3-8b-instruct $0.1 $0.1 8,192 8192
friendliai/meta-llama-3-70b-instruct $0.8 $0.8 8,192 8192
claude-instant-1.2 $0.163 $0.551 100,000 8191
claude-2 $ 8.00 $24.00 100,000 8191
claude-2.1 $ 8.00 $24.00 200,000 8191
claude-3-haiku-20240307 $0.25 $1.25 200,000 4096
claude-3-haiku-latest $0.25 $1.25 200,000 4096
claude-3-opus-20240229 $15.00 $75.00 200,000 4096
claude-3-opus-latest $15.00 $75.00 200,000 4096
claude-3-sonnet-20240229 $ 3.00 $15.00 200,000 4096
claude-3-5-sonnet-20240620 $ 3.00 $15.00 200,000 8192
claude-3-5-sonnet-20241022 $ 3.00 $15.00 200,000 8192
claude-3-5-sonnet-latest $ 3.00 $15.00 200,000 8192
text-bison -- -- 8,192 2048
text-bison@001 -- -- 8,192 1024
text-bison@002 -- -- 8,192 1024
text-bison32k $0.125 $0.125 8,192 1024
text-bison32k@002 $0.125 $0.125 8,192 1024
text-unicorn $10.00 $28.00 8,192 1024
text-unicorn@001 $10.00 $28.00 8,192 1024
chat-bison $0.125 $0.125 8,192 4096
chat-bison@001 $0.125 $0.125 8,192 4096
chat-bison@002 $0.125 $0.125 8,192 4096
chat-bison-32k $0.125 $0.125 32,000 8192
chat-bison-32k@002 $0.125 $0.125 32,000 8192
code-bison $0.125 $0.125 6,144 1024
code-bison@001 $0.125 $0.125 6,144 1024
code-bison@002 $0.125 $0.125 6,144 1024
code-bison32k $0.125 $0.125 6,144 1024
code-bison-32k@002 $0.125 $0.125 6,144 1024
code-gecko@001 $0.125 $0.125 2,048 64
code-gecko@002 $0.125 $0.125 2,048 64
code-gecko $0.125 $0.125 2,048 64
code-gecko-latest $0.125 $0.125 2,048 64
codechat-bison@latest $0.125 $0.125 6,144 1024
codechat-bison $0.125 $0.125 6,144 1024
codechat-bison@001 $0.125 $0.125 6,144 1024
codechat-bison@002 $0.125 $0.125 6,144 1024
codechat-bison-32k $0.125 $0.125 32,000 8192
codechat-bison-32k@002 $0.125 $0.125 32,000 8192
gemini-pro $0.5 $1.5 32,760 8192
gemini-1.0-pro $0.5 $1.5 32,760 8192
gemini-1.0-pro-001 $0.5 $1.5 32,760 8192
gemini-1.0-ultra $0.5 $1.5 8,192 2048
gemini-1.0-ultra-001 $0.5 $1.5 8,192 2048
gemini-1.0-pro-002 $0.5 $1.5 32,760 8192
gemini-1.5-pro $0.078125 $0.3125 2,097,152 8192
gemini-1.5-pro-002 $0.078125 $0.3125 2,097,152 8192
gemini-1.5-pro-001 $0.078125 $0.3125 1,000,000 8192
gemini-1.5-pro-preview-0514 $0.078125 $0.3125 1,000,000 8192
gemini-1.5-pro-preview-0215 $0.078125 $0.3125 1,000,000 8192
gemini-1.5-pro-preview-0409 $0.078125 $0.3125 1,000,000 8192
gemini-1.5-flash $0.004688 $0.0046875 1,000,000 8192
gemini-1.5-flash-exp-0827 $0.004688 $0.0046875 1,000,000 8192
gemini-1.5-flash-002 $0.004688 $0.0046875 1,048,576 8192
gemini-1.5-flash-001 $0.004688 $0.0046875 1,000,000 8192
gemini-1.5-flash-preview-0514 $0.004688 $0.0046875 1,000,000 8192
gemini-pro-experimental $ 0.00 $ 0.00 1,000,000 8192
gemini-flash-experimental $ 0.00 $ 0.00 1,000,000 8192
gemini-pro-vision $0.25 $0.5 16,384 2048
gemini-1.0-pro-vision $0.25 $0.5 16,384 2048
gemini-1.0-pro-vision-001 $0.25 $0.5 16,384 2048
medlm-medium -- -- 32,768 8192
medlm-large -- -- 8,192 1024
vertex_ai/claude-3-sonnet@20240229 $ 3.00 $15.00 200,000 4096
vertex_ai/claude-3-5-sonnet@20240620 $ 3.00 $15.00 200,000 8192
vertex_ai/claude-3-5-sonnet-v2@20241022 $ 3.00 $15.00 200,000 8192
vertex_ai/claude-3-haiku@20240307 $0.25 $1.25 200,000 4096
vertex_ai/claude-3-opus@20240229 $15.00 $75.00 200,000 4096
vertex_ai/meta/llama3-405b-instruct-maas $ 0.00 $ 0.00 32,000 32000
vertex_ai/meta/llama3-70b-instruct-maas $ 0.00 $ 0.00 32,000 32000
vertex_ai/meta/llama3-8b-instruct-maas $ 0.00 $ 0.00 32,000 32000
vertex_ai/meta/llama-3.2-90b-vision-instruct-maas $ 0.00 $ 0.00 128,000 8192
vertex_ai/mistral-large@latest $ 3.00 $ 9.00 128,000 8191
vertex_ai/mistral-large@2407 $ 3.00 $ 9.00 128,000 8191
vertex_ai/mistral-nemo@latest $ 3.00 $ 3.00 128,000 128000
vertex_ai/jamba-1.5-mini@001 $0.2 $0.4 256,000 256000
vertex_ai/jamba-1.5-large@001 $ 2.00 $ 8.00 256,000 256000
vertex_ai/jamba-1.5 $0.2 $0.4 256,000 256000
vertex_ai/jamba-1.5-mini $0.2 $0.4 256,000 256000
vertex_ai/jamba-1.5-large $ 2.00 $ 8.00 256,000 256000
vertex_ai/mistral-nemo@2407 $ 3.00 $ 3.00 128,000 128000
vertex_ai/codestral@latest $ 1.00 $ 3.00 128,000 128000
vertex_ai/codestral@2405 $ 1.00 $ 3.00 128,000 128000
vertex_ai/imagegeneration@006 -- -- nan nan
vertex_ai/imagen-3.0-generate-001 -- -- nan nan
vertex_ai/imagen-3.0-fast-generate-001 -- -- nan nan
text-embedding-004 $0.00625 $ 0.00 3,072 nan
text-multilingual-embedding-002 $0.00625 $ 0.00 2,048 nan
textembedding-gecko $0.00625 $ 0.00 3,072 nan
textembedding-gecko-multilingual $0.00625 $ 0.00 3,072 nan
textembedding-gecko-multilingual@001 $0.00625 $ 0.00 3,072 nan
textembedding-gecko@001 $0.00625 $ 0.00 3,072 nan
textembedding-gecko@003 $0.00625 $ 0.00 3,072 nan
text-embedding-preview-0409 $0.00625 $ 0.00 3,072 nan
text-multilingual-embedding-preview-0409 $0.00625 $ 0.00 3,072 nan
palm/chat-bison $0.125 $0.125 8,192 4096
palm/chat-bison-001 $0.125 $0.125 8,192 4096
palm/text-bison $0.125 $0.125 8,192 1024
palm/text-bison-001 $0.125 $0.125 8,192 1024
palm/text-bison-safety-off $0.125 $0.125 8,192 1024
palm/text-bison-safety-recitation-off $0.125 $0.125 8,192 1024
gemini/gemini-1.5-flash-002 $0.075 $0.3 1,048,576 8192
gemini/gemini-1.5-flash-001 $0.075 $0.3 1,048,576 8192
gemini/gemini-1.5-flash $0.075 $0.3 1,048,576 8192
gemini/gemini-1.5-flash-latest $0.075 $0.3 1,048,576 8192
gemini/gemini-1.5-flash-8b-exp-0924 $ 0.00 $ 0.00 1,048,576 8192
gemini/gemini-1.5-flash-exp-0827 $ 0.00 $ 0.00 1,048,576 8192
gemini/gemini-1.5-flash-8b-exp-0827 $ 0.00 $ 0.00 1,000,000 8192
gemini/gemini-pro $0.35 $1.05 32,760 8192
gemini/gemini-1.5-pro $3.5 $10.5 2,097,152 8192
gemini/gemini-1.5-pro-002 $3.5 $10.5 2,097,152 8192
gemini/gemini-1.5-pro-001 $3.5 $10.5 2,097,152 8192
gemini/gemini-1.5-pro-exp-0801 $3.5 $10.5 2,097,152 8192
gemini/gemini-1.5-pro-exp-0827 $ 0.00 $ 0.00 2,097,152 8192
gemini/gemini-1.5-pro-latest $3.5 $1.05 1,048,576 8192
gemini/gemini-pro-vision $0.35 $1.05 30,720 2048
gemini/gemini-gemma-2-27b-it $0.35 $1.05 nan 8192
gemini/gemini-gemma-2-9b-it $0.35 $1.05 nan 8192
command-r $0.15 $0.6 128,000 4096
command-r-08-2024 $0.15 $0.6 128,000 4096
command-light $0.3 $0.6 4,096 4096
command-r-plus $2.5 $10.00 128,000 4096
command-r-plus-08-2024 $2.5 $10.00 128,000 4096
command-nightly $ 1.00 $ 2.00 4,096 4096
command $ 1.00 $ 2.00 4,096 4096
rerank-english-v3.0 $ 0.00 $ 0.00 4,096 4096
rerank-multilingual-v3.0 $ 0.00 $ 0.00 4,096 4096
rerank-english-v2.0 $ 0.00 $ 0.00 4,096 4096
rerank-multilingual-v2.0 $ 0.00 $ 0.00 4,096 4096
embed-english-v3.0 $0.1 $ 0.00 1,024 nan
embed-english-light-v3.0 $0.1 $ 0.00 1,024 nan
embed-multilingual-v3.0 $0.1 $ 0.00 1,024 nan
embed-english-v2.0 $0.1 $ 0.00 4,096 nan
embed-english-light-v2.0 $0.1 $ 0.00 1,024 nan
embed-multilingual-v2.0 $0.1 $ 0.00 768 nan
replicate/meta/llama-2-13b $0.1 $0.5 4,096 4096
replicate/meta/llama-2-13b-chat $0.1 $0.5 4,096 4096
replicate/meta/llama-2-70b $0.65 $2.75 4,096 4096
replicate/meta/llama-2-70b-chat $0.65 $2.75 4,096 4096
replicate/meta/llama-2-7b $0.05 $0.25 4,096 4096
replicate/meta/llama-2-7b-chat $0.05 $0.25 4,096 4096
replicate/meta/llama-3-70b $0.65 $2.75 8,192 8192
replicate/meta/llama-3-70b-instruct $0.65 $2.75 8,192 8192
replicate/meta/llama-3-8b $0.05 $0.25 8,086 8086
replicate/meta/llama-3-8b-instruct $0.05 $0.25 8,086 8086
replicate/mistralai/mistral-7b-v0.1 $0.05 $0.25 4,096 4096
replicate/mistralai/mistral-7b-instruct-v0.2 $0.05 $0.25 4,096 4096
replicate/mistralai/mixtral-8x7b-instruct-v0.1 $0.3 $ 1.00 4,096 4096
openrouter/deepseek/deepseek-coder $0.14 $0.28 32,000 4096
openrouter/microsoft/wizardlm-2-8x22b:nitro $ 1.00 $ 1.00 nan nan
openrouter/google/gemini-pro-1.5 $2.5 $7.5 1,000,000 8192
openrouter/mistralai/mixtral-8x22b-instruct $0.65 $0.65 nan nan
openrouter/cohere/command-r-plus $ 3.00 $15.00 nan nan
openrouter/databricks/dbrx-instruct $0.6 $0.6 nan nan
openrouter/anthropic/claude-3-haiku $0.25 $1.25 nan nan
openrouter/anthropic/claude-3-haiku-20240307 $0.25 $1.25 200,000 4096
anthropic/claude-3-5-sonnet-20241022 $ 3.00 $15.00 200,000 8192
anthropic/claude-3-5-sonnet-latest $ 3.00 $15.00 200,000 8192
openrouter/anthropic/claude-3.5-sonnet $ 3.00 $15.00 200,000 8192
openrouter/anthropic/claude-3.5-sonnet:beta $ 3.00 $15.00 200,000 8192
openrouter/anthropic/claude-3-sonnet $ 3.00 $15.00 nan nan
openrouter/mistralai/mistral-large $ 8.00 $24.00 nan nan
openrouter/cognitivecomputations/dolphin-mixtral-8x7b $0.5 $0.5 nan nan
openrouter/google/gemini-pro-vision $0.125 $0.375 nan nan
openrouter/fireworks/firellava-13b $0.2 $0.2 nan nan
openrouter/meta-llama/llama-3-8b-instruct:free $ 0.00 $ 0.00 nan nan
openrouter/meta-llama/llama-3-8b-instruct:extended $0.225 $2.25 nan nan
openrouter/meta-llama/llama-3-70b-instruct:nitro $0.9 $0.9 nan nan
openrouter/meta-llama/llama-3-70b-instruct $0.59 $0.79 nan nan
openrouter/openai/o1-mini $ 3.00 $12.00 128,000 65536
openrouter/openai/o1-mini-2024-09-12 $ 3.00 $12.00 128,000 65536
openrouter/openai/o1-preview $15.00 $60.00 128,000 32768
openrouter/openai/o1-preview-2024-09-12 $15.00 $60.00 128,000 32768
openrouter/openai/gpt-4o $ 5.00 $15.00 128,000 4096
openrouter/openai/gpt-4o-2024-05-13 $ 5.00 $15.00 128,000 4096
openrouter/openai/gpt-4-vision-preview $10.00 $30.00 nan nan
openrouter/openai/gpt-3.5-turbo $1.5 $ 2.00 nan nan
openrouter/openai/gpt-3.5-turbo-16k $ 3.00 $ 4.00 nan nan
openrouter/openai/gpt-4 $30.00 $60.00 nan nan
openrouter/anthropic/claude-instant-v1 $1.63 $5.51 nan 8191
openrouter/anthropic/claude-2 $11.02 $32.68 nan 8191
openrouter/anthropic/claude-3-opus $15.00 $75.00 200,000 4096
openrouter/google/palm-2-chat-bison $0.5 $0.5 nan nan
openrouter/google/palm-2-codechat-bison $0.5 $0.5 nan nan
openrouter/meta-llama/llama-2-13b-chat $0.2 $0.2 nan nan
openrouter/meta-llama/llama-2-70b-chat $1.5 $1.5 nan nan
openrouter/meta-llama/codellama-34b-instruct $0.5 $0.5 nan nan
openrouter/nousresearch/nous-hermes-llama2-13b $0.2 $0.2 nan nan
openrouter/mancer/weaver $5.625 $5.625 nan nan
openrouter/gryphe/mythomax-l2-13b $1.875 $1.875 nan nan
openrouter/jondurbin/airoboros-l2-70b-2.1 $13.875 $13.875 nan nan
openrouter/undi95/remm-slerp-l2-13b $1.875 $1.875 nan nan
openrouter/pygmalionai/mythalion-13b $1.875 $1.875 nan nan
openrouter/mistralai/mistral-7b-instruct $0.13 $0.13 nan nan
openrouter/mistralai/mistral-7b-instruct:free $ 0.00 $ 0.00 nan nan
j2-ultra $15.00 $15.00 8,192 8192
jamba-1.5-mini@001 $0.2 $0.4 256,000 256000
jamba-1.5-large@001 $ 2.00 $ 8.00 256,000 256000
jamba-1.5 $0.2 $0.4 256,000 256000
jamba-1.5-mini $0.2 $0.4 256,000 256000
jamba-1.5-large $ 2.00 $ 8.00 256,000 256000
j2-mid $10.00 $10.00 8,192 8192
j2-light $ 3.00 $ 3.00 8,192 8192
dolphin $0.5 $0.5 16,384 16384
chatdolphin $0.5 $0.5 16,384 16384
luminous-base $30.00 $33.00 nan nan
luminous-base-control $37.5 $41.25 nan nan
luminous-extended $45.00 $49.5 nan nan
luminous-extended-control $56.25 $61.875 nan nan
luminous-supreme $175.00 $192.5 nan nan
luminous-supreme-control $218.75 $240.625 nan nan
ai21.j2-mid-v1 $12.5 $12.5 8,191 8191
ai21.j2-ultra-v1 $18.8 $18.8 8,191 8191
ai21.jamba-instruct-v1:0 $0.5 $0.7 70,000 4096
amazon.titan-text-lite-v1 $0.3 $0.4 42,000 4000
amazon.titan-text-express-v1 $1.3 $1.7 42,000 8000
amazon.titan-text-premier-v1:0 $0.5 $1.5 42,000 32000
amazon.titan-embed-text-v1 $0.1 $ 0.00 8,192 nan
amazon.titan-embed-text-v2:0 $0.2 $ 0.00 8,192 nan
mistral.mistral-7b-instruct-v0:2 $0.15 $0.2 32,000 8191
mistral.mixtral-8x7b-instruct-v0:1 $0.45 $0.7 32,000 8191
mistral.mistral-large-2402-v1:0 $ 8.00 $24.00 32,000 8191
mistral.mistral-large-2407-v1:0 $ 3.00 $ 9.00 128,000 8191
mistral.mistral-small-2402-v1:0 $ 1.00 $ 3.00 32,000 8191
bedrock/us-west-2/mistral.mixtral-8x7b-instruct-v0:1 $0.45 $0.7 32,000 8191
bedrock/us-east-1/mistral.mixtral-8x7b-instruct-v0:1 $0.45 $0.7 32,000 8191
bedrock/eu-west-3/mistral.mixtral-8x7b-instruct-v0:1 $0.59 $0.91 32,000 8191
bedrock/us-west-2/mistral.mistral-7b-instruct-v0:2 $0.15 $0.2 32,000 8191
bedrock/us-east-1/mistral.mistral-7b-instruct-v0:2 $0.15 $0.2 32,000 8191
bedrock/eu-west-3/mistral.mistral-7b-instruct-v0:2 $0.2 $0.26 32,000 8191
bedrock/us-east-1/mistral.mistral-large-2402-v1:0 $ 8.00 $24.00 32,000 8191
bedrock/us-west-2/mistral.mistral-large-2402-v1:0 $ 8.00 $24.00 32,000 8191
bedrock/eu-west-3/mistral.mistral-large-2402-v1:0 $10.4 $31.2 32,000 8191
anthropic.claude-3-sonnet-20240229-v1:0 $ 3.00 $15.00 200,000 4096
anthropic.claude-3-5-sonnet-20240620-v1:0 $ 3.00 $15.00 200,000 4096
anthropic.claude-3-5-sonnet-20241022-v2:0 $ 3.00 $15.00 200,000 4096
anthropic.claude-3-5-sonnet-latest-v2:0 $ 3.00 $15.00 200,000 4096
anthropic.claude-3-haiku-20240307-v1:0 $0.25 $1.25 200,000 4096
anthropic.claude-3-opus-20240229-v1:0 $15.00 $75.00 200,000 4096
us.anthropic.claude-3-sonnet-20240229-v1:0 $ 3.00 $15.00 200,000 4096
us.anthropic.claude-3-5-sonnet-20240620-v1:0 $ 3.00 $15.00 200,000 4096
us.anthropic.claude-3-5-sonnet-20241022-v2:0 $ 3.00 $15.00 200,000 4096
us.anthropic.claude-3-haiku-20240307-v1:0 $0.25 $1.25 200,000 4096
us.anthropic.claude-3-opus-20240229-v1:0 $15.00 $75.00 200,000 4096
eu.anthropic.claude-3-sonnet-20240229-v1:0 $ 3.00 $15.00 200,000 4096
eu.anthropic.claude-3-5-sonnet-20240620-v1:0 $ 3.00 $15.00 200,000 4096
eu.anthropic.claude-3-5-sonnet-20241022-v2:0 $ 3.00 $15.00 200,000 4096
eu.anthropic.claude-3-haiku-20240307-v1:0 $0.25 $1.25 200,000 4096
eu.anthropic.claude-3-opus-20240229-v1:0 $15.00 $75.00 200,000 4096
anthropic.claude-v1 $ 8.00 $24.00 100,000 8191
bedrock/us-east-1/anthropic.claude-v1 $ 8.00 $24.00 100,000 8191
bedrock/us-west-2/anthropic.claude-v1 $ 8.00 $24.00 100,000 8191
bedrock/ap-northeast-1/anthropic.claude-v1 $ 8.00 $24.00 100,000 8191
bedrock/ap-northeast-1/1-month-commitment/anthropic.claude-v1 -- -- 100,000 8191
bedrock/ap-northeast-1/6-month-commitment/anthropic.claude-v1 -- -- 100,000 8191
bedrock/eu-central-1/anthropic.claude-v1 $ 8.00 $24.00 100,000 8191
bedrock/eu-central-1/1-month-commitment/anthropic.claude-v1 -- -- 100,000 8191
bedrock/eu-central-1/6-month-commitment/anthropic.claude-v1 -- -- 100,000 8191
bedrock/us-east-1/1-month-commitment/anthropic.claude-v1 -- -- 100,000 8191
bedrock/us-east-1/6-month-commitment/anthropic.claude-v1 -- -- 100,000 8191
bedrock/us-west-2/1-month-commitment/anthropic.claude-v1 -- -- 100,000 8191
bedrock/us-west-2/6-month-commitment/anthropic.claude-v1 -- -- 100,000 8191
anthropic.claude-v2 $ 8.00 $24.00 100,000 8191
bedrock/us-east-1/anthropic.claude-v2 $ 8.00 $24.00 100,000 8191
bedrock/us-west-2/anthropic.claude-v2 $ 8.00 $24.00 100,000 8191
bedrock/ap-northeast-1/anthropic.claude-v2 $ 8.00 $24.00 100,000 8191
bedrock/ap-northeast-1/1-month-commitment/anthropic.claude-v2 -- -- 100,000 8191
bedrock/ap-northeast-1/6-month-commitment/anthropic.claude-v2 -- -- 100,000 8191
bedrock/eu-central-1/anthropic.claude-v2 $ 8.00 $24.00 100,000 8191
bedrock/eu-central-1/1-month-commitment/anthropic.claude-v2 -- -- 100,000 8191
bedrock/eu-central-1/6-month-commitment/anthropic.claude-v2 -- -- 100,000 8191
bedrock/us-east-1/1-month-commitment/anthropic.claude-v2 -- -- 100,000 8191
bedrock/us-east-1/6-month-commitment/anthropic.claude-v2 -- -- 100,000 8191
bedrock/us-west-2/1-month-commitment/anthropic.claude-v2 -- -- 100,000 8191
bedrock/us-west-2/6-month-commitment/anthropic.claude-v2 -- -- 100,000 8191
anthropic.claude-v2:1 $ 8.00 $24.00 100,000 8191
bedrock/us-east-1/anthropic.claude-v2:1 $ 8.00 $24.00 100,000 8191
bedrock/us-west-2/anthropic.claude-v2:1 $ 8.00 $24.00 100,000 8191
bedrock/ap-northeast-1/anthropic.claude-v2:1 $ 8.00 $24.00 100,000 8191
bedrock/ap-northeast-1/1-month-commitment/anthropic.claude-v2:1 -- -- 100,000 8191
bedrock/ap-northeast-1/6-month-commitment/anthropic.claude-v2:1 -- -- 100,000 8191
bedrock/eu-central-1/anthropic.claude-v2:1 $ 8.00 $24.00 100,000 8191
bedrock/eu-central-1/1-month-commitment/anthropic.claude-v2:1 -- -- 100,000 8191
bedrock/eu-central-1/6-month-commitment/anthropic.claude-v2:1 -- -- 100,000 8191
bedrock/us-east-1/1-month-commitment/anthropic.claude-v2:1 -- -- 100,000 8191
bedrock/us-east-1/6-month-commitment/anthropic.claude-v2:1 -- -- 100,000 8191
bedrock/us-west-2/1-month-commitment/anthropic.claude-v2:1 -- -- 100,000 8191
bedrock/us-west-2/6-month-commitment/anthropic.claude-v2:1 -- -- 100,000 8191
anthropic.claude-instant-v1 $1.63 $5.51 100,000 8191
bedrock/us-east-1/anthropic.claude-instant-v1 $0.8 $2.4 100,000 8191
bedrock/us-east-1/1-month-commitment/anthropic.claude-instant-v1 -- -- 100,000 8191
bedrock/us-east-1/6-month-commitment/anthropic.claude-instant-v1 -- -- 100,000 8191
bedrock/us-west-2/1-month-commitment/anthropic.claude-instant-v1 -- -- 100,000 8191
bedrock/us-west-2/6-month-commitment/anthropic.claude-instant-v1 -- -- 100,000 8191
bedrock/us-west-2/anthropic.claude-instant-v1 $0.8 $2.4 100,000 8191
bedrock/ap-northeast-1/anthropic.claude-instant-v1 $2.23 $7.55 100,000 8191
bedrock/ap-northeast-1/1-month-commitment/anthropic.claude-instant-v1 -- -- 100,000 8191
bedrock/ap-northeast-1/6-month-commitment/anthropic.claude-instant-v1 -- -- 100,000 8191
bedrock/eu-central-1/anthropic.claude-instant-v1 $2.48 $8.38 100,000 8191
bedrock/eu-central-1/1-month-commitment/anthropic.claude-instant-v1 -- -- 100,000 8191
bedrock/eu-central-1/6-month-commitment/anthropic.claude-instant-v1 -- -- 100,000 8191
cohere.command-text-v14 $1.5 $ 2.00 4,096 4096
bedrock/*/1-month-commitment/cohere.command-text-v14 -- -- 4,096 4096
bedrock/*/6-month-commitment/cohere.command-text-v14 -- -- 4,096 4096
cohere.command-light-text-v14 $0.3 $0.6 4,096 4096
bedrock/*/1-month-commitment/cohere.command-light-text-v14 -- -- 4,096 4096
bedrock/*/6-month-commitment/cohere.command-light-text-v14 -- -- 4,096 4096
cohere.command-r-plus-v1:0 $ 3.00 $15.00 128,000 4096
cohere.command-r-v1:0 $0.5 $1.5 128,000 4096
cohere.embed-english-v3 $0.1 $ 0.00 512 nan
cohere.embed-multilingual-v3 $0.1 $ 0.00 512 nan
meta.llama2-13b-chat-v1 $0.75 $ 1.00 4,096 4096
meta.llama2-70b-chat-v1 $1.95 $2.56 4,096 4096
meta.llama3-8b-instruct-v1:0 $0.3 $0.6 8,192 8192
bedrock/us-east-1/meta.llama3-8b-instruct-v1:0 $0.3 $0.6 8,192 8192
bedrock/us-west-1/meta.llama3-8b-instruct-v1:0 $0.3 $0.6 8,192 8192
bedrock/ap-south-1/meta.llama3-8b-instruct-v1:0 $0.36 $0.72 8,192 8192
bedrock/ca-central-1/meta.llama3-8b-instruct-v1:0 $0.35 $0.69 8,192 8192
bedrock/eu-west-1/meta.llama3-8b-instruct-v1:0 $0.32 $0.65 8,192 8192
bedrock/eu-west-2/meta.llama3-8b-instruct-v1:0 $0.39 $0.78 8,192 8192
bedrock/sa-east-1/meta.llama3-8b-instruct-v1:0 $0.5 $1.01 8,192 8192
meta.llama3-70b-instruct-v1:0 $2.65 $3.5 8,192 8192
bedrock/us-east-1/meta.llama3-70b-instruct-v1:0 $2.65 $3.5 8,192 8192
bedrock/us-west-1/meta.llama3-70b-instruct-v1:0 $2.65 $3.5 8,192 8192
bedrock/ap-south-1/meta.llama3-70b-instruct-v1:0 $3.18 $4.2 8,192 8192
bedrock/ca-central-1/meta.llama3-70b-instruct-v1:0 $3.05 $4.03 8,192 8192
bedrock/eu-west-1/meta.llama3-70b-instruct-v1:0 $2.86 $3.78 8,192 8192
bedrock/eu-west-2/meta.llama3-70b-instruct-v1:0 $3.45 $4.55 8,192 8192
bedrock/sa-east-1/meta.llama3-70b-instruct-v1:0 $4.45 $5.88 8,192 8192
meta.llama3-1-8b-instruct-v1:0 $0.22 $0.22 128,000 2048
meta.llama3-1-70b-instruct-v1:0 $0.99 $0.99 128,000 2048
meta.llama3-1-405b-instruct-v1:0 $5.32 $16.00 128,000 4096
meta.llama3-2-1b-instruct-v1:0 $0.1 $0.1 128,000 4096
us.meta.llama3-2-1b-instruct-v1:0 $0.1 $0.1 128,000 4096
eu.meta.llama3-2-1b-instruct-v1:0 $0.13 $0.13 128,000 4096
meta.llama3-2-3b-instruct-v1:0 $0.15 $0.15 128,000 4096
us.meta.llama3-2-3b-instruct-v1:0 $0.15 $0.15 128,000 4096
eu.meta.llama3-2-3b-instruct-v1:0 $0.19 $0.19 128,000 4096
meta.llama3-2-11b-instruct-v1:0 $0.35 $0.35 128,000 4096
us.meta.llama3-2-11b-instruct-v1:0 $0.35 $0.35 128,000 4096
meta.llama3-2-90b-instruct-v1:0 $ 2.00 $ 2.00 128,000 4096
us.meta.llama3-2-90b-instruct-v1:0 $ 2.00 $ 2.00 128,000 4096
512-x-512/50-steps/stability.stable-diffusion-xl-v0 -- -- 77 nan
512-x-512/max-steps/stability.stable-diffusion-xl-v0 -- -- 77 nan
max-x-max/50-steps/stability.stable-diffusion-xl-v0 -- -- 77 nan
max-x-max/max-steps/stability.stable-diffusion-xl-v0 -- -- 77 nan
1024-x-1024/50-steps/stability.stable-diffusion-xl-v1 -- -- 77 nan
1024-x-1024/max-steps/stability.stable-diffusion-xl-v1 -- -- 77 nan
sagemaker/meta-textgeneration-llama-2-7b $ 0.00 $ 0.00 4,096 4096
sagemaker/meta-textgeneration-llama-2-7b-f $ 0.00 $ 0.00 4,096 4096
sagemaker/meta-textgeneration-llama-2-13b $ 0.00 $ 0.00 4,096 4096
sagemaker/meta-textgeneration-llama-2-13b-f $ 0.00 $ 0.00 4,096 4096
sagemaker/meta-textgeneration-llama-2-70b $ 0.00 $ 0.00 4,096 4096
sagemaker/meta-textgeneration-llama-2-70b-b-f $ 0.00 $ 0.00 4,096 4096
together-ai-up-to-4b $0.1 $0.1 nan nan
together-ai-4.1b-8b $0.2 $0.2 nan nan
together-ai-8.1b-21b $0.3 $0.3 nan nan
together-ai-21.1b-41b $0.8 $0.8 nan nan
together-ai-41.1b-80b $0.9 $0.9 nan nan
together-ai-81.1b-110b $1.8 $1.8 nan nan
together-ai-embedding-up-to-150m $0.008 $ 0.00 nan nan
together-ai-embedding-151m-to-350m $0.016 $ 0.00 nan nan
together_ai/mistralai/Mixtral-8x7B-Instruct-v0.1 $0.6 $0.6 nan nan
together_ai/mistralai/Mistral-7B-Instruct-v0.1 -- -- nan nan
together_ai/togethercomputer/CodeLlama-34b-Instruct -- -- nan nan
ollama/codegemma $ 0.00 $ 0.00 8,192 8192
ollama/codegeex4 $ 0.00 $ 0.00 32,768 8192
ollama/deepseek-coder-v2-instruct $ 0.00 $ 0.00 32,768 8192
ollama/deepseek-coder-v2-base $ 0.00 $ 0.00 8,192 8192
ollama/deepseek-coder-v2-lite-instruct $ 0.00 $ 0.00 32,768 8192
ollama/deepseek-coder-v2-lite-base $ 0.00 $ 0.00 8,192 8192
ollama/internlm2_5-20b-chat $ 0.00 $ 0.00 32,768 8192
ollama/llama2 $ 0.00 $ 0.00 4,096 4096
ollama/llama2:7b $ 0.00 $ 0.00 4,096 4096
ollama/llama2:13b $ 0.00 $ 0.00 4,096 4096
ollama/llama2:70b $ 0.00 $ 0.00 4,096 4096
ollama/llama2-uncensored $ 0.00 $ 0.00 4,096 4096
ollama/llama3 $ 0.00 $ 0.00 8,192 8192
ollama/llama3:8b $ 0.00 $ 0.00 8,192 8192
ollama/llama3:70b $ 0.00 $ 0.00 8,192 8192
ollama/llama3.1 $ 0.00 $ 0.00 8,192 8192
ollama/mistral-large-instruct-2407 $ 0.00 $ 0.00 65,536 8192
ollama/mistral $ 0.00 $ 0.00 8,192 8192
ollama/mistral-7B-Instruct-v0.1 $ 0.00 $ 0.00 8,192 8192
ollama/mistral-7B-Instruct-v0.2 $ 0.00 $ 0.00 32,768 32768
ollama/mixtral-8x7B-Instruct-v0.1 $ 0.00 $ 0.00 32,768 32768
ollama/mixtral-8x22B-Instruct-v0.1 $ 0.00 $ 0.00 65,536 65536
ollama/codellama $ 0.00 $ 0.00 4,096 4096
ollama/orca-mini $ 0.00 $ 0.00 4,096 4096
ollama/vicuna $ 0.00 $ 0.00 2,048 2048
deepinfra/lizpreciatior/lzlv_70b_fp16_hf $0.7 $0.9 4,096 4096
deepinfra/Gryphe/MythoMax-L2-13b $0.22 $0.22 4,096 4096
deepinfra/mistralai/Mistral-7B-Instruct-v0.1 $0.13 $0.13 32,768 8191
deepinfra/meta-llama/Llama-2-70b-chat-hf $0.7 $0.9 4,096 4096
deepinfra/cognitivecomputations/dolphin-2.6-mixtral-8x7b $0.27 $0.27 32,768 8191
deepinfra/codellama/CodeLlama-34b-Instruct-hf $0.6 $0.6 4,096 4096
deepinfra/deepinfra/mixtral $0.27 $0.27 32,000 4096
deepinfra/Phind/Phind-CodeLlama-34B-v2 $0.6 $0.6 16,384 4096
deepinfra/mistralai/Mixtral-8x7B-Instruct-v0.1 $0.27 $0.27 32,768 8191
deepinfra/deepinfra/airoboros-70b $0.7 $0.9 4,096 4096
deepinfra/01-ai/Yi-34B-Chat $0.6 $0.6 4,096 4096
deepinfra/01-ai/Yi-6B-200K $0.13 $0.13 200,000 4096
deepinfra/jondurbin/airoboros-l2-70b-gpt4-1.4.1 $0.7 $0.9 4,096 4096
deepinfra/meta-llama/Llama-2-13b-chat-hf $0.22 $0.22 4,096 4096
deepinfra/amazon/MistralLite $0.2 $0.2 32,768 8191
deepinfra/meta-llama/Llama-2-7b-chat-hf $0.13 $0.13 4,096 4096
deepinfra/meta-llama/Meta-Llama-3-8B-Instruct $0.08 $0.08 8,191 4096
deepinfra/meta-llama/Meta-Llama-3-70B-Instruct $0.59 $0.79 8,191 4096
deepinfra/01-ai/Yi-34B-200K $0.6 $0.6 200,000 4096
deepinfra/openchat/openchat_3.5 $0.13 $0.13 4,096 4096
perplexity/codellama-34b-instruct $0.35 $1.4 16,384 16384
perplexity/codellama-70b-instruct $0.7 $2.8 16,384 16384
perplexity/llama-3.1-70b-instruct $ 1.00 $ 1.00 131,072 131072
perplexity/llama-3.1-8b-instruct $0.2 $0.2 131,072 131072
perplexity/llama-3.1-sonar-huge-128k-online $ 5.00 $ 5.00 127,072 127072
perplexity/llama-3.1-sonar-large-128k-online $ 1.00 $ 1.00 127,072 127072
perplexity/llama-3.1-sonar-large-128k-chat $ 1.00 $ 1.00 131,072 131072
perplexity/llama-3.1-sonar-small-128k-chat $0.2 $0.2 131,072 131072
perplexity/llama-3.1-sonar-small-128k-online $0.2 $0.2 127,072 127072
perplexity/pplx-7b-chat $0.07 $0.28 8,192 8192
perplexity/pplx-70b-chat $0.7 $2.8 4,096 4096
perplexity/pplx-7b-online $ 0.00 $0.28 4,096 4096
perplexity/pplx-70b-online $ 0.00 $2.8 4,096 4096
perplexity/llama-2-70b-chat $0.7 $2.8 4,096 4096
perplexity/mistral-7b-instruct $0.07 $0.28 4,096 4096
perplexity/mixtral-8x7b-instruct $0.07 $0.28 4,096 4096
perplexity/sonar-small-chat $0.07 $0.28 16,384 16384
perplexity/sonar-small-online $ 0.00 $0.28 12,000 12000
perplexity/sonar-medium-chat $0.6 $1.8 16,384 16384
perplexity/sonar-medium-online $ 0.00 $1.8 12,000 12000
fireworks_ai/accounts/fireworks/models/llama-v3p2-1b-instruct $0.1 $0.1 16,384 16384
fireworks_ai/accounts/fireworks/models/llama-v3p2-3b-instruct $0.1 $0.1 16,384 16384
fireworks_ai/accounts/fireworks/models/llama-v3p2-11b-vision-instruct $0.2 $0.2 16,384 16384
accounts/fireworks/models/llama-v3p2-90b-vision-instruct $0.9 $0.9 16,384 16384
fireworks_ai/accounts/fireworks/models/firefunction-v2 $0.9 $0.9 8,192 8192
fireworks_ai/accounts/fireworks/models/mixtral-8x22b-instruct-hf $1.2 $1.2 65,536 65536
fireworks_ai/accounts/fireworks/models/qwen2-72b-instruct $0.9 $0.9 32,768 32768
fireworks_ai/accounts/fireworks/models/yi-large $ 3.00 $ 3.00 32,768 32768
fireworks_ai/accounts/fireworks/models/deepseek-coder-v2-instruct $1.2 $1.2 65,536 8192
fireworks_ai/nomic-ai/nomic-embed-text-v1.5 $0.008 $ 0.00 8,192 nan
fireworks_ai/nomic-ai/nomic-embed-text-v1 $0.008 $ 0.00 8,192 nan
fireworks_ai/WhereIsAI/UAE-Large-V1 $0.016 $ 0.00 512 nan
fireworks_ai/thenlper/gte-large $0.016 $ 0.00 512 nan
fireworks_ai/thenlper/gte-base $0.008 $ 0.00 512 nan
fireworks-ai-up-to-16b $0.2 $0.2 nan nan
fireworks-ai-16.1b-to-80b $0.9 $0.9 nan nan
fireworks-ai-moe-up-to-56b $0.5 $0.5 nan nan
fireworks-ai-56b-to-176b $1.2 $1.2 nan nan
fireworks-ai-default $ 0.00 $ 0.00 nan nan
fireworks-ai-embedding-up-to-150m $0.008 $ 0.00 nan nan
fireworks-ai-embedding-150m-to-350m $0.016 $ 0.00 nan nan
anyscale/mistralai/Mistral-7B-Instruct-v0.1 $0.15 $0.15 16,384 16384
anyscale/mistralai/Mixtral-8x7B-Instruct-v0.1 $0.15 $0.15 16,384 16384
anyscale/mistralai/Mixtral-8x22B-Instruct-v0.1 $0.9 $0.9 65,536 65536
anyscale/HuggingFaceH4/zephyr-7b-beta $0.15 $0.15 16,384 16384
anyscale/google/gemma-7b-it $0.15 $0.15 8,192 8192
anyscale/meta-llama/Llama-2-7b-chat-hf $0.15 $0.15 4,096 4096
anyscale/meta-llama/Llama-2-13b-chat-hf $0.25 $0.25 4,096 4096
anyscale/meta-llama/Llama-2-70b-chat-hf $ 1.00 $ 1.00 4,096 4096
anyscale/codellama/CodeLlama-34b-Instruct-hf $ 1.00 $ 1.00 4,096 4096
anyscale/codellama/CodeLlama-70b-Instruct-hf $ 1.00 $ 1.00 4,096 4096
anyscale/meta-llama/Meta-Llama-3-8B-Instruct $0.15 $0.15 8,192 8192
anyscale/meta-llama/Meta-Llama-3-70B-Instruct $ 1.00 $ 1.00 8,192 8192
cloudflare/@cf/meta/llama-2-7b-chat-fp16 $1.923 $1.923 3,072 3072
cloudflare/@cf/meta/llama-2-7b-chat-int8 $1.923 $1.923 2,048 2048
cloudflare/@cf/mistral/mistral-7b-instruct-v0.1 $1.923 $1.923 8,192 8192
cloudflare/@hf/thebloke/codellama-7b-instruct-awq $1.923 $1.923 4,096 4096
voyage/voyage-01 $0.1 $ 0.00 4,096 nan
voyage/voyage-lite-01 $0.1 $ 0.00 4,096 nan
voyage/voyage-large-2 $0.12 $ 0.00 16,000 nan
voyage/voyage-law-2 $0.12 $ 0.00 16,000 nan
voyage/voyage-code-2 $0.12 $ 0.00 16,000 nan
voyage/voyage-2 $0.1 $ 0.00 4,000 nan
voyage/voyage-lite-02-instruct $0.1 $ 0.00 4,000 nan
voyage/voyage-finance-2 $0.12 $ 0.00 4,000 nan
databricks/databricks-meta-llama-3-1-405b-instruct $ 5.00 $15.00002 128,000 128000
databricks/databricks-meta-llama-3-1-70b-instruct $1.00002 $2.99999 128,000 128000
databricks/databricks-dbrx-instruct $0.74998 $2.24901 32,768 32768
databricks/databricks-meta-llama-3-70b-instruct $1.00002 $2.99999 128,000 128000
databricks/databricks-llama-2-70b-chat $0.50001 $1.5 4,096 4096
databricks/databricks-mixtral-8x7b-instruct $0.50001 $0.99902 4,096 4096
databricks/databricks-mpt-30b-instruct $0.99902 $0.99902 8,192 8192
databricks/databricks-mpt-7b-instruct $0.50001 $ 0.00 8,192 8192
databricks/databricks-bge-large-en $0.10003 $ 0.00 512 nan
databricks/databricks-gte-large-en $0.12999 $ 0.00 8,192 nan
azure/gpt-4o-mini-2024-07-18 $0.165 $0.66 128,000 16384
amazon.titan-embed-image-v1 $0.8 $ 0.00 128 nan

Callback handlers

You may also calculate token costs in LLM wrapper/framework libraries using callbacks.

LlamaIndex

pip install `'tokencost[llama-index]'`

To use the base callback handler, you may import it:

from tokencost.callbacks.llama_index import TokenCostHandler

and pass to your framework callback handler.

Langchain

(Coming Soon)

Running locally

Installation via GitHub:

git clone git@github.com:AgentOps-AI/tokencost.git
cd tokencost
pip install -e .

Running tests

  1. Install pytest if you don't have it already
pip install pytest
  1. Run the tests/ folder while in the parent directory
pytest tests

This repo also supports tox, simply run python -m tox.

Contributing

Contributions to TokenCost are welcome! Feel free to create an issue for any bug reports, complaints, or feature suggestions.

License

TokenCost is released under the MIT License.