AI Provider Directory

Compare pricing across major AI API providers. TokenShrink compresses your prompts before you send them — saving you money with every provider listed here.

How it works: Compress your prompt with TokenShrink, then paste the compressed version into your provider of choice. Fewer tokens in = lower cost. It works with every provider below.

OpenAI

Works with TokenShrink

The pioneer in large language models

ModelInput / 1MOutput / 1M
GPT-4o$2.50$10.00
GPT-4o mini$0.15$0.60
o1$15.00$60.00
o3-mini$1.10$4.40
Free tier with usage limits for new accountsView pricing →

Anthropic

Works with TokenShrink

Safety-focused AI with Claude models

ModelInput / 1MOutput / 1M
Claude Opus 4$15.00$75.00
Claude Sonnet 4$3.00$15.00
Claude Haiku 3.5$0.80$4.00
Free tier via claude.ai (limited usage)View pricing →

Google

Works with TokenShrink

Gemini models with massive context windows

ModelInput / 1MOutput / 1M
Gemini 2.5 Flash$0.15$0.60
Gemini 2.5 Pro$1.25$10.00
Gemini 2.0 Flash$0.10$0.40
Generous free tier via Google AI StudioView pricing →

Mistral

Works with TokenShrink

European AI with efficient open-weight models

ModelInput / 1MOutput / 1M
Mistral Large$2.00$6.00
Mistral Small$0.10$0.30
Codestral$0.30$0.90
Free tier for experimentationView pricing →

Meta / Llama

Works with TokenShrink

Open-source models available via multiple hosts

ModelInput / 1MOutput / 1M
Llama 3.3 70B$0.60$0.60
Llama 3.2 8B$0.05$0.05
Llama 4 Scout$0.17$0.17
Open weights — self-host for free, or use hosted providersView pricing →

Cohere

Works with TokenShrink

Enterprise-focused AI with RAG specialization

ModelInput / 1MOutput / 1M
Command R+$2.50$10.00
Command R$0.15$0.60
Free trial tier for developersView pricing →

Groq

Works with TokenShrink

Ultra-fast inference with custom LPU hardware

ModelInput / 1MOutput / 1M
Llama 3.3 70B$0.59$0.79
Mixtral 8x7B$0.24$0.24
Gemma 2 9B$0.20$0.20
Free tier with rate limitsView pricing →

Cerebras

Works with TokenShrink

Wafer-scale inference for blazing speed

ModelInput / 1MOutput / 1M
Llama 3.3 70B$0.60$0.60
Llama 3.1 8B$0.10$0.10
Free tier available for developersView pricing →

TokenShrink is not affiliated with any AI provider listed above. Pricing shown is approximate and may be outdated — always check the provider’s official pricing page for current rates. All trademarks belong to their respective owners.

Ready to save on all of them?

Try TokenShrink free