Why Tokens Matter

Every API call to an LLM is billed by tokens: chunks of text that the model reads and generates. A single word might be one token or several, depending on the provider's tokenizer. At high volume, the difference between 100 and 130 tokens per request can add up to thousands of dollars per month.
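As a rough illustration (the volume and price below are assumptions for the sake of arithmetic, not TokenAdvisor figures), here is how 30 extra tokens per request compounds:

```ts
// Back-of-the-envelope cost of 30 extra tokens per request.
// Assumed numbers: 10M requests/month, $10 per 1M input tokens
// (roughly GPT-4-class input pricing; check current rate cards).
const extraTokensPerRequest = 130 - 100; // 30 tokens
const requestsPerMonth = 10_000_000;
const pricePerMillionTokens = 10; // USD

const extraTokensPerMonth = extraTokensPerRequest * requestsPerMonth;
const extraCostPerMonth = (extraTokensPerMonth / 1_000_000) * pricePerMillionTokens;
console.log(extraCostPerMonth); // 3000 -> $3,000/month of pure waste
```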

Most developers don't realize that the same prompt costs different amounts across Claude, GPT, and Gemini — not just because of pricing, but because each provider tokenizes your text differently. A prompt that's 142 tokens on GPT might be 156 tokens on Claude.

TokenAdvisor shows you exactly where your tokens go. It counts tokens using the same official methods the APIs use — tiktoken for OpenAI (client-side, exact), Anthropic's count_tokens API, and Google's countTokens API. Then it analyzes your prompt for common patterns that waste tokens and translates the waste into specific dollar amounts at your volume.
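For reference, here is a minimal sketch of those three counting methods in TypeScript, using the js-tiktoken port, the @anthropic-ai/sdk package, and the @google/generative-ai package (the model names and sample prompt are placeholders):

```ts
import { getEncoding } from "js-tiktoken";
import Anthropic from "@anthropic-ai/sdk";
import { GoogleGenerativeAI } from "@google/generative-ai";

const prompt = "Summarize the following article in three bullet points.";

// OpenAI: exact and fully local, no network call needed.
// o200k_base is the encoding used by GPT-4o-era models.
const enc = getEncoding("o200k_base");
console.log("OpenAI:", enc.encode(prompt).length);

// Anthropic: official count_tokens endpoint (returns only a count).
const anthropic = new Anthropic(); // reads ANTHROPIC_API_KEY from env
const claude = await anthropic.messages.countTokens({
  model: "claude-3-5-sonnet-latest",
  messages: [{ role: "user", content: prompt }],
});
console.log("Claude:", claude.input_tokens);

// Google: official countTokens endpoint.
const genAI = new GoogleGenerativeAI(process.env.GEMINI_API_KEY!);
const gemini = genAI.getGenerativeModel({ model: "gemini-1.5-flash" });
const { totalTokens } = await gemini.countTokens(prompt);
console.log("Gemini:", totalTokens);
```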

The result: you see what to cut, how much you'll save, and which provider is cheapest for your specific prompt. No signup, no data stored, completely free.

For full pricing comparison across 20+ models with batch discounts and prompt caching calculations, see RealAICost.

Frequently Asked Questions

What is a token in an LLM API?
A token is a chunk of text that language models process. It can be a word, part of a word, or punctuation. Models like Claude, GPT, and Gemini each use different tokenizers, so the same text produces different token counts — and different costs. For example, "tokenization" might be split into ["token", "ization"] (2 tokens) by one model and ["tok", "en", "ization"] (3 tokens) by another.
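You can see a real split locally with tiktoken (the exact pieces depend on the encoding; this uses OpenAI's o200k_base, and the output comment is illustrative):

```ts
import { getEncoding } from "js-tiktoken";

const enc = getEncoding("o200k_base");
const ids = enc.encode("tokenization");

// Decode each token id back to its text piece to see the split.
const pieces = ids.map((id) => enc.decode([id]));
console.log(ids.length, pieces); // e.g. 2 [ "token", "ization" ]
```
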
Why do Claude, GPT, and Gemini have different token counts for the same text?
Each provider uses a different tokenizer algorithm. OpenAI's current models use the o200k_base encoding (counted locally with tiktoken), Anthropic uses its own proprietary tokenizer, and Google uses a SentencePiece-based tokenizer. Each algorithm splits text into tokens differently, so the same input produces different counts, and the same prompt can cost more or less depending on which provider you use.
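Anthropic's and Google's tokenizers aren't published as local libraries, but you can see the same effect by comparing two of OpenAI's own encodings on identical text (counts will vary with your input):

```ts
import { getEncoding } from "js-tiktoken";

const text = "Prompt caching reduces re-tokenization overhead.";

// Two generations of OpenAI encodings split the same text
// differently, just as different providers' tokenizers do.
const older = getEncoding("cl100k_base"); // GPT-4 / GPT-3.5 era
const newer = getEncoding("o200k_base"); // GPT-4o era

console.log("cl100k_base:", older.encode(text).length);
console.log("o200k_base:", newer.encode(text).length);
```
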
How do I reduce my API costs?
The most effective strategies are: (1) Remove verbose filler like "I would like you to please ensure that" — models respond the same to concise instructions. (2) Enable prompt caching to avoid re-processing repeated system prompts. (3) Specify output formats (JSON, XML tags) to prevent rambling responses. (4) Reduce few-shot examples to 2-3 instead of 5+. (5) Remove duplicate instructions that restate the same thing. TokenAdvisor's Advisor section identifies these patterns automatically in your prompt.
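Strategy (2), prompt caching, is the one that usually requires a code change. Below is a minimal sketch using Anthropic's cache_control marker (the system prompt text is a placeholder; OpenAI and Gemini have their own caching mechanisms with different APIs):

```ts
import Anthropic from "@anthropic-ai/sdk";

const client = new Anthropic(); // reads ANTHROPIC_API_KEY from env

// Marking a large, stable system prompt as cacheable lets repeat
// requests read it from cache at a discounted rate instead of
// paying full input-token price every time. Note that caching
// only kicks in above a minimum prompt size.
const response = await client.messages.create({
  model: "claude-3-5-sonnet-latest",
  max_tokens: 1024,
  system: [
    {
      type: "text",
      text: "...your long, rarely-changing system prompt...",
      cache_control: { type: "ephemeral" },
    },
  ],
  messages: [{ role: "user", content: "Today's question goes here." }],
});
console.log(response.usage); // includes cache read/write token counts
```
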
Is this tool free?
Yes, TokenAdvisor is completely free with no signup required. OpenAI token counting happens entirely in your browser using the tiktoken library. Claude and Gemini counts use their official free token-counting APIs, proxied through our server to protect the API key.
Does TokenAdvisor send my prompts anywhere?
OpenAI token counting is 100% client-side — your text never leaves your browser. For Claude and Gemini counts, your text is sent to their respective count_tokens APIs via our Cloudflare proxy. These are dedicated counting endpoints (not the chat/completion API) that only return a number — they do not store, log, or train on your content. We don't store your prompts either.
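For the curious, a proxy of this shape is only a few lines. This is a hypothetical sketch of a Cloudflare Worker forwarding text to Anthropic's count_tokens endpoint, not TokenAdvisor's actual code:

```ts
// Hypothetical counting proxy as a Cloudflare Worker: forwards text
// to Anthropic's count_tokens endpoint and returns only the number.
// The API key stays server-side in the Worker's environment.
export default {
  async fetch(request: Request, env: { ANTHROPIC_API_KEY: string }): Promise<Response> {
    const { text } = (await request.json()) as { text: string };

    const upstream = await fetch("https://api.anthropic.com/v1/messages/count_tokens", {
      method: "POST",
      headers: {
        "x-api-key": env.ANTHROPIC_API_KEY,
        "anthropic-version": "2023-06-01",
        "content-type": "application/json",
      },
      body: JSON.stringify({
        model: "claude-3-5-sonnet-latest",
        messages: [{ role: "user", content: text }],
      }),
    });

    const { input_tokens } = (await upstream.json()) as { input_tokens: number };
    return Response.json({ tokens: input_tokens }); // only a count leaves the proxy
  },
};
```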