What are tokens?
Tokens are the chunks of text models read and write—often a few characters or part of a word. You are billed separately for input (your prompt) and output (the model’s reply). Big JSON payloads, long system prompts, and high max_tokens all push cost up.