Token Efficiency

Token Efficiency in Language Models

Definition

The measure of how effectively a language model uses tokens (input/output units) to complete tasks. More token-efficient models require fewer tokens to accomplish the same task, reducing latency and costs.

Examples in the Wild

  • Example 1:Gemma uses fewer tokens than Claude for coding tasks
  • Example 2:Gemini uses far fewer tokens compared to Claude and OpenAI