Token Efficiency in Language Models
Definition
Token efficiency is a measure of how effectively a language model uses tokens (the units in which it reads input and produces output) to complete a task. A more token-efficient model needs fewer tokens to accomplish the same task, which reduces both latency and cost.
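As a rough illustration of the definition, the sketch below compares two runs of the same task by total token count and by cost. Everything in it is an assumption made for illustration: the `TaskRun` type, the helper names, the model labels, the token counts, and the per-million-token prices are hypothetical and do not describe any real model or price list.

```python
from dataclasses import dataclass


@dataclass
class TaskRun:
    """Token usage for one model completing one task (hypothetical values)."""
    model: str
    input_tokens: int
    output_tokens: int


def total_tokens(run: TaskRun) -> int:
    """Total tokens consumed: input plus output."""
    return run.input_tokens + run.output_tokens


def cost_usd(run: TaskRun, input_price_per_million: float,
             output_price_per_million: float) -> float:
    """Cost of a run, given assumed prices per one million tokens."""
    weighted = (run.input_tokens * input_price_per_million
                + run.output_tokens * output_price_per_million)
    return weighted / 1_000_000


def relative_token_use(run: TaskRun, baseline: TaskRun) -> float:
    """Ratio of tokens used versus a baseline run; below 1.0 means fewer tokens."""
    return total_tokens(run) / total_tokens(baseline)


# Two hypothetical models solving the same task with the same prompt.
model_a = TaskRun("model-a", input_tokens=1200, output_tokens=800)
model_b = TaskRun("model-b", input_tokens=1200, output_tokens=2800)

if __name__ == "__main__":
    for run in (model_a, model_b):
        print(run.model, total_tokens(run), cost_usd(run, 1.0, 4.0))
    print("model-a relative to model-b:", relative_token_use(model_a, model_b))
```

A comparison like this only says something about token efficiency when both runs solve the same task to a comparable standard. Different models also use different tokenizers, so raw token counts are not perfectly comparable across them.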
Examples in the Wild
- Example 1: Gemma uses fewer tokens than Claude for coding tasks
- Example 2: Gemini uses far fewer tokens than Claude and OpenAI models