Introducing Teti Aura 3

Teti Aura 3, our fast and efficient model, is now generally available — and it's our new default.

Aura was the Greek personification of the morning breeze: light, fast, and everywhere at once. Our most efficient model carries her name because that's what it's built to be — quick, light on resources, and good enough to be the model you reach for by default.

Aura 3 is a single, multimodal model with native vision and built-in reasoning. It won't top a frontier leaderboard — that's not what it's for. It's built to handle the vast majority of everyday work well, at a price low enough to leave running all day.

What's new

One model handles text, images, reasoning, and tool use over a long context window, so most users never need to switch models.

Solid knowledge and science. On GPQA Diamond, a graduate-level science benchmark, Aura 3 scores 85.7% — strong for a model at its price point.
Native vision. Aura 3 reads screenshots, diagrams, and dense documents directly, so computer-use agents and data extraction don't need a heavier model.
Built-in reasoning. Extended reasoning is part of the model, not a separate mode to opt into.
Long context. A 262K-token window fits large codebases and long documents.

Benchmarks

Aura 3 belongs to the fast, efficient tier, so we compare it against models in its own category (such as Gemini Flash, Claude Haiku, and DeepSeek Flash) rather than against frontier flagships in a different price class. The scores below are independent results from Artificial Analysis, which runs the same evaluation suite across every model:

Benchmark	Aura 3	Fast-tier peers*
GPQA Diamond (science)	85.7%	67–92%
Coding Index	43.4	44–56
Humanity's Last Exam	22.7%	10–41%
Intelligence Index	29.4	30–50

*Efficient-tier models — Gemini 3.5 Flash, DeepSeek V4 Flash, GPT-5.4 mini, Claude 4.5 Haiku.

Among fast models, Aura 3 is right in the mix on knowledge. Its 85.7% on GPQA Diamond sits 6.5 points behind the strongest flash model and 18.5 points ahead of Claude 4.5 Haiku, and it adds native vision that most of them don't have. On the Coding Index and the Intelligence Index it lands just under the peer range.

GPQA Diamond, fast models — Teti Aura 3 at 85.7%, alongside Gemini 3.5 Flash (92.2%), DeepSeek V4 Flash (89.4%), GPT-5.4 mini (87.5%) and Claude 4.5 Haiku (67.2%). Independent scores via Artificial Analysis.

The real story is the price

Aura 3 delivers this at $0.75 per million input tokens and $1.50 per million output tokens — among the lowest in its class. Set against the models it actually competes with, it holds a strong knowledge score at one of the lowest prices on the board:

Price per 1M output tokens, fast models — Teti Aura 3 at $1.50, below Gemini 3.5 Flash ($9), Claude 4.5 Haiku ($5) and GPT-5.4 mini ($4.50), with DeepSeek V4 Flash ($0.28) lower still.
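
To put the chart's numbers side by side, here is a small Python sketch that expresses each listed output price as a multiple of Aura 3's. The prices are the ones quoted above; the dictionary and function names are illustrative only.

```python
# Output prices in USD per 1M tokens, as quoted in the chart above.
OUTPUT_PRICE_PER_M = {
    "Teti Aura 3": 1.50,
    "Gemini 3.5 Flash": 9.00,
    "Claude 4.5 Haiku": 5.00,
    "GPT-5.4 mini": 4.50,
    "DeepSeek V4 Flash": 0.28,
}


def price_ratios(prices, baseline="Teti Aura 3"):
    """Return each model's output price as a multiple of the baseline's."""
    base = prices[baseline]
    return {name: price / base for name, price in prices.items()}


if __name__ == "__main__":
    # Print cheapest first so the baseline's position is easy to see.
    ranked = sorted(price_ratios(OUTPUT_PRICE_PER_M).items(), key=lambda kv: kv[1])
    for name, ratio in ranked:
        print(f"{name:<20} {ratio:.2f}x")
```

On these figures, Gemini 3.5 Flash costs six times as much per output token as Aura 3, while DeepSeek V4 Flash costs under a fifth as much.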

Fast, long, and multimodal

Aura 3 pairs that pricing with a 262K-token context window — room for large codebases, long documents, and extended agent runs — and native vision, so it can read screenshots, diagrams, and dense documents directly.

Efficiency by design

Aura 3 is built around an efficiency-first architecture that keeps the compute per token low. That's what makes the pricing possible — and it's consistent with our commitment to sustainable AI: more capability on the same hardware, with a smaller energy footprint.

Governance

Like every Teti model, Aura 3 is governed by our Charter — the public, binding commitments that define how we build.

Pricing and availability

Teti Aura 3 is available today to all users — including the free plan — across all Teti products and the Cloud API, and it is now the default model. Pricing is $0.75 per million input tokens and $1.50 per million output tokens. Developers can select teti-aura-3 via the API, or try it directly in the console.
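
As a rough guide to what those rates mean in practice, the sketch below estimates the cost of a workload from its token counts. It assumes simple linear per-token billing at the list prices above, with no caching, batch, or volume discounts; the function name and the example workload are hypothetical.

```python
INPUT_USD_PER_M = 0.75   # list price per 1M input tokens (from this post)
OUTPUT_USD_PER_M = 1.50  # list price per 1M output tokens (from this post)


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Estimate spend in USD, assuming plain linear per-token billing."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = input_tokens * INPUT_USD_PER_M + output_tokens * OUTPUT_USD_PER_M
    return total / 1_000_000


if __name__ == "__main__":
    # Hypothetical day of traffic: 2M input tokens, 500K output tokens.
    print(f"${estimate_cost_usd(2_000_000, 500_000):.2f}")
```

Under those assumptions, the hypothetical workload of 2M input tokens and 500K output tokens comes to $2.25.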

Which model should I use?

Reach for Aura 3 for the vast majority of work — it's fast, inexpensive, and more capable than its price suggests. Step up to Teti Metis 3 when a task demands the deepest reasoning or the longest, hardest agentic runs. You can switch between them per conversation.

Try Teti Aura 3 today.
