An LLM trained or configured to produce long chains of internal "thinking" tokens before answering — trading latency for higher accuracy on complex problems. OpenAI's o1 and Claude's extended thinking are examples.
"Switched to a reasoning model for math problems. Slower but actually correct."
No comments yet — say something.
Add your own interpretation of "reasoning model".