Reasoning

Replacing thinking with tool usage enables reasoning in small language models

We replace natural language "thinking" with structured tool interactions, enabling even 3B-parameter models to learn effective test-time compute scaling.