LLM 0.32a2 switches most reasoning-capable OpenAI models over to the `/v1/responses` endpoint, which supports interleaved reasoning across tool calls for GPT-5 class models. As a result, summarized reasoning tokens are now displayed while a prompt runs, making it easier to debug and interpret a model's output; users who prefer quieter output can hide the display with a flag. For AI workflows, this visibility offers deeper insight into the model's decision-making as it happens.
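To make the endpoint shift concrete, the sketch below builds the shape of a `/v1/responses` request body for a reasoning-capable model. The field names follow the OpenAI Responses API; the model name and option values are illustrative assumptions, not a description of LLM's internal request construction.

```python
import json

# Illustrative /v1/responses payload (assumed example, not LLM's exact internals).
payload = {
    "model": "gpt-5",                  # any reasoning-capable model
    "input": "Plan a refactor of this module.",
    "reasoning": {"summary": "auto"},  # request summarized reasoning tokens
    "tools": [],                       # tool definitions would go here; reasoning
                                       # items can interleave with tool calls
}

print(json.dumps(payload, indent=2))
```

The `reasoning.summary` option is what surfaces the summarized reasoning tokens that LLM can now display (or hide) during prompt execution.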