

| Version | Build | OS | Arch | Last Updated | Download URL |
|---|---|---|---|---|---|
0.4.7 | 2 | Mac | arm64 | 03/11/2026 | Download |
0.4.7 | 2 | Windows | x86_64 | 03/11/2026 | Download |
0.4.7 | 2 | Windows | arm64 | 03/11/2026 | Download |
0.4.7 | 2 | Linux | x86_64 | 03/11/2026 | Download |
0.4.7 | 2 | Linux | x86_64 | 03/11/2026 | Download |
0.4.7 | 2 | Linux | arm64 | 03/11/2026 | Download |
0.4.7 | 2 | Linux | arm64 | 03/11/2026 | Download |
Build 2
Build 1
reasoning_content and content in API responses" is now ON by default in order to improve compatibility with /v1/chat/completions clients
parallel parameter to /api/v1/load endpointpresence_penalty sampling parameter/v1/responses endpoint erroring on none and xhigh reasoning effort/v1/responses responses included logProbs for MLX models even if message.output_text.logprobs was omitted/v1/messages API now surfaces errors when the model generates an invalid tool call, enabling Claude Code to recover gracefully