Forked from qwen/qwen3-coder-480b
Description
Qwen's most powerful code model, with 480B total parameters, of which 35B are activated per token through a Mixture of Experts (MoE) architecture.
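To put the two parameter counts in proportion, the short sketch below computes the share of weights that are active for a single token. The constant and function names are illustrative only and are not part of the model's configuration. Only the 480B and 35B figures come from the description above.

```python
# Parameter counts from the description, in billions.
TOTAL_PARAMS_B = 480
ACTIVE_PARAMS_B = 35


def active_fraction(active_b: float, total_b: float) -> float:
    """Return the share of parameters used per token (hypothetical helper)."""
    return active_b / total_b


fraction = active_fraction(ACTIVE_PARAMS_B, TOTAL_PARAMS_B)
print(f"{fraction:.1%} of parameters are active per token")
```

Note that, in a typical MoE deployment, all experts still need to be resident in memory even though only a fraction of them run for each token, so the total parameter count rather than the active count drives memory requirements.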
Last updated
Updated 10 days ago
README
Key Features:
Technical Specifications:
Note: This model operates in non-thinking mode only and does not generate <think></think> blocks.
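For client code that is shared with reasoning models, the note above means the reasoning portion of a response should always be empty here. A minimal sketch, assuming a plain-string response, is shown below. The helper `split_reasoning` and the sample text are hypothetical and not part of any Qwen or runtime API.

```python
import re

# Matches a complete <think>...</think> block, including newlines inside it.
THINK_BLOCK = re.compile(r"<think>.*?</think>", re.DOTALL)


def split_reasoning(text: str) -> tuple[str, str]:
    """Split a response into (reasoning blocks, remaining answer).

    Hypothetical helper: for a non-thinking model the first element
    is expected to be an empty string.
    """
    reasoning = "".join(m.group(0) for m in THINK_BLOCK.finditer(text))
    answer = THINK_BLOCK.sub("", text).strip()
    return reasoning, answer


# Made-up sample standing in for a response from this model.
sample_output = "def add(a, b):\n    return a + b"
reasoning, answer = split_reasoning(sample_output)
```

With output that contains no `<think></think>` blocks, `reasoning` is empty and `answer` is the original text, so the same code path can serve both thinking and non-thinking models.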
Parameters
Custom configuration options included with this model
Sources
The underlying model files this model uses