Forked from qwen/qwen3-4b-2507
Description
Updated version of the Qwen3-4B non-thinking mode, featuring significant improvements in general capabilities, including instruction following, logical reasoning, text comprehension, mathematics, science, coding, and tool usage.
Capabilities
Minimum system memory
Tags
Last updated
Updated 4 days ago
README
Updated version of the Qwen3-4B non-thinking mode, featuring significant improvements in general capabilities, including instruction following, logical reasoning, text comprehension, mathematics, science, coding, and tool usage.
This model delivers substantial gains in long-tail knowledge coverage across multiple languages and markedly better alignment with user preferences in subjective and open-ended tasks, enabling more helpful responses and higher-quality text generation.
It also offers enhanced long-context understanding, with a context length of 256K.
Note: This model supports only non-thinking mode and does not generate <think></think> blocks in its output.
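For downstream code that handles both thinking and non-thinking models, the note above means that any step which separates reasoning from the final answer should treat a missing <think></think> block as the normal case for this model. The following is a minimal sketch in Python, not part of the model's own tooling; the function name split_reasoning and the sample strings are hypothetical and used only for illustration.

```python
import re

# Matches one <think>...</think> block, including newlines inside it,
# plus any whitespace that directly follows the closing tag.
THINK_BLOCK = re.compile(r"<think>.*?</think>\s*", re.DOTALL)


def split_reasoning(text: str) -> tuple[str, str]:
    """Split model output into (reasoning, answer).

    Hypothetical helper: `reasoning` is an empty string when the output
    contains no <think></think> block, which is the expected case for a
    non-thinking model.
    """
    match = THINK_BLOCK.search(text)
    if match is None:
        # No reasoning block: the whole output is the answer.
        return "", text.strip()

    # Drop the tags themselves and keep only the text between them.
    reasoning = re.sub(r"</?think>", "", match.group(0)).strip()
    # The answer is everything outside the matched block.
    answer = (text[:match.start()] + text[match.end():]).strip()
    return reasoning, answer


if __name__ == "__main__":
    # Output shaped like a non-thinking model's response (made-up sample).
    print(split_reasoning("Paris is the capital of France."))
    # Output shaped like a thinking model's response (made-up sample).
    print(split_reasoning("<think>Recall the capital.</think>\nParis."))
```

With a helper like this, the same post-processing path can serve both kinds of model: for this model the reasoning part is always empty and the answer is passed through unchanged.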
Parameters
Custom configuration options included with this model
Sources
The underlying model files this model uses