Qwen3.5: Towards Native Multimodal Agents
Alibaba’s Qwen just released the first two models in the Qwen 3.5 series: one open weights, one proprietary. Both are multimodal, accepting vision input. The open-weights one is a Mixture of Experts model called Qwen3.5-397B-A17B. Interesting to see Qwen call out serving efficiency as a benefit of that architecture: Built on an innovative hybrid architecture that fuses linear attention (via Gated Delta Networks) with a sparse mixture-of-experts, the model attains remarkable […]