[D] Qwen3.5 rumored to merge MoE + Hybrid Attention — thoughts?

Reports in Chinese AI news suggest that Qwen3.5 integrates Mixture-of-Experts (MoE) with Hybrid Attention for better inference efficiency. Do you think routing efficiency matters more than raw parameter count?
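
To make the question a bit more concrete, here is a rough back-of-the-envelope sketch of why total parameter count and per-token compute come apart in an MoE model with top-k routing. Everything in it is an assumption for illustration: the function name `moe_param_counts` and all the numbers are made up and are not Qwen3.5's actual configuration, which has not been confirmed.

```python
def moe_param_counts(shared_params, expert_params, num_experts, top_k):
    """Return (total, active) parameter counts for a simplified MoE model.

    shared_params: parameters every token uses (attention, embeddings, ...)
    expert_params: parameters in a single expert
    num_experts:   number of experts in the model
    top_k:         number of experts the router selects per token
    """
    if not 1 <= top_k <= num_experts:
        raise ValueError("top_k must be between 1 and num_experts")
    total = shared_params + num_experts * expert_params
    active = shared_params + top_k * expert_params
    return total, active


# Hypothetical numbers, NOT a real model configuration.
total, active = moe_param_counts(
    shared_params=2_000_000_000,
    expert_params=500_000_000,
    num_experts=64,
    top_k=4,
)
print(f"total parameters:  {total:,}")
print(f"active per token:  {active:,}")
print(f"active fraction:   {active / total:.1%}")
```

In this toy setup only a small fraction of the weights is used for any given token, so how well the router picks (and balances) experts plausibly matters at least as much as the headline parameter count.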

submitted by /u/AppropriateMark8528