[D] Qwen3.5 rumored to merge MoE + Hybrid Attention — thoughts?
Reports from Chinese AI news outlets suggest Qwen3.5 combines MoE with Hybrid Attention for better inference efficiency. Do you think routing efficiency matters more than raw parameter count?
submitted by /u/AppropriateMark8528