[P] ColQwen3.5-v2 4.5B is out!
Follow-up to v1. ColQwen3.5-v2 is a 4.5B param visual document retrieval model built on Qwen3.5-4B with the ColPali late-interaction recipe.
Results:
- ViDoRe V3 nDCG@10: 0.6177 (currently top of the leaderboard)
- ViDoRe V1 nDCG@5: 0.9172 (top among 4B models)
- ViDoRe V3 nDCG@5: 0.5913, closing the gap to TomoroAI from 0.010 to 0.002
Main change from v1 is a simpler training recipe: 2 phases instead of 4. Hard negatives mined once and reused, domain data (finance + tables) baked in from the start, then model souped with v1 at a 55/45 weight ratio. Fewer seeds (3 vs 4), better results.
Apache 2.0, weights on HF: https://huggingface.co/athrael-soju/colqwen3.5-4.5B-v2
Let me know if you try it out!
submitted by /u/madkimchi
[link] [comments]
Like
0
Liked
Liked