[P] ColQwen3.5-v2 4.5B is out!

Follow-up to v1. ColQwen3.5-v2 is a 4.5B param visual document retrieval model built on Qwen3.5-4B with the ColPali late-interaction recipe.

Results:

  • ViDoRe V3 nDCG@10: 0.6177 (currently top of the leaderboard)
  • ViDoRe V1 nDCG@5: 0.9172 (top among 4B models)
  • ViDoRe V3 nDCG@5: 0.5913, closing the gap to TomoroAI from 0.010 to 0.002

Main change from v1 is a simpler training recipe: 2 phases instead of 4. Hard negatives mined once and reused, domain data (finance + tables) baked in from the start, then model souped with v1 at a 55/45 weight ratio. Fewer seeds (3 vs 4), better results.

Apache 2.0, weights on HF: https://huggingface.co/athrael-soju/colqwen3.5-4.5B-v2

Let me know if you try it out!

submitted by /u/madkimchi
[link] [comments]

Liked Liked