[R] Multi-Modal Reasoning with
Hi everyone,

Cosmos-Reason2 is a recent Qwen3-VL-based multimodal reasoning model designed for physical AI tasks. So far, however, it has been limited to high-end hardware such as DGX Spark, H100, GB200, and Jetson AGX Thor.

We have deployed Cosmos-Reason2-2B under an 8GB memory constraint (Jetson Orin Nano) using model compression and inference optimizations, enabling text, image, and video reasoning on that device.

HF Link with models, instructions, and benchmarks:

Interested to hear any feedback, or others' experiences deploying VLM reasoning models on memory-constrained edge hardware.

submitted by /u/No-Dragonfly6246