Grounding 2 🖼️ Qwen2.5-VL: Next-Gen Vision-Language Model with Dynamic Resolution & Long Video Understanding Sep 8, 2025 📍 GEM: Grounding Everything in Vision-Language Transformers Sep 8, 2025