Episode Appearances
The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence) · Dec 9, 2025
Why Vision Language Models Ignore What They See with Munawar Hayat - #758
“Researcher at Qualcomm AI Research; PhD in computer vision from Australia; presented three NeurIPS papers on VLM hallucination, multimodal retrieval, and multi-person generation”
Vision Language Model Hallucination and Attention MechanismsVisual Token Grounding in Multimodal Language ModelsPhysics-Aware Image Generation and Simulation
View Analysis