Publications by Pooyan Fazli

VIDHALLUC: Evaluating Temporal Hallucinations in Multimodal Large Language Models for Video Understanding.

Proc IEEE Comput Soc Conf Comput Vis Pattern Recognit

June 2025

Multimodal large language models (MLLMs) have recently shown significant advancements in video understanding, excelling in content reasoning and instruction-following tasks. However, hallucination, where models generate inaccurate or misleading content, remains underexplored in the video domain. Building on the observation that MLLM visual encoders often fail to distinguish visually different yet semantically similar video pairs, we introduce VIDHALLUC, the largest benchmark designed to examine hallucinations in MLLMs for video understanding.

VideoA11y: Method and Dataset for Accessible Video Description.

Chaoyu Li, Sid Padmanabhuni, Maryam S Cheema, Hasti Seifi, Pooyan Fazli

Proc SIGCHI Conf Hum Factor Comput Syst

April 2025

Video descriptions are crucial for blind and low vision (BLV) users to access visual content. However, current artificial intelligence models for generating descriptions often fall short due to limitations in the quality of human annotations within training datasets, resulting in descriptions that do not fully meet BLV users' needs. To address this gap, we introduce VideoA11y, an approach that leverages multimodal large language models (MLLMs) and video accessibility guidelines to generate descriptions tailored for BLV individuals.
