Publications by Enxin Song

MovieChat+: Question-aware Sparse Memory for Long Video Question Answering.

Enxin Song, Wenhao Chai, Tian Ye, Jenq-Neng Hwang, Xi Li

IEEE Trans Pattern Anal Mach Intell

September 2025

Integrating video foundation models with large language models to build a video understanding system can overcome the limitations of specific vision tasks. Yet existing methods either employ complex spatial-temporal modules or rely heavily on additional perception models to extract temporal features for video understanding, and they perform well only on short videos. For long videos, the computational complexity and memory costs associated with long-term temporal connections increase significantly, posing additional challenges.
