A multimodal convolutional neuro-fuzzy network for emotion understanding of movie clips.

Neural Netw

School of Electronics Engineering, IT-1, Kyungpook National University, Daegu, 41566, South Korea. Electronic address:

Published: October 2019


Category Ranking

98%

Total Visits

921

Avg Visit Duration

2 minutes

Citations

20

Article Abstract

Multimodal emotion understanding enables AI systems to interpret human emotions. With accelerated video surge, emotion understanding remains challenging due to inherent data ambiguity and diversity of video content. Although deep learning has made a considerable progress in big data feature learning, they are viewed as deterministic models used in a "black-box" manner which does not have capabilities to represent inherent ambiguities with data. Since the possibility theory of fuzzy logic focuses on knowledge representation and reasoning under uncertainty, we intend to incorporate the concepts of fuzzy logic into deep learning framework. This paper presents a novel convolutional neuro-fuzzy network, which is an integration of convolutional neural networks in fuzzy logic domain to extract high-level emotion features from text, audio, and visual modalities. The feature sets extracted by fuzzy convolutional layers are compared with those of convolutional neural networks at the same level using t-distributed Stochastic Neighbor Embedding. This paper demonstrates a multimodal emotion understanding framework with an adaptive neural fuzzy inference system that can generate new rules to classify emotions. For emotion understanding of movie clips, we concatenate audio, visual, and text features extracted using the proposed convolutional neuro-fuzzy network to train adaptive neural fuzzy inference system. In this paper, we go one step further to explain how deep learning arrives at a conclusion that can guide us to an interpretable AI. To identify which visual/text/audio aspects are important for emotion understanding, we use direct linear non-Gaussian additive model to explain the relevance in terms of causal relationships between features of deep hidden layers. The critical features extracted are input to the proposed multimodal framework to achieve higher accuracy.

Download full-text PDF

Source
http://dx.doi.org/10.1016/j.neunet.2019.06.010DOI Listing

Publication Analysis

Top Keywords

emotion understanding
24
convolutional neuro-fuzzy
12
neuro-fuzzy network
12
deep learning
12
fuzzy logic
12
understanding movie
8
movie clips
8
multimodal emotion
8
convolutional neural
8
neural networks
8

Similar Publications

Background: The benefits of physical activity for frail older acutely hospitalized adults are becoming increasingly clear. To enhance opportunities for physical activity on geriatric wards, it is essential to understand the older adult's perspective.

Aim: The aim of the study was to explore the experiences and perceptions of physical activity among older adults during hospital stays on a geriatric ward.

View Article and Find Full Text PDF

Background: Depression is a common mental disorder in hemodialysis patients. The present study aimed to identify subgroups of patients receiving hemodialysis based on depression and explore the influencing factors in a multicenter hemodialysis population in China.

Methods: A total of 1,090 hemodialysis patients (682 men, mean aged 61.

View Article and Find Full Text PDF

Background: Multi-cancer detection (MCED) blood tests have the potential to screen for early-stage cancers. Understanding how people experience an MCED cancer signal result is vital prior to any future implementation. We explored experiences in a trial context.

View Article and Find Full Text PDF

[Towards an integrative understanding of complex trauma].

Encephale

September 2025

Département de psychiatrie de l'adolescent et du jeune adulte, institut mutualiste Montsouris, 42, boulevard Jourdan, Paris, France; UVSQ, Inserm U1178, PsyDev, CESP université Paris-Saclay, Villejuif, France; Université Paris-Cité, Paris, France.

The body of knowledge on trauma is rapidly expanding. Since 2022, the WHO has been calling for the history of adversity to be systematically taken into account when assessing the state of health of all individuals. But at this stage, our understanding of the precise mechanisms of complex trauma remains incomplete.

View Article and Find Full Text PDF

Sexual pleasure in older age: haptic visuality and female eroticism in three contemporary Spanish films.

J Aging Stud

September 2025

Dean of Area Studies and Assistant Dean of Faculty, IES Abroad Barcelona (Spain) & Research Fellow, Aston University, UK. Electronic address:

This article explores the representation of female sexuality in later life through the lens of three contemporary Spanish films: La vida era eso (2020), Destello bravío (2021), and Mamacruz (2023). Drawing from feminist aging studies, film theory, and concepts such as haptic visuality and clitoral sexuality, the study challenges the patriarchal, ageist, and phallocentric narratives that have long shaped cultural understandings of older women's erotic lives. Through close readings of these films, the article demonstrates how they subvert the dominant heteronormative gaze by foregrounding sensory pleasure, autoeroticism, and the reawakening of desire in older women.

View Article and Find Full Text PDF