Category Ranking

98%

Total Visits

921

Avg Visit Duration

2 minutes

Citations

20

Article Abstract

In the Image Aesthetics Computing (IAC) field, most prior methods leveraged the off-the-shelf backbones pre-trained on the large-scale ImageNet database. While these pre-trained backbones have achieved notable success, they often overemphasize object-level semantics and fail to capture the high-level concepts of image aesthetics, which may only achieve suboptimal performances. To tackle this long-neglected problem, we propose a multi-modality multi-attribute contrastive pre-training framework, targeting at constructing an alternative to ImageNet-based pre-training for IAC. Specifically, the proposed framework consists of two main aspects. 1) We build a multi-attribute image description database with human feedback, leveraging the competent image understanding capability of the multi-modality large language model to generate rich aesthetic descriptions. 2) To better adapt models to aesthetic computing tasks, we integrate the image-based visual features with the attribute-based text features, and map the integrated features into different embedding spaces, based on which the multi-attribute contrastive learning is proposed for obtaining more comprehensive aesthetic representation. To alleviate the distribution shift encountered when transitioning from the general visual domain to the aesthetic domain, we further propose a semantic affinity loss to restrain the content information and enhance model generalization. Extensive experiments demonstrate that the proposed framework sets new state-of-the-arts for IAC tasks.

Download full-text PDF

Source
http://dx.doi.org/10.1109/TPAMI.2024.3492259DOI Listing

Publication Analysis

Top Keywords

multi-attribute contrastive
12
image aesthetics
12
multi-modality multi-attribute
8
contrastive pre-training
8
aesthetics computing
8
proposed framework
8
image
5
pre-training image
4
computing image
4
computing iac
4

Similar Publications

Military experience predicts military multitasking better than laboratory measures in officer cadets.

Cogn Res Princ Implic

June 2025

Department of Human Sciences, Institute of Psychology, General Psychology, University of the Bundeswehr Munich, Werner-Heisenberg-Weg 39, 85577, Neubiberg, Germany.

Whether or not it is possible to predict military performance using laboratory measures constitutes an important question. There are indications that humans possess a common multitasking ability enabling them to perform complex behaviors irrespective of task requirements. Working memory processing abilities likely illustrate cognitive substrates thereof.

View Article and Find Full Text PDF

In 1993, the World Bank released a global report on the efficacy of health promotion, introducing the disability-adjusted life years (DALY) as a novel indicator. The DALY, a composite metric incorporating temporal and qualitative data, is grounded in preferences regarding disability status. This review delineates the algorithm used to calculate the value of the proposed DALY synthetic indicator and elucidates key methodological challenges associated with its application.

View Article and Find Full Text PDF

An infectious eye illness known as pink eye results in ocular redness, irritation, and mucus. Schools are an especially vulnerable region for dissemination because they can propagate that contagious disease quickly via direct or indirect interactions. Choosing the right medication to treat pink eye infection is typically thought of as an intricate multi-attribute group decision-making concern.

View Article and Find Full Text PDF

In the Image Aesthetics Computing (IAC) field, most prior methods leveraged the off-the-shelf backbones pre-trained on the large-scale ImageNet database. While these pre-trained backbones have achieved notable success, they often overemphasize object-level semantics and fail to capture the high-level concepts of image aesthetics, which may only achieve suboptimal performances. To tackle this long-neglected problem, we propose a multi-modality multi-attribute contrastive pre-training framework, targeting at constructing an alternative to ImageNet-based pre-training for IAC.

View Article and Find Full Text PDF

A new method has recently been developed for valuing health states, called 'Online elicitation of Personal Utility Functions' (OPUF). In contrast to established methods, such as time trade-off or discrete choice experiments, OPUF does not require hundreds of respondents, but allows estimating utility functions for small groups and even at the individual level. In this study, we used OPUF to elicit EQ-5D-5L health state preferences from a (not representative) sample of the UK general population, and then compared utility functions on the societal-, group-, and individual level.

View Article and Find Full Text PDF