Category Ranking

98%

Total Visits

921

Avg Visit Duration

2 minutes

Citations

20

Article Abstract

The accurate measurement of perceptual color differences (CDs) between two images plays an important role in modern smartphone photography. Although traditional CD metrics provide numerical scores to quantify color variations, they often lack the ability to offer intuitive insights or explanations that reflect the factors behind these differences in a way that aligns with human perception and reasoning. Here, we present CD-Reasoning, an innovative method designed not merely to compute numerical CD scores but also to provide a detailed rationale for the observed CDs between images. This method surpasses simple numerical quantification, delivering a more profound and explanatory analysis that bridges quantitative assessments with the qualitative reasoning characteristic of human perception. The development of the CD-Reasoning model begins with the compilation of a multi-modal CD dataset dubbed M-SPCD based on the existing SPCD, where we collect textual descriptions that detail the quantification of CDs across seven pivotal attributes: white balance, brightness contrast, color contrast, overall brightness, overall color, shadow detail, and highlight detail. Utilizing the newly curated M-SPCD dataset, we enhance the capabilities of cutting-edge Multimodal Large Language Models (MLLMs) to not only accurately assess numerical CD scores but also to provide in-depth reasoning that explains the CDs between two images. Extensive experiments demonstrate that the proposed CD-Reasoning not only achieves superior accuracy compared to state-of-the-art CD metrics but also significantly exceeds leading MLLMs in CD interpreting. Source codes will be available at https://github.com/LongYu-LY/CD-Reasoning.

Download full-text PDF

Source
http://dx.doi.org/10.1109/TIP.2024.3522802DOI Listing

Publication Analysis

Top Keywords

cds images
12
numerical scores
12
large language
8
language models
8
color differences
8
human perception
8
scores provide
8
color
5
harnessing multi-modal
4
multi-modal large
4

Similar Publications

Objective: Despite rapid advancements in understanding of cognitive disengagement syndrome (CDS) in children, less is known about the neural correlates of CDS. The aim of this study was to examine associations between CDS symptom severity and connectivity within and between specific brain networks.

Method: The study recruited 65 right-handed children (ages 8-13 years; 36 boys) with the full continuum of CDS symptom severity from the community.

View Article and Find Full Text PDF

Mn-doped carbon dots-based fluorescent-colorimetric dual-mode probes for selective and sensitive detection of Cr(VI) ions and l-ascorbic acid via smartphone-integrated analytical platform.

Anal Chim Acta

November 2025

Guangxi Key Laboratory of Natural Polymer Chemistry and Physics, Key Laboratory of Nanobiosensor Analysis, College of Chemistry and Materials, Nanning Normal University, Nanning, 530001, PR China. Electronic address:

Background: Hexavalent chromium ions (Cr(VI)), a notorious toxic heavy metal pollutant with proven carcinogenicity, endangers human health and the environment. Meanwhile, l-ascorbic acid (L-AA), a vital biological antioxidant, has abnormal levels closely tied to various diseases. Developing efficient synchronous detection methods for these two key analytes is of great value in clinical and environmental monitoring.

View Article and Find Full Text PDF

Gout, which affects 3-6 % of Western populations, has well-established therapies but still lacks agents that directly target monosodium urate (MSU) deposits. This study investigates a novel strategy employing cyclodextrins (CDs) and hyperbranched cyclodextrin-based polymers (HBCD-Pol) to both mobilize and prevent MSU formation. Among the CDs tested, HPβ-CD exhibited the strongest uric acid (UA) complexation at 25 °C, while HBCD-Pol showed superior performance by chelating Na ions.

View Article and Find Full Text PDF

Objective: To systematically evaluate the diagnostic accuracy, educational utility, and communication potential of generative AI, particularly Large Language Models (LLMs) such as ChatGPT, in otolaryngology.

Data Sources: A comprehensive search of PubMed, Embase, Scopus, Web of Science, and IEEE Xplore identified English-language peer-reviewed studies from January 2022 to March 2025.

Review Methods: Eligible studies evaluated text-based generative AI models used in otolaryngology.

View Article and Find Full Text PDF

Simple Two-Component CDs/PVA Hydrogels Possess Dynamic High-Definition and Two-Stage Information Encryption Capability.

ACS Appl Mater Interfaces

September 2025

National Engineering Research Center of Clean Technology in Leather Industry, Sichuan University, Chengdu 610065, PR China.

Ensuring the fidelity and security level of stored information is essential for information carrier materials to safeguard data and prevent counterfeiting. However, low resolution, limited encryption modes, and complex fabrication hinder existing information carriers from meeting evolving technological demands. Herein, a solvent exchange strategy from DMSO to water is employed to stably anchor hydrophobic fluorescent carbon dots (CDs) with multiple emission states onto a 3D framework of poly(vinyl alcohol) (PVA) chains, forming a simple two-component CDs/PVA hydrogel with tunable fluorescent colors and recyclability.

View Article and Find Full Text PDF