Category Ranking

98%

Total Visits

921

Avg Visit Duration

2 minutes

Citations

20

Article Abstract

Composed query image retrieval task aims to retrieve the target image in the database by a query that composes two different modalities: a reference image and a sentence declaring that some details of the reference image need to be modified and replaced by new elements. Tackling this task needs to learn a multimodal embedding space, which can make semantically similar targets and queries close but dissimilar targets and queries as far away as possible. Most of the existing methods start from the perspective of model structure and design some clever interactive modules to promote the better fusion and embedding of different modalities. However, their learning objectives use conventional query-level examples as negatives while neglecting the composed query's multimodal characteristics, leading to the inadequate utilization of the training data and suboptimal construction of metric space. To this end, in this paper, we propose to improve the learning objective by constructing and mining hard negative examples from the perspective of multimodal fusion. Specifically, we compose the reference image and its logically unpaired sentences rather than paired ones to create component-level negative examples to better use data and enhance the optimization of metric space. In addition, we further propose a new sentence augmentation method to generate more indistinguishable multimodal negative examples from the element level and help the model learn a better metric space. Massive comparison experiments on four real-world datasets confirm the effectiveness of the proposed method.

Download full-text PDF

Source
http://dx.doi.org/10.1109/TIP.2024.3359062DOI Listing

Publication Analysis

Top Keywords

reference image
12
metric space
12
negative examples
12
composed query
8
query image
8
image retrieval
8
targets queries
8
image
6
multimodal
5
multimodal composition
4

Similar Publications

Clinicopathological features of dermal clear cell sarcoma: A series of 13 cases.

Pathol Res Pract

September 2025

Department of Pathology, Xijing Hospital and School of Basic Medicine, Fourth Military Medical University, Xi'an, China. Electronic address:

Background: Dermal clear cell sarcoma (DCCS) is a rare malignant mesenchymal neoplasm. Owing to the overlaps in its morphological and immunophenotypic profiles with a broad spectrum of tumors exhibiting melanocytic differentiation, it is frequently misdiagnosed as other tumor entities in clinical practice. By systematically analyzing the clinicopathological characteristics, immunophenotypic features, and molecular biological properties of DCCS, this study intends to further enhance pathologists' understanding of this disease and provide a valuable reference for its accurate diagnosis.

View Article and Find Full Text PDF

The aim of this in-vitro study was to verify which field of view (FOV) in cone-beam computed tomography (CBCT) yields greater accuracy in the detection of internal root resorption (IRR) volume, in comparison to the gold standard of micro-computed tomography (micro-CT) and to a physical method. Twenty-five extractedsingle-rooted teeth were scanned by CBCT with two different FOV parameters (6x6-FOV and 10x10-FOV) and via micro-CT. The volume of dental hard tissue was measured on these images.

View Article and Find Full Text PDF

Advancements in digital media have driven the study and use of photographic records as a diagnostic method for carious lesions, with smartphone images being widely utilized across various health fields. This study aimed to evaluate the diagnostic accuracy of smartphone photography for detecting active caries in orthodontic patients. The sample comprised 100 individuals of both sexes, aged 11 to 46 years, who were undergoing fixed orthodontic treatment.

View Article and Find Full Text PDF

Neural Quantum Embedding via Deterministic Quantum Computation with One Qubit.

Phys Rev Lett

August 2025

Southern University of Science and Technology, Department of Physics, State Key Laboratory of Quantum Functional Materials, and Guangdong Basic Research Center of Excellence for Quantum Science, Shenzhen 518055, China.

Quantum computing is expected to provide an exponential speedup in machine learning. However, optimizing the data loading process, commonly referred to as "quantum data embedding," to maximize classification performance remains a critical challenge. In this Letter, we propose a neural quantum embedding (NQE) technique based on deterministic quantum computation with one qubit (DQC1).

View Article and Find Full Text PDF

Catalyzing computational biology research at an academic institute through an interest network.

PLoS Comput Biol

September 2025

Department of Integrative Structural and Computational Biology, The Scripps Research Institute, La Jolla, California, United States of America.

Biology has been transformed by the rapid development of computing and the concurrent rise of data-rich approaches such as, omics or high-resolution imaging. However, there is a persistent computational skills gap in the biomedical research workforce. Inherent limitations of classroom teaching and institutional core support highlight the need for accessible ways for researchers to explore developments in computational biology.

View Article and Find Full Text PDF