98%
921
2 minutes
20
Composed query image retrieval task aims to retrieve the target image in the database by a query that composes two different modalities: a reference image and a sentence declaring that some details of the reference image need to be modified and replaced by new elements. Tackling this task needs to learn a multimodal embedding space, which can make semantically similar targets and queries close but dissimilar targets and queries as far away as possible. Most of the existing methods start from the perspective of model structure and design some clever interactive modules to promote the better fusion and embedding of different modalities. However, their learning objectives use conventional query-level examples as negatives while neglecting the composed query's multimodal characteristics, leading to the inadequate utilization of the training data and suboptimal construction of metric space. To this end, in this paper, we propose to improve the learning objective by constructing and mining hard negative examples from the perspective of multimodal fusion. Specifically, we compose the reference image and its logically unpaired sentences rather than paired ones to create component-level negative examples to better use data and enhance the optimization of metric space. In addition, we further propose a new sentence augmentation method to generate more indistinguishable multimodal negative examples from the element level and help the model learn a better metric space. Massive comparison experiments on four real-world datasets confirm the effectiveness of the proposed method.
Download full-text PDF |
Source |
---|---|
http://dx.doi.org/10.1109/TIP.2024.3359062 | DOI Listing |
Pathol Res Pract
September 2025
Department of Pathology, Xijing Hospital and School of Basic Medicine, Fourth Military Medical University, Xi'an, China. Electronic address:
Background: Dermal clear cell sarcoma (DCCS) is a rare malignant mesenchymal neoplasm. Owing to the overlaps in its morphological and immunophenotypic profiles with a broad spectrum of tumors exhibiting melanocytic differentiation, it is frequently misdiagnosed as other tumor entities in clinical practice. By systematically analyzing the clinicopathological characteristics, immunophenotypic features, and molecular biological properties of DCCS, this study intends to further enhance pathologists' understanding of this disease and provide a valuable reference for its accurate diagnosis.
View Article and Find Full Text PDFBraz Oral Res
September 2025
Ankara University, Faculty of Dentistry, Department of Oral and Maxillofacial Radiology, Ankara, Turkey.
The aim of this in-vitro study was to verify which field of view (FOV) in cone-beam computed tomography (CBCT) yields greater accuracy in the detection of internal root resorption (IRR) volume, in comparison to the gold standard of micro-computed tomography (micro-CT) and to a physical method. Twenty-five extractedsingle-rooted teeth were scanned by CBCT with two different FOV parameters (6x6-FOV and 10x10-FOV) and via micro-CT. The volume of dental hard tissue was measured on these images.
View Article and Find Full Text PDFBraz Oral Res
September 2025
Universidade Federal de Santa Maria -UFSM, Department of Stomatology, Santa Maria, RS, Brazil.
Advancements in digital media have driven the study and use of photographic records as a diagnostic method for carious lesions, with smartphone images being widely utilized across various health fields. This study aimed to evaluate the diagnostic accuracy of smartphone photography for detecting active caries in orthodontic patients. The sample comprised 100 individuals of both sexes, aged 11 to 46 years, who were undergoing fixed orthodontic treatment.
View Article and Find Full Text PDFPhys Rev Lett
August 2025
Southern University of Science and Technology, Department of Physics, State Key Laboratory of Quantum Functional Materials, and Guangdong Basic Research Center of Excellence for Quantum Science, Shenzhen 518055, China.
Quantum computing is expected to provide an exponential speedup in machine learning. However, optimizing the data loading process, commonly referred to as "quantum data embedding," to maximize classification performance remains a critical challenge. In this Letter, we propose a neural quantum embedding (NQE) technique based on deterministic quantum computation with one qubit (DQC1).
View Article and Find Full Text PDFPLoS Comput Biol
September 2025
Department of Integrative Structural and Computational Biology, The Scripps Research Institute, La Jolla, California, United States of America.
Biology has been transformed by the rapid development of computing and the concurrent rise of data-rich approaches such as, omics or high-resolution imaging. However, there is a persistent computational skills gap in the biomedical research workforce. Inherent limitations of classroom teaching and institutional core support highlight the need for accessible ways for researchers to explore developments in computational biology.
View Article and Find Full Text PDF