Category Ranking

98%

Total Visits

921

Avg Visit Duration

2 minutes

Citations

20

Article Abstract

The sparsity is an attractive property that has been widely and intensively utilized in various image processing fields (e.g., robust image representation, image compression, image analysis, etc.). Its actual success owes to the exhaustive mining of the intrinsic (or homogenous) information from the whole data carrying redundant information. From the perspective of image representation, the sparsity can successfully find an underlying homogenous subspace from a collection of training data to represent a given test sample. The famous sparse representation (SR) and its variants embed the sparsity by representing the test sample using a linear combination of training samples with $L_{0}$ -norm regularization and $L_{1}$ -norm regularization. However, although these state-of-the-art methods achieve powerful and robust performances, the sparsity is not fully exploited on the image representation in the following three aspects: 1) the within-sample sparsity, 2) the between-sample sparsity, and 3) the image structural sparsity. In this paper, to make the above-mentioned multi-context sparsity properties agree and simultaneously learned in one model, we propose the concept of consensus sparsity (Con-sparsity) and correspondingly build a multi-context sparse image representation (MCSIR) framework to realize this. We theoretically prove that the consensus sparsity can be achieved by the $L_{\infty }$ -induced matrix variate based on the Bayesian inference. Extensive experiments and comparisons with the state-of-the-art methods (including deep learning) are performed to demonstrate the promising performance and property of the proposed consensus sparsity.

Download full-text PDF

Source
http://dx.doi.org/10.1109/TIP.2022.3231083DOI Listing

Publication Analysis

Top Keywords

image representation
20
consensus sparsity
16
sparsity
11
image
9
multi-context sparse
8
sparse image
8
matrix variate
8
test sample
8
-norm regularization
8
state-of-the-art methods
8

Similar Publications

Cortical networks with multiple interneuron types generate oscillatory patterns during predictive coding.

PLoS Comput Biol

September 2025

Faculty of Science, Cognitive and Systems Neuroscience Group, Swammerdam Institute for Life Sciences, University of Amsterdam, Amsterdam, the Netherlands.

Predictive coding (PC) proposes that our brains work as an inference machine, generating an internal model of the world and minimizing predictions errors (i.e., differences between external sensory evidence and internal prediction signals).

View Article and Find Full Text PDF

YOLOv11-WBD: A wavelet-bidirectional network with dilated perception for robust metal surface defect detection.

PLoS One

September 2025

Department of Smart Manufacturing, Industrial Perception and Intelligent Manufacturing Equipment Engineering Research Center of Jiangsu Province, Nanjing Vocational University of Industry Technology, Nanjing, Jiangsu, China.

In the field of quality control, metal surface defect detection is an important yet challenging task. Although YOLO models perform well in most object detection scenarios, metal surface images under operational conditions often exhibit coexisting high-frequency noise components and spectral aliasing background textures, and defect targets typically exhibit characteristics such as small scale, weak contrast, and multi-class coexistence, posing challenges for automatic defect detection systems. To address this, we introduce concepts including wavelet decomposition, cross-attention, and U-shaped dilated convolution into the YOLO framework, proposing the YOLOv11-WBD model to enhance feature representation capability and semantic mining effectiveness.

View Article and Find Full Text PDF

Science of music-based citizen science: How seeing influences hearing.

PLoS One

September 2025

Department of Engineering and School of Biomedical Engineering and Imaging Sciences, King's College London, London, United Kingdom.

Citizen science engages volunteers to contribute data to scientific projects, often through visual annotation tasks. Hearing based activities are rare and less well understood. Having high quality annotations of performed music structures is essential for reliable algorithmic analysis of recorded music with applications ranging from music information retrieval to music therapy.

View Article and Find Full Text PDF

Camouflaged Object Segmentation (COS) faces significant challenges due to the scarcity of annotated data, where meticulous pixel-level annotation is both labor-intensive and costly, primarily due to the intricate object-background boundaries. Addressing the core question, "Can COS be effectively achieved in a zero-shot manner without manual annotations for any camouflaged object?", we propose an affirmative solution. We analyze the learned attention patterns for camouflaged objects and introduce a robust zero-shot COS framework.

View Article and Find Full Text PDF

Alpha oscillations have been implicated in the maintenance of working memory representations. Notably, when memorised content is spatially lateralised, the power of posterior alpha activity exhibits corresponding lateralisation during the retention interval, consistent with the retinotopic organisation of the visual cortex. Beyond power, alpha frequency has also been linked to memory performan ce, with faster alpha rhythms associated with enhanced retention.

View Article and Find Full Text PDF