98%
921
2 minutes
20
PCBs play a critical role in electronic manufacturing, and accurate defect detection is essential for ensuring product quality and reliability. However, PCB defects are often small, irregularly shaped, and embedded in complex textures, making them difficult to detect using traditional methods. In this paper, we propose CM-UNetv2, a semantic segmentation network designed to address these challenges through three architectural modules incorporating four key innovations. First, a Parallelized Patch-Aware Attention (PPA) module is incorporated into the encoder to enhance multi-scale feature representation through a multi-branch attention mechanism combining local, global, and serial convolutions. Second, we propose a Dual-Stream Skip Guidance (DSSG) module that decouples semantic refinement from spatial information preservation via two separate skip pathways, enabling finer detail retention. Third, we design a decoder module called Frequency-domain Guided Context Mamba (FGCMamba), which integrates two novel mechanisms: a Spatial Guidance Cross-Attention (SGCA) mechanism to enhance the alignment of spatial and semantic features, and a Frequency-domain Self-Attention Solver (FSAS) to compute global attention efficiently in the frequency domain, improving boundary restoration and reducing computational overhead. Experiments on the MeiweiPCB and KWSD2 datasets demonstrate that the CM-UNetv2 achieves state-of-the-art performance in small object detection, boundary accuracy, and overall segmentation robustness.
Download full-text PDF |
Source |
---|---|
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC12389856 | PMC |
http://dx.doi.org/10.3390/s25164919 | DOI Listing |
IEEE Trans Neural Netw Learn Syst
September 2025
In industrial scenarios, semantic segmentation of surface defects is vital for identifying, localizing, and delineating defects. However, new defect types constantly emerge with product iterations or process updates. Existing defect segmentation models lack incremental learning capabilities, and direct fine-tuning (FT) often leads to catastrophic forgetting.
View Article and Find Full Text PDFJ Korean Med Sci
September 2025
Department of Transdisciplinary Medicine, Seoul National University Hospital, Seoul, Korea.
Background: With the increasing incidence of skin cancer, the workload for pathologists has surged. The diagnosis of skin samples, especially for complex lesions such as malignant melanomas and melanocytic lesions, has shown higher diagnostic variability compared to other organ samples. Consequently, artificial intelligence (AI)-based diagnostic assistance programs are increasingly needed to support dermatopathologists in achieving more consistent diagnoses.
View Article and Find Full Text PDFFront Plant Sci
September 2025
College of Big Data, Yunnan Agricultural University, Kunming, China.
Introduction: Accurate identification of cherry maturity and precise detection of harvestable cherry contours are essential for the development of cherry-picking robots. However, occlusion, lighting variation, and blurriness in natural orchard environments present significant challenges for real-time semantic segmentation.
Methods: To address these issues, we propose a machine vision approach based on the PIDNet real-time semantic segmentation framework.
Med Biol Eng Comput
September 2025
Key Laboratory of Mechanism Theory and Equipment Design of Ministry of Education, Tianjin University, Tianjin, 300072, China.
Surgical instrument segmentation plays an important role in robotic autonomous surgical navigation systems as it can accurately locate surgical instruments and estimate their posture, which helps surgeons understand the position and orientation of the instruments. However, there are still some problems affecting segmentation accuracy, like insufficient attention to the edges and center of surgical instruments, insufficient usage of low-level feature details, etc. To address these issues, a lightweight network for surgical instrument segmentation in gastrointestinal (GI) endoscopy (GESur_Net) is proposed.
View Article and Find Full Text PDFIEEE Trans Pattern Anal Mach Intell
September 2025
Generalized visual grounding tasks, including Generalized Referring Expression Comprehension (GREC) and Segmentation (GRES), extend the classical visual grounding paradigm by accommodating multi-target and non-target scenarios. Specifically, GREC focuses on accurately identifying all referential objects at the coarse bounding box level, while GRES aims for achieve fine-grained pixel-level perception. However, existing approaches typically treat these tasks independently, overlooking the benefits of jointly training GREC and GRES to ensure consistent multi-granularity predictions and streamline the overall process.
View Article and Find Full Text PDF