OCCMNet: Occlusion-Aware Class Characteristic Mining Network for multi-class artifacts detection in endoscopy.

Med Biol Eng Comput

Department of Computer Science and Technology, Anhui University, 111 Jiulong Road, Hefei, 230601, Anhui, China.

Published: August 2025


Category Ranking

98%

Total Visits

921

Avg Visit Duration

2 minutes

Citations

20

Article Abstract

Multi-class endoscope artifacts detection is crucial for eliminating interference caused by artifacts during clinical examinations and reducing the rate of misdiagnosis and missed diagnoses by physicians. However, this task remains challenging such as data imbalance, similarity, and occlusion among artifacts. To overcome these challenges, we propose an Occlusion-Aware Class Characteristic Mining Network (OCCMNet) to detect eight classes of artifacts in endoscope simultaneously. The OCCMNet comprises the following: (1) A Dual-Branch Class Rebalancing Module (DCRM) rebalances the impact of various classes by fully exploiting the benefits of two complementary data distributions, sampling and detecting from the majority and minority classes respectively. (2) A Class Discrimination Enhancement Module (CDEM) effectively enhances the discrepancy of inter-class by enhance important information and introduce nuance information nonlinearly. (3) A Global Occlusion-Aware Module (GOAM) infers the obscured part of the artifacts by capturing the global information to initially identify the obscured artifacts and combining local details to sense the overall structure of the artifacts. Our OCCMNet has been validated on a public dataset (EndoCV2020). Compared to the latest methods in both medical and computer vision detection, our approach demonstrated 3.5-6.5% improvement in mAP50. The results proved the superiority of our OCCMNet in multi-class endoscopic artifact detection and demonstrated its great potential in reducing clinical interference.

Download full-text PDF

Source
http://dx.doi.org/10.1007/s11517-025-03332-yDOI Listing

Publication Analysis

Top Keywords

occlusion-aware class
8
class characteristic
8
characteristic mining
8
mining network
8
artifacts
8
artifacts detection
8
obscured artifacts
8
occmnet
5
occmnet occlusion-aware
4
class
4

Similar Publications

OCCMNet: Occlusion-Aware Class Characteristic Mining Network for multi-class artifacts detection in endoscopy.

Med Biol Eng Comput

August 2025

Department of Computer Science and Technology, Anhui University, 111 Jiulong Road, Hefei, 230601, Anhui, China.

Multi-class endoscope artifacts detection is crucial for eliminating interference caused by artifacts during clinical examinations and reducing the rate of misdiagnosis and missed diagnoses by physicians. However, this task remains challenging such as data imbalance, similarity, and occlusion among artifacts. To overcome these challenges, we propose an Occlusion-Aware Class Characteristic Mining Network (OCCMNet) to detect eight classes of artifacts in endoscope simultaneously.

View Article and Find Full Text PDF

Person re-identification (ReID) typically encounters varying degrees of occlusion in real-world scenarios. While previous methods have addressed this using handcrafted partitions or external cues, they often compromise semantic information or increase network complexity. In this paper, we propose a new method from a novel perspective, termed as OAT.

View Article and Find Full Text PDF

We introduce here a large tracking database that offers an unprecedentedly wide coverage of common moving objects in the wild, called GOT-10k. Specifically, GOT-10k is built upon the backbone of WordNet structure [1] and it populates the majority of over 560 classes of moving objects and 87 motion patterns, magnitudes wider than the most recent similar-scale counterparts [19], [20], [23], [26]. By releasing the large high-diversity database, we aim to provide a unified training and evaluation platform for the development of class-agnostic, generic purposed short-term trackers.

View Article and Find Full Text PDF

In this paper we present Latent-Class Hough Forests, a method for object detection and 6 DoF pose estimation in heavily cluttered and occluded scenarios. We adapt a state of the art template matching feature into a scale-invariant patch descriptor and integrate it into a regression forest using a novel template-based split function. We train with positive samples only and we treat class distributions at the leaf nodes as latent variables.

View Article and Find Full Text PDF