Resolution-based distillation for efficient histology image classification.

Artif Intell Med

Department of Computer Science, Dartmouth College, Hanover, NH 03755, USA; Department of Biomedical Data Science, Geisel School of Medicine at Dartmouth, Hanover, NH 03755, USA; Department of Epidemiology, Geisel School of Medicine at Dartmouth, Hanover, NH 03755, USA. Electronic address: Saeed.Hass

Published: September 2021


Category Ranking

98%

Total Visits

921

Avg Visit Duration

2 minutes

Citations

20

Article Abstract

Developing deep learning models to analyze histology images has been computationally challenging, as the massive size of the images causes excessive strain on all parts of the computing pipeline. This paper proposes a novel deep learning-based methodology for improving the computational efficiency of histology image classification. The proposed approach is robust when used with images that have reduced input resolution, and it can be trained effectively with limited labeled data. Moreover, our approach operates at either the tissue- or slide-level, removing the need for laborious patch-level labeling. Our method uses knowledge distillation to transfer knowledge from a teacher model pre-trained at high resolution to a student model trained on the same images at a considerably lower resolution. Also, to address the lack of large-scale labeled histology image datasets, we perform the knowledge distillation in a self-supervised fashion. We evaluate our approach on three distinct histology image datasets associated with celiac disease, lung adenocarcinoma, and renal cell carcinoma. Our results on these datasets demonstrate that a combination of knowledge distillation and self-supervision allows the student model to approach and, in some cases, surpass the teacher model's classification accuracy while being much more computationally efficient. Additionally, we observe an increase in student classification performance as the size of the unlabeled dataset increases, indicating that there is potential for this method to scale further with additional unlabeled data. Our model outperforms the high-resolution teacher model for celiac disease in accuracy, F1-score, precision, and recall while requiring 4 times fewer computations. For lung adenocarcinoma, our results at 1.25× magnification are within 1.5% of the results for the teacher model at 10× magnification, with a reduction in computational cost by a factor of 64. Our model on renal cell carcinoma at 1.25× magnification performs within 1% of the teacher model at 5× magnification while requiring 16 times fewer computations. Furthermore, our celiac disease outcomes benefit from additional performance scaling with the use of more unlabeled data. In the case of 0.625× magnification, using unlabeled data improves accuracy by 4% over the tissue-level baseline. Therefore, our approach can improve the feasibility of deep learning solutions for digital pathology on standard computational hardware and infrastructures.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC8449014PMC
http://dx.doi.org/10.1016/j.artmed.2021.102136DOI Listing

Publication Analysis

Top Keywords

histology image
16
teacher model
16
knowledge distillation
12
celiac disease
12
unlabeled data
12
image classification
8
deep learning
8
model
8
student model
8
image datasets
8

Similar Publications

Aim: The purpose of this study was to assess the accuracy of a customized deep learning model based on CNN and U-Net for detecting and segmenting the second mesiobuccal canal (MB2) of maxillary first molar teeth on cone beam computed tomography (CBCT) scans.

Methodology: CBCT scans of 37 patients were imported into 3D slicer software to crop and segment the canals of the mesiobuccal (MB) root of the maxillary first molar. The annotated data were divided into two groups: 80% for training and validation and 20% for testing.

View Article and Find Full Text PDF

Background: This study aimed to investigate the gender-specific associations of skeletal muscle mass and fat mass with non-alcoholic fatty liver disease (NAFLD) and NAFLD-related liver fibrosis in two population-based studies.

Methods: Analyses were based on data from the MEGA (n = 238) and the MEIA study (n = 594) conducted between 2018 and 2023 in Augsburg, Germany. Bioelectrical impedance analysis was used to evaluate relative skeletal muscle mass (rSM) and SM index (SMI) as well as relative fat mass (rFM) and FM index (FMI); furthermore, the fat-to-muscle ratio was built.

View Article and Find Full Text PDF

Aim Of The Study: To present a case series of four pediatric patients with PDPV, each with a different clinical presentation and surgical management.

Methods: We retrospectively reviewed four cases of PDPV managed at our institution. Two cases were associated with extrahepatic biliary atresia (EHBA) and discovered incidentally during surgery.

View Article and Find Full Text PDF

Background: Thyroid nodules (TNs) are frequent and often benign. Accurately differentiating between benign and malignant nodules is crucial for proper management. This research aims to use ultrasonography to examine TNs and identify possible risk factors in order to improve patient outcomes and diagnostic accuracy.

View Article and Find Full Text PDF

Background: Fetal MRI is increasingly used to investigate fetal lung pathologies, and super-resolution (SR) algorithms could be a powerful clinical tool for this assessment. Our goal was to investigate whether SR reconstructions result in an improved agreement in lung volume measurements determined by different raters, also known as inter-rater reliability.

Materials And Methods: In this single-center retrospective study, fetal lung volumes calculated from both SR reconstructions and the original images were analyzed.

View Article and Find Full Text PDF