Category Ranking: 98% · Total Visits: 921 · Avg Visit Duration: 2 minutes · Citations: 20

Article Abstract

Background: Cellular imaging analysis using the traditional retrospective approach is extremely time-consuming and labor-intensive. Although AI-based solutions exist, they rely heavily on supervised learning techniques that require large, high-quality labeled datasets from the same microscope to be reliable. In addition, primary patient samples are often heterogeneous cell populations that must be stained to distinguish the cellular subsets, and the resulting imaging data is analyzed and labeled manually by experts. A method that distinguishes cell populations across imaging devices without staining or extensive manual labeling would therefore greatly aid in gaining real-time insights into cell population dynamics, especially for recognizing specific cell types and states in response to treatments.

Objective: We aim to develop an unsupervised approach that uses general vision foundation models, trained on diverse and extensive imaging datasets, to extract rich visual features for cell analysis across devices, including both stained and unstained live cells. Our method, Entropy-guided Weighted Combinational FAISS (EWC-FAISS), uses these models in inference-only mode, without task-specific retraining on the cellular data. Combining the generated embeddings in an efficient, adaptive k-nearest neighbor search allows automated, cross-device identification of cell types and states, providing a strong basis for AI-assisted cancer therapy.
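The abstract does not spell out how the entropy-guided weighting works; one plausible reading, sketched below under that assumption, scores each foundation model's embedding block by an entropy measure (here, Shannon entropy of the normalized per-dimension variance) and scales the block by its normalized weight before concatenation. The names `entropy_weight` and `combine` are hypothetical, not from the paper.

```python
import numpy as np

def entropy_weight(block: np.ndarray) -> float:
    """Shannon entropy of the normalized per-dimension variance of one
    model's embedding block -- a proxy for how informative the block is."""
    var = block.var(axis=0)
    p = var / var.sum()
    return float(-(p * np.log(p + 1e-12)).sum())

def combine(per_model: list[np.ndarray]) -> np.ndarray:
    """Scale each model's embedding block by its normalized entropy weight,
    then concatenate along the feature axis."""
    weights = np.array([entropy_weight(e) for e in per_model])
    weights /= weights.sum()
    return np.concatenate([w * e for w, e in zip(weights, per_model)], axis=1)
```

With, say, a 768-dim DINO block and a 1024-dim CLIP block for the same cells, the combined embedding is simply their weighted concatenation (768 + 1024 dimensions per cell).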

Methods: We utilized two publicly available datasets. The WBC dataset includes 14,424 images of stained white blood cell samples from patients with acute myeloid and lymphoid leukemia, as well as from patients without leukemic pathology. The LISC dataset comprises 257 images of white blood cell samples from healthy individuals. We generated four in-house datasets using the JIMT-1 breast cancer cell line as well as the Jurkat and K562 leukemic cell lines. These datasets were acquired with the Nanolive 3D Cell Explorer-fluo (CX-A) holotomographic microscope and the BioTek Lionheart FX automated brightfield microscope, and the images were manually annotated using Roboflow software. To generate the embeddings, we used and optimized a concatenated combination of SAM, DINO, ConvNeXT, SWIN, CLIP, and ViTMAE. The combined embeddings served as input for the adaptive k-nearest neighbor search, which builds an approximate Hierarchical Navigable Small World (HNSW) FAISS index. We compared EWC-FAISS to fully fine-tuned ViT classifiers with DINO and SWIN backbones, to a ConvNeXT architecture, and to NMTune, a lightweight domain-adaptation method with a frozen backbone.
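The pipeline above feeds combined embeddings into a k-nearest-neighbor search over an approximate HNSW index built with FAISS (`faiss.IndexHNSWFlat`). To keep the sketch dependency-light, exact brute-force search in NumPy stands in for the index below; the data, labels, and dimensions are illustrative, not the paper's.

```python
import numpy as np

rng = np.random.default_rng(0)
d = 256                                        # combined embedding dimension (illustrative)
xb = rng.random((1000, d), dtype=np.float32)   # reference embeddings ("index" contents)
yb = rng.integers(0, 3, size=1000)             # cell-type labels (3 classes, illustrative)

def knn_predict(xq: np.ndarray, k: int = 7) -> np.ndarray:
    """Exact k-NN with majority vote over neighbor labels.
    EWC-FAISS would query an HNSW index here instead of
    computing all pairwise distances."""
    d2 = ((xq[:, None, :] - xb[None, :, :]) ** 2).sum(-1)  # squared L2 distances
    nn = np.argsort(d2, axis=1)[:, :k]                     # k nearest per query
    return np.array([np.bincount(yb[row]).argmax() for row in nn])

preds = knn_predict(rng.random((5, d), dtype=np.float32))
```

Swapping in the approximate index only changes the search step (`index = faiss.IndexHNSWFlat(d, M); index.add(xb); index.search(xq, k)`), which is what makes the method fast to build relative to fine-tuning.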

Results: EWC-FAISS performed competitively with the baselines on the original datasets in terms of macro accuracy, the mean of the class-specific accuracies, which weights every class equally regardless of size. EWC-FAISS ranked second for the WBC dataset (macro accuracy: 97.6 ± 0.2), first for cell state classification from Nanolive (macro accuracy: 90 ± 0), and performed comparably for cell type classification from Lionheart (macro accuracy: 87 ± 0). For the transfer to out-of-distribution (OOD) datasets, which the model had not seen during training, EWC-FAISS consistently outperformed the other baselines. For the LISC dataset, EWC-FAISS achieved a macro accuracy of 78.5 ± 0.3, compared to DINO FT's 17 ± 1, SWIN FT's 44 ± 14, ConvNeXT FT's 45 ± 9, and NMTune's 52 ± 10. For cell state classification from Lionheart, EWC-FAISS had a macro accuracy of 86 ± 1, while DINO FT, SWIN FT, and ConvNeXT FT achieved 65 ± 11, 68 ± 16, and 81 ± 1, respectively, and NMTune 81 ± 7. For the transfer of cell type classification from Nanolive, EWC-FAISS attained a macro accuracy of 85 ± 0, compared to DINO FT's 24.5 ± 0.9, SWIN FT's 57 ± 6, ConvNeXT FT's 54 ± 4, and NMTune's 63 ± 4. Additionally, building EWC-FAISS after embedding generation was significantly faster than training DINO FT (∼6 minutes compared to 10 hours). Lastly, EWC-FAISS performed comparably in distinguishing cancerous cell lines from peripheral blood mononuclear cells, with a mean accuracy of 80 ± 5 compared to CellMixer's 79.7.
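Macro accuracy as reported above can be computed in a few lines; the class names below are illustrative:

```python
from collections import defaultdict

def macro_accuracy(y_true, y_pred):
    """Average of per-class accuracies, weighting every class equally
    regardless of how many samples it has."""
    correct, total = defaultdict(int), defaultdict(int)
    for t, p in zip(y_true, y_pred):
        total[t] += 1
        correct[t] += (t == p)
    return sum(correct[c] / total[c] for c in total) / len(total)

# 'live' is 2/2 correct, 'apoptotic' is 1/2 correct -> (1.0 + 0.5) / 2 = 0.75
y_true = ["live", "live", "apoptotic", "apoptotic"]
y_pred = ["live", "live", "apoptotic", "live"]
print(macro_accuracy(y_true, y_pred))  # 0.75
```

Plain (micro) accuracy on the same example would be 3/4 = 0.75 too, but with imbalanced classes the two diverge, which is why macro accuracy is the fairer metric for heterogeneous cell populations.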

Conclusion: We present a novel approach to identifying various cell lines and primary cells by their identity and state, using images acquired across imaging platforms that vary in resolution, magnification, and image quality. Despite these differences, we showed that our efficient, adaptive k-nearest neighbor search pipeline can be applied to a large image dataset containing different cell types and effectively differentiates between cells and their states, such as live, apoptotic, or necrotic. The approach has several applications, particularly in distinguishing cell populations in patient samples and in monitoring therapy.

Source
PMC: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC12213826
DOI: http://dx.doi.org/10.3389/fonc.2025.1480384

Publication Analysis

Top Keywords

macro accuracy (32), cell (19), cell populations (12), cell types (12), adaptive k-nearest (12), k-nearest neighbor (12), neighbor search (12), cell lines (12), ewc-faiss (10), accuracy (10)

Similar Publications

Artificial intelligence (AI)-based anticancer drug recommendation systems have emerged as powerful tools for precision dosing. Although existing methods have advanced in predictive accuracy, they face three significant obstacles: the "black-box" problem of unexplainable reasoning, the computational difficulty of graph-based structures, and combinatorial explosion during multistep reasoning. To tackle these issues, we introduce a novel Macro-Micro agent Drug sensitivity inference framework (MarMirDrug).

Short peptides and their structural modifications have demonstrated significant potential in therapeutic drug development. During research and development, peptide-protein interactions play a crucial role in screening highly effective peptides. Although traditional experimental methods can identify peptide-protein interactions, their time-consuming and resource-intensive nature has driven researchers to develop various computational alternatives.

Background: Homogeneous AI assessment is required for CT-T staging of gastric cancer.

Purpose: To construct an End-to-End CT-based Deep Learning (DL) model for tumor T-staging in advanced gastric cancer.

Materials And Methods: A retrospective study was conducted on 460 patients with advanced gastric cancer who underwent presurgical CT between 2011 and 2024.

Early and accurate Alzheimer's disease (AD) diagnosis is critical for effective intervention, but it is still challenging due to neurodegeneration's slow and complex progression. Recent studies in brain imaging analysis have highlighted the crucial roles of deep learning techniques in computer-assisted interventions for diagnosing brain diseases. In this study, we propose AlzFormer, a novel deep learning framework based on a space-time attention mechanism, for multiclass classification of AD, MCI, and CN individuals using structural MRI scans.

Background: The generation of intelligible speech is the single most important outcome after cleft palate repair. The development of velopharyngeal dysfunction (VPD) compromises the outcome, and the burden of VPD remains largely unknown in low- and middle-income countries (LMICs). To scale up VPD care in these areas, we continue to explore the use of artificial intelligence (AI) and machine learning (ML) for automatic detection of VPD from speech samples alone.
