Publications by authors named "Praphulla Bhawsar"

Despite promising results in using deep learning to infer genetic features from histological whole-slide images (WSIs), no prior studies have specifically applied these methods to lung adenocarcinomas from subjects who have never smoked tobacco (NS-LUAD) - a molecularly and histologically distinct subset of lung cancer. Existing models have focused on LUAD from predominantly smoker populations, with limited molecular scope and variable performance. Here, we propose a customized deep convolutional neural network based on ResNet50 architecture, optimized for multilabel classification for NS-LUAD, enabling simultaneous prediction of 16 molecular alterations from a single H&E-stained WSI.

View Article and Find Full Text PDF

Importance: Inflammation impacts cancer risk and tumor biological processes, yet studies linking it to social and environmental risk factors are lacking.

Objective: To investigate the association of neighborhood deprivation and air pollution with breast adipose inflammation as well as the association between crown-like structures of the breast (CLS-B) and DNA methylation in Black and White women.

Design, Setting, And Participants: This cross-sectional study analyzed women with and without breast cancer participating in the National Cancer Institute-Maryland Breast Cancer Study, most of whom were recruited between January 1, 1993, and December 1, 2003, from the University of Maryland Medical Center and surrounding hospitals in the Baltimore, Maryland, area.

View Article and Find Full Text PDF

Whole slide images (WSI), obtained by high-resolution digital scanning of microscope slides at multiple scales, are the cornerstone of modern Digital Pathology. However, they represent a particular challenge to artificial intelligence (AI)-based/AI-mediated analysis because pathology labeling is typically done at slide-level, instead of tile-level. It is not just that medical diagnostics is recorded at the specimen level, the detection of oncogene mutation is also experimentally obtained, and recorded by initiatives like The Cancer Genome Atlas (TCGA), at the slide level.

View Article and Find Full Text PDF

Whole Slide Images (WSI), obtained by high-resolution digital scanning of microscope slides at multiple scales, are the cornerstone of modern Digital Pathology. However, they represent a particular challenge to AI-based/AI-mediated analysis because pathology labeling is typically done at slide-level, instead of tile-level. It is not just that medical diagnostics is recorded at the specimen level, the detection of oncogene mutation is also experimentally obtained, and recorded by initiatives like The Cancer Genome Atlas (TCGA), at the slide level.

View Article and Find Full Text PDF

: The proliferation of genetic testing and consumer genomics represents a logistic challenge to the personalized use of GWAS data in VCF format. Specifically, the challenge of retrieving target genetic variation from large compressed files filled with unrelated variation information. Compounding the data traversal challenge, privacy-sensitive VCF files are typically managed as large stand-alone single files (no companion index file) composed of variable-sized compressed chunks, hosted in consumer-facing environments with no native support for hosted execution.

View Article and Find Full Text PDF

Motivation: Currently, the Polygenic Score (PGS) Catalog curates over 400 publications on over 500 traits corresponding to over 3000 polygenic risk scores (PRSs). To assess the feasibility of privately calculating the underlying multivariate relative risk for individuals with consumer genomics data, we developed an in-browserPRS calculator for genomic data that does not circulate any data or engage in any computation outside of the user's personal device.

Results: A prototype personal risk score calculator, created for research purposes, was developed to demonstrate how the PGS Catalog can be privately and readily applied to readily available direct-to-consumer genetic testing services, such as 23andMe.

View Article and Find Full Text PDF

Epidemiological studies face two important challenges: the need to ingest ever more complex data types, and mounting concerns about participant privacy and data governance. These two challenges are compounded by the expectation that data infrastructure will eventually need to facilitate cross-registration of participants by multiple epidemiological studies. The portable web-service epiDonate was developed using the serverless model known as FaaS (Function-as-a-Service).

View Article and Find Full Text PDF

Background: Artificial intelligence (AI) is fast becoming the tool of choice for scalable and reliable analysis of medical images. However, constraints in sharing medical data outside the institutional or geographical space, as well as difficulties in getting AI models and modeling platforms to work across different environments, have led to a "reproducibility crisis" in digital medicine.

Methods: This study details the implementation of a web platform that can be used to mitigate these challenges by orchestrating a digital pathology AI pipeline, from raw data to model inference, entirely on the local machine.

View Article and Find Full Text PDF
Article Synopsis
  • - A study involving high-coverage whole-genome sequencing of 232 lung cancer cases in never smokers (LCINS) identified three distinct subtypes based on genetic alterations, primarily involving copy number changes.
  • - The dominant subtype, termed "piano," is characterized by unique genetic features like UBA1 mutations and low mutational burden, indicating stem cell-like traits and a slower tumor growth rate compared to typical lung cancer in smokers.
  • - Notably, no significant tobacco-related mutations were found, even in patients exposed to secondhand smoke, and certain genetic changes were linked to increased mortality, suggesting potential for tailored treatment strategies for LCINS.
View Article and Find Full Text PDF

Motivation: Mortality Tracker is an in-browser application for data wrangling, analysis, dissemination and visualization of public time series of mortality in the United States. It was developed in response to requests by epidemiologists for portable real time assessment of the effect of COVID-19 on other causes of death and all-cause mortality. This is performed by comparing 2020 real time values with observations from the same week in the previous 5 years, and by enabling the extraction of temporal snapshots of mortality series that facilitate modeling the interdependence between its causes.

View Article and Find Full Text PDF