Construction and application of nasopharyngeal carcinoma-specific big data platform based on electronic health records.

Am J Otolaryngol

Department of Radiation Oncology, Nanfang Hospital, Southern Medical University, Guangzhou, China; Guangdong Province Key Laboratory of Molecular Tumor Pathology, Guangzhou, China.

Published: May 2024


Article Abstract

Objective: To establish a nasopharyngeal carcinoma-specific big data platform based on electronic health records (EHRs) to provide data support for real-world study of nasopharyngeal carcinoma.

Methods: A multidisciplinary expert team was established for this project. Based on industry standards and practical feasibility, the team designed a nasopharyngeal carcinoma data element standard comprising 14 modules and 640 fields. Data from patients diagnosed with nasopharyngeal carcinoma who visited Nanfang Hospital after 1999 were extracted from 15 EHR systems and then cleaned, structured, and standardized using information technologies such as machine learning and natural language processing. In addition, measures such as quality control and data encryption were taken to ensure data quality and protect patient privacy. At the platform application level, 10 functional modules were designed according to the needs of nasopharyngeal carcinoma research.
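The standardization and privacy steps described in the Methods can be illustrated with a minimal Python sketch. It is not the platform's actual pipeline: the field names (`patient_id`, `sex`, `diagnosis_date`), the value mapping, the list of date formats, and the choice of HMAC-SHA256 for pseudonymization are all assumptions made for illustration.

```python
import hashlib
import hmac
from datetime import datetime

# Hypothetical mapping from source-system values to one standardized vocabulary.
SEX_MAP = {"m": "male", "male": "male", "男": "male",
           "f": "female", "female": "female", "女": "female"}

# Hypothetical date formats that different source systems might use.
DATE_FORMATS = ("%Y-%m-%d", "%Y/%m/%d", "%d.%m.%Y")


def pseudonymize(patient_id: str, secret_key: bytes) -> str:
    """Replace a patient identifier with a keyed hash (HMAC-SHA256)."""
    return hmac.new(secret_key, patient_id.encode("utf-8"), hashlib.sha256).hexdigest()


def standardize_date(raw):
    """Return an ISO 8601 date string, or None if no known format matches."""
    if not raw:
        return None
    for fmt in DATE_FORMATS:
        try:
            return datetime.strptime(raw.strip(), fmt).date().isoformat()
        except ValueError:
            continue
    return None


def standardize_record(raw: dict, secret_key: bytes) -> dict:
    """Map one raw source record onto the (assumed) standardized fields."""
    sex = SEX_MAP.get(str(raw.get("sex", "")).strip().lower())
    return {
        "pseudo_id": pseudonymize(raw["patient_id"], secret_key),
        "sex": sex,  # None when the source value is not in the mapping
        "diagnosis_date": standardize_date(raw.get("diagnosis_date")),
    }


# Usage with made-up values (not real patient data).
example = standardize_record(
    {"patient_id": "A001", "sex": " M ", "diagnosis_date": "2005/03/09"},
    secret_key=b"demo-key",
)
```

Unmapped values become `None` rather than being guessed, so a later quality-control step can count and review them.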

Results: As of 1 October 2022, the big data platform included 11,617 patients, of whom 8228 (70.83%) were male and 3389 (29.17%) were female, with a median age of 48 years (interquartile range, 40 years). The data in the platform were validated to have a high level of completeness and accuracy, especially for key variables such as social demographics, laboratory tests, and vital signs. Currently, six projects involving risk factors, early diagnosis, treatment efficacy, and prevention of treatment-related toxic reactions have been conducted on the platform.
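As a rough illustration of the figures above, the following Python sketch recomputes the reported sex proportions from the stated counts and shows one common way to measure field-level completeness. The completeness definition (share of records with a non-missing value) and the sample records are assumptions; the abstract does not state how the platform's validation was performed.

```python
def field_completeness(records, fields):
    """Fraction of records with a non-missing value for each field."""
    total = len(records)
    if total == 0:
        return {field: 0.0 for field in fields}
    return {
        field: sum(1 for r in records if r.get(field) not in (None, "")) / total
        for field in fields
    }


# Counts reported in the abstract.
male, female = 8228, 3389
total = male + female                        # 11617
male_pct = round(100 * male / total, 2)      # 70.83
female_pct = round(100 * female / total, 2)  # 29.17

# Made-up records, used only to demonstrate the completeness calculation.
sample = [
    {"sex": "male", "age": 48},
    {"sex": "", "age": 50},
    {"sex": "female", "age": None},
    {"sex": "male"},
]
completeness = field_completeness(sample, ["sex", "age"])
# completeness == {"sex": 0.75, "age": 0.5}
```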

Conclusions: We have established a high-quality nasopharyngeal carcinoma (NPC)-specific big data platform by integrating heterogeneous data from multiple EHR sources. The platform provides an effective tool and strong data support for real-world studies of nasopharyngeal carcinoma, helping to improve research efficiency, reduce costs, and improve the quality of research results. We expect to promote multicenter nasopharyngeal carcinoma data sharing in the future to facilitate the generation of high-quality real-world evidence. This article may serve as a reference for other general hospitals seeking to establish a big data platform for nasopharyngeal carcinoma.

Source
http://dx.doi.org/10.1016/j.amjoto.2023.104204

Similar Publications

ALYREF stabilizes CREPT mRNA to accelerate the development of nasopharyngeal carcinoma through dependence on m5C modification.

Exp Cell Res

September 2025

The Department of Hematology, The First Affiliated Hospital of Hainan Medical University, No.31 Longhua Road, Haikou City, Hainan Province, 570000, P.R. China.

Background: Nasopharyngeal carcinoma (NPC) is a highly malignant tumor. CREPT expression is abnormally elevated in multiple cancers. However, the role and regulatory mechanism of CREPT in NPC remain unknown.

Background: Nasopharyngeal carcinoma (NPC) pathogenesis is multi-factorial, involving synergistic interactions among genetic susceptibility, Epstein-Barr virus (EBV) infection, and environmental exposures. Notably, specific multi-generational families exhibit NPC incidence substantially exceeding both sporadic cases and general genetic susceptibility cohorts, demonstrating Mendelian inheritance patterns. This supports the hypothesis that high penetrance pathogenic variants dominate disease initiation and progression in familial NPC.

The Hippo pathway and its transcription co-activator YAP play a critical role in the regulation of cell proliferation, apoptosis and the control of organ size. In the past several years, YAP has been found to be expressed in various human cancers, however, its expression in Nasopharyngeal Carcinoma (NPC) remains unstudied. In this report, we found that YAP was overexpressed in human NPC tissues, and its expression was also significantly higher in five NPC cell lines when compared with the nasopharyngeal epithelial cell line NP69 (P < 0.

Accurate tumor mutation burden (TMB) quantification is critical for immunotherapy stratification, yet remains challenging due to variability across sequencing platforms, tumor heterogeneity, and variant calling pipelines. Here, we introduce TMBquant, an explainable AI-powered caller designed to optimize TMB estimation through dynamic feature selection, ensemble learning, and automated strategy adaptation. Built upon the H2O AutoML framework, TMBquant integrates variant features, minimizes classification errors, and enhances both accuracy and stability across diverse datasets.

Background: C-C motif chemokine ligand 3 (CCL3) is a crucial chemokine that plays a fundamental role in the immune microenvironment and is closely linked to the development of various cancers. Despite its importance, there is limited research regarding the expression and function of CCL3 in nasopharyngeal carcinoma (NPC). Therefore, this study seeks to examine the expression of CCL3 and assess its clinical significance in NPC using bioinformatics analysis and experiments.
