Cancer-related Keywords in 2023: Insights from Text Mining of a Major Consumer Portal.

Healthc Inform Res

Cancer Knowledge & Information Center, National Cancer Control Institute, National Cancer Center, Goyang, Korea.

Published: October 2024


Category Ranking

98%

Total Visits

921

Avg Visit Duration

2 minutes

Citations

20

Article Abstract

Objectives: With the growing importance of monitoring cancer patients' internet usage, there is an increasing need for technology that expands access to relevant information through text mining. This study analyzed internet articles from portal sites in 2023 to identify trends in the information available to cancer patients and to derive meaningful insights.

Methods: This study analyzed 19,578 news articles published on Naver, a major Korean portal site, from January 1, 2023, to December 31, 2023. Natural language processing, text mining, network analysis, and word cloud analysis were employed. The search term "am" (Korean for "cancer") was used to identify keywords related to cancer.

Results: In 2023, an average of 1,631 cancer-related articles were published monthly, with a peak of 1,946 in September and a low of 1,371 in February. A total of 132,456 keywords were extracted, with "cure" (2,218 occurrences), "lung cancer" (1,652), and "breast cancer" (1,235) being the most frequent. Term frequency-inverse document frequency analysis ranked "struggle" (1064.172) as the most significant keyword, followed by "lung cancer" (839.988) and "breast cancer" (744.840). Network analysis revealed four distinct clusters focusing on treatment, celebrity-related issues, major cancer types, and cancer-causing factors.

Conclusions: The analysis of cancer-related keywords in 2023 indicates that news articles often prioritize gossip over essential information. These findings provide foundational data for future policy directions and strategies to address misinformation. This study underscores the importance of understanding the nature of cancer-related information consumed by the public and offers insights to guide official policies and healthcare practices.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11570664PMC
http://dx.doi.org/10.4258/hir.2024.30.4.398DOI Listing

Publication Analysis

Top Keywords

text mining
12
cancer-related keywords
8
keywords 2023
8
study analyzed
8
news articles
8
articles published
8
network analysis
8
"lung cancer"
8
"breast cancer"
8
0
6

Similar Publications

Background: Soil salinization represents a critical global challenge to agricultural productivity, profoundly impacting crop yields and threatening food security. Plant salt-responsive is complex and dynamic, making it challenging to fully elucidate salt tolerance mechanism and leading to gaps in our understanding of how plants adapt to and mitigate salt stress.

Results: Here, we conduct high-resolution time-series transcriptomic and metabolomic profiling of the extremely salt-tolerant maize inbred line, HLZY, and the salt-sensitive elite line, JI853.

View Article and Find Full Text PDF

Harmonizing mouse anatomy terminology: a common language?

Mamm Genome

September 2025

Department of Animal Health and Anatomy, Center for Animal Biotechnology and Gene Therapy, Universitat Autònoma de Barcelona, Travessera Dels Turons, 08193, Cerdanyola del Vallès, Barcelona, Spain.

The mouse remains the principal animal model for investigating human diseases due, among other reasons, to its anatomical similarities to humans. Despite its widespread use, the assumption that mouse anatomy is a fully established field with standardized and universally accepted terminology is misleading. Many phenotypic anatomical annotations do not refer to the authority or origin of the terminology used, while others inappropriately adopt outdated or human-centric nomenclature.

View Article and Find Full Text PDF

Homemade explosives (HMEs) present significant challenges to forensic investigations due to their diverse chemical compositions and varying construction methods. Identifying the origin of these explosives is crucial for linking evidence across crime scenes. To address this challenge, this study employs an advanced data mining technique to enhance the forensic analysis of a unique dataset consisting of 344 HME samples collected from 129 real cases in China over an eight-year period (2015-2022).

View Article and Find Full Text PDF

Background: On September 27, 2024, Rwanda reported an outbreak of Marburg virus disease (MVD), after a cluster of cases of viral hemorrhagic fever was detected at two urban hospitals.

Methods: We report key aspects of the epidemiology, clinical manifestations, and treatment of MVD during this outbreak, as well as the overall response to the outbreak. We performed a retrospective epidemiologic and clinical analysis of data compiled across all pillars of the outbreak response and a case-series analysis to characterize clinical features, disease progression, and outcomes among patients who received supportive care and investigational therapeutic agents.

View Article and Find Full Text PDF

Predicting Unplanned Readmission Risk in Patients With Cirrhosis: Complication-Aware Dynamic Classifier Selection Approach.

JMIR Med Inform

September 2025

College of Medical Informatics, Chongqing Medical University, 1 Yixueyuan Road, Yuzhong District, Chongqing, 400016, China, 86 13500303273.

Background: Cirrhosis is a leading cause of noncancer deaths in gastrointestinal diseases, resulting in high hospitalization and readmission rates. Early identification of high-risk patients is vital for proactive interventions and improving health care outcomes. However, the quality and integrity of real-world electronic health records (EHRs) limit their utility in developing risk assessment tools.

View Article and Find Full Text PDF