Publications by authors named "Johan van Soest"

Access to large datasets, the rise of the Internet of Things (IoT) and the ease of collecting personal data, have led to significant breakthroughs in machine learning. However, they have also raised new concerns about privacy data protection. Controversies like the Facebook-Cambridge Analytica scandal highlight unethical practices in today's digital landscape.

View Article and Find Full Text PDF

We present preliminary results of a systematic review of reporting guidelines for nine papers from clinical and non-clinical databases. Initial results show that there is no single standard that fulfills the needs of distinct stakeholders.

View Article and Find Full Text PDF

Federated Learning is becoming more widely used. However, a governance framework is needed to make sure this technology is used safely. In this work, we present a governance framework for Federated Learning project and show how often we have applied the accompanying agreements.

View Article and Find Full Text PDF

Background: The rapid advancement of deep learning in health care presents significant opportunities for automating complex medical tasks and improving clinical workflows. However, widespread adoption is impeded by data privacy concerns and the necessity for large, diverse datasets across multiple institutions. Federated learning (FL) has emerged as a viable solution, enabling collaborative artificial intelligence model development without sharing individual patient data.

View Article and Find Full Text PDF

Purpose: This study aims to develop and externally validate a clinically plausible Bayesian network structure to predict one-year erectile dysfunction in prostate cancer patients by combining expert knowledge with evidence from data using clinical and Patient-reported outcome measures (PROMs) data. In addition, compare and contrast structures that stem from PROM information and routine clinical data.

Summary Of Background: For men with localized prostate cancer, choosing the optimal treatment can be challenging since each option comes with different side effects, such as erectile dysfunction, which negatively impacts their quality of life.

View Article and Find Full Text PDF

An ever-increasing amount of data on a person's daily functioning is being collected, which holds information to revolutionize person-centered healthcare. However, the full potential of data on daily functioning cannot yet be exploited as it is mostly stored in an unstructured and inaccessible manner. The integration of these data, and thereby expedited knowledge discovery, is possible by the introduction of functionomics as a complementary 'omics' initiative, embracing the advances in data science.

View Article and Find Full Text PDF

Objective: Hospitals and healthcare providers should assess and compare the quality of care given to patients and based on this improve the care. In the Netherlands, hospitals provide data to national quality registries, which in return provide annual quality indicators. However, this process is time-consuming, resource intensive and risks patient privacy and confidentiality.

View Article and Find Full Text PDF
Article Synopsis
  • Obesity is often viewed as a lifestyle choice rather than a disease, leading to initiatives like the IMI SOPHIA project, which aims to better categorize individuals with obesity based on their disease risk and treatment responses.
  • SOPHIA faces challenges due to siloed clinical cohorts, which limit data sharing for biomarker discovery, but tackles this by using a federated database built on open-source DataSHIELD technology that integrates 16 different data sources.
  • The project allows secure analysis of combined data without revealing individual patient information, demonstrated through a proof-of-concept analysis linking BMI and blood pressure, which showed results similar to traditional meta-analyses, setting a standard for safe collaborative research.
View Article and Find Full Text PDF

Ensuring safe and healthy food is a big challenge due to the complexity of food supply chains and their vulnerability to many internal and external factors, including food fraud. Recent research has shown that Artificial Intelligence (AI) based algorithms, in particularly data driven Bayesian Network (BN) models, are very suitable as a tool to predict future food fraud and hence allowing food producers to take proper actions to avoid that such problems occur. Such models become even more powerful when data can be used from all actors in the supply chain, but data sharing is hampered by different interests, data security and data privacy.

View Article and Find Full Text PDF

LARC is managed by multimodal treatments whose intensity can be highly modulated. In this context, we need surrogate endpoints to help predict long-term outcomes and better personalize treatments. A previous study identified 2yDFS as a stronger predictor of OS than pCR in LARC patients undergoing neoadjuvant RT.

View Article and Find Full Text PDF

A large amount of personal health data that is highly valuable to the scientific community is still not accessible or requires a lengthy request process due to privacy concerns and legal restrictions. As a solution, synthetic data has been studied and proposed to be a promising alternative to this issue. However, generating realistic and privacy-preserving synthetic personal health data retains challenges such as simulating the characteristics of the patients' data that are in the minority classes, capturing the relations among variables in imbalanced data and transferring them to the synthetic data, and preserving individual patients' privacy.

View Article and Find Full Text PDF

Purpose: Artificial intelligence applications in radiation oncology have been the focus of study in the last decade. The introduction of automated and intelligent solutions for routine clinical tasks, such as treatment planning and quality assurance, has the potential to increase safety and efficiency of radiotherapy. In this work, we present a multi-institutional study across three different institutions internationally on a Bayesian network (BN)-based initial plan review assistive tool that alerts radiotherapy professionals for potential erroneous or suboptimal treatment plans.

View Article and Find Full Text PDF

This study aims to analyze the relationship between the available variables and treatment compliance in elderly cancer patients treated with radiotherapy and to establish a decision tree model to guide caregivers in their decision-making process. For this purpose, 456 patients over 74 years of age who received radiotherapy between 2005 and 2017 were included in this retrospective analysis. The outcome of interest was radiotherapy compliance, determined by whether patients completed their scheduled radiotherapy treatment (compliance means they completed their treatment and noncompliance means they did not).

View Article and Find Full Text PDF

The registration of multi-source radiation oncology data is a time-consuming and labour-intensive procedure. The standardisation of data collection offers the possibility for the acquisition of quality data for research and clinical purposes. With this study, we present an overview of the different tumour group data lists in the Dutch national proton therapy registry.

View Article and Find Full Text PDF

Background And Purpose: The model based approach involves the use of normal tissue complication models for selection of head and neck cancer patients to proton therapy. Our goal was to validate the clinical utility of the related dysphagia model using an independent patient cohort.

Materials And Methods: A dataset of 277 head and neck cancer (pharynx and larynx) patients treated with (chemo)radiotherapy between 2019 and 2021 was acquired.

View Article and Find Full Text PDF

The mining of personal data collected by multiple organizations remains challenging in the presence of technical barriers, privacy concerns, and legal and/or organizational restrictions. While a number of privacy-preserving and data mining frameworks have recently emerged, much remains to show their practical utility. In this study, we implement and utilize a secure infrastructure using data from Statistics Netherlands and the Maastricht Study to learn the association between Type 2 Diabetes Mellitus (T2DM) and healthcare expenses considering the impact of lifestyle, physical activities, and complications of T2DM.

View Article and Find Full Text PDF

Background And Purpose: Tumor recurrence, a characteristic of malignant tumors, is the biggest concern for rectal cancer survivors. The epidemiology of the disease calls for a pressing need to improve healthcare quality and patient outcomes. Prediction models such as Bayesian networks, which can probabilistically reason under uncertainty, could assist caregivers with patient management.

View Article and Find Full Text PDF

Cancer registries collect multisource data and provide valuable information that can lead to unique research opportunities. In the Netherlands, a registry and model-based approach (MBA) are used for the selection of patients that are eligible for proton therapy. We collected baseline characteristics including demographic, clinical, tumour and treatment information.

View Article and Find Full Text PDF

Answering many of the research questions in the field of cancer informatics requires incorporating and centralizing data that are hosted by different parties. Federated Learning (FL) has emerged as a new approach in which a global model can be generated without disclosing private patient data by keeping them at their original location. Flexible, user-friendly, and robust infrastructures are crucial for bringing FL solutions to the day-to-day work of the cancer epidemiologist.

View Article and Find Full Text PDF

•Demographic features are essential for a more personalize survival prediction of spinal bone metastasis (SBM).•Women have a relatively better survival chance than men before 75 years, while men have better survival after this age.•SBM survival is not dependent on the number of spinal metastases.

View Article and Find Full Text PDF

Background And Purpose: Predicting outcomes is challenging in rare cancers. Single-institutional datasets are often small and multi-institutional data sharing is complex. Distributed learning allows machine learning models to use data from multiple institutions without exchanging individual patient-level data.

View Article and Find Full Text PDF

Objective: To explore the association between preoperative physical performance with short- and long-term postoperative outcomes in patients undergoing lumbar spinal fusion (LSF).

Design: Retrospective cohort.

Setting: University hospital.

View Article and Find Full Text PDF

Given the rapid growth of artificial intelligence (AI) applications in radiotherapy and the related transformations toward the data-driven healthcare domain, this article summarizes the need and usage of the FAIR (Findable, Accessible, Interoperable, Reusable) data principles in radiotherapy. This work introduces the FAIR data concept, presents practical and relevant use cases and the future role of the different parties involved. The goal of this article is to provide guidance and potential applications of FAIR to various radiotherapy stakeholders, focusing on the central role of medical physicists.

View Article and Find Full Text PDF

The difference in incidence of oral cavity cancer (OCC) between Taiwan and the Netherlands is striking. Different risk factors and treatment expertise may result in survival differences between the two countries. However due to regulatory restrictions, patient-level analyses of combined data from the Netherlands and Taiwan are infeasible.

View Article and Find Full Text PDF