Publications by Benoit Liquet

Publications by authors named "Benoit Liquet"

Page 1 of 2

Nonstationary Spatial Process Models with Spatially Varying Covariance Kernels.

Sébastien Coube-Sisqueille , Sudipto Banerjee , Benoît Liquet

J Comput Graph Stat

July 2025

Building spatial process models that capture nonstationary behavior while delivering computationally efficient inference is challenging. Nonstationary spatially varying kernels (see, e.g.

View Article and Find Full Text PDF

Using a Supervised Principal Components Analysis for Variable Selection in High-Dimensional Datasets Reduces False Discovery Rates.

Insha Ullah , Kerrie Mengersen , Anthony N Pettitt , Benoit Liquet

Stat Med

June 2025

High-dimensional datasets, where the number of variables ' ' is much larger than the number of samples ' ', are ubiquitous and often render standard classification techniques unreliable due to overfitting. An important research problem is feature selection, which ranks candidate variables based on their relevance to the outcome variable and retains those that satisfy a chosen criterion. This article proposes a computationally efficient variable selection method based on principal component analysis tailored to a binary classification problem or case-control study.

View Article and Find Full Text PDF

Best Subset Solution Path for Linear Dimension Reduction Models Using Continuous Optimization.

Benoit Liquet , Sarat Moka , Samuel Muller

Biom J

February 2025

The selection of best variables is a challenging problem in supervised and unsupervised learning, especially in high-dimensional contexts where the number of variables is usually much larger than the number of observations. In this paper, we focus on two multivariate statistical methods: principal components analysis and partial least squares. Both approaches are popular linear dimension-reduction methods with numerous applications in several fields including in genomics, biology, environmental science, and engineering.

View Article and Find Full Text PDF

Deep learning-based hyperspectral image correction and unmixing for brain tumor surgery.

David Black , Jaidev Gill , Andrew Xie , Benoit Liquet , Antonio Di Ieva

iScience

December 2024

Article Synopsis

* Two deep learning models were developed to better correct for these issues; one uses labeled data while the other is semi-supervised, both trained on known concentrations of protoporphyrin IX (PpIX).
* Evaluations showed that these models had significantly higher correlation coefficients for PpIX concentration detection compared to classical methods, with the semi-supervised model also performing better on human data, reducing false positives by 36%.

View Article and Find Full Text PDF

Navigating Mathematical Basics: A Primer for Deep Learning in Science.

Benoit Liquet , Sarat Moka , Yoni Nazarathy

Adv Exp Med Biol

November 2024

We present a gentle introduction to elementary mathematical notation with the focus of communicating deep learning principles. This is a "math crash course" aimed at quickly enabling scientists with understanding of the building blocks used in many equations, formulas, and algorithms that describe deep learning. While this short presentation cannot replace solid mathematical knowledge that needs multiple courses and years to solidify, our aim is to allow nonmathematical readers to overcome hurdles of reading texts that also use such mathematical notation.

View Article and Find Full Text PDF

Mixture Cure Semiparametric Accelerated Failure Time Models With Partly Interval-Censored Data.

Isabel Li , Jun Ma , Benoit Liquet

Biom J

December 2024

Article Synopsis

The text discusses a method for analyzing survival data where some patients may never experience the event of interest, using a mixture cure Cox model and an accelerated failure time (AFT) model when applicable.
It introduces a penalized likelihood technique to estimate mixture cure semi-parametric AFT models, which accounts for various types of censored data and uses Gaussian basis functions for baseline hazard estimation.
The method's efficacy is validated through simulation studies and a real case study on melanoma recurrence, showcasing its advantages over existing methods like the smcure R package and making it accessible via the aftQnp R package.

View Article and Find Full Text PDF

Spectral library and method for sparse unmixing of hyperspectral images in fluorescence guided resection of brain tumors.

David Black , Benoit Liquet , Antonio Di Ieva , Walter Stummer , Eric Suero Molina

Biomed Opt Express

August 2024

Through spectral unmixing, hyperspectral imaging (HSI) in fluorescence-guided brain tumor surgery has enabled the detection and classification of tumor regions invisible to the human eye. Prior unmixing work has focused on determining a minimal set of viable fluorophore spectra known to be present in the brain and effectively reconstructing human data without overfitting. With these endmembers, non-negative least squares regression (NNLS) was commonly used to compute the abundances.

View Article and Find Full Text PDF

Investigation of common genetic risk factors between thyroid traits and breast cancer.

Elise A Lucotte , Yazdan Asgari , Pierre-Emmanuel Sugier , Mojgan Karimi , Cloé Domenighetti , Benoît Liquet

Hum Mol Genet

December 2023

Article Synopsis

- The study investigates the genetic links between breast cancer (BC) and thyroid disorders, revealing a positive correlation between BC risk and thyroxine (FT4) levels, and a negative correlation with thyroid-stimulating hormone (TSH) levels, particularly in estrogen receptor-positive BC.
- Polygenic risk scores indicate that higher FT4 and hyperthyroidism risks are associated with increased BC risk, while higher TSH risk is linked to decreased BC risk, highlighting the role of genetics in these diseases.
- The research identifies 49 shared genetic loci connected to both BC and thyroid traits and suggests that certain brain and immune system-related genes play significant roles in the relationship between these conditions.

View Article and Find Full Text PDF

GCPBayes pipeline: a tool for exploring pleiotropy at the gene level.

Yazdan Asgari , Pierre-Emmanuel Sugier , Taban Baghfalaki , Elise Lucotte , Mojgan Karimi , Benoit Liquet

NAR Genom Bioinform

September 2023

Cross-phenotype association using gene-set analysis can help to detect pleiotropic genes and inform about common mechanisms between diseases. Although there are an increasing number of statistical methods for exploring pleiotropy, there is a lack of proper pipelines to apply gene-set analysis in this context and using genome-scale data in a reasonable running time. We designed a user-friendly pipeline to perform cross-phenotype gene-set analysis between two traits using GCPBayes, a method developed by our team.

View Article and Find Full Text PDF

Understanding links between water-quality variables and nitrate concentration in freshwater streams using high frequency sensor data.

Claire Kermorvant , Benoit Liquet , Guy Litt , Kerrie Mengersen , Erin E Peterson

PLoS One

July 2023

Real-time monitoring using in-situ sensors is becoming a common approach for measuring water-quality within watersheds. High-frequency measurements produce big datasets that present opportunities to conduct new analyses for improved understanding of water-quality dynamics and more effective management of rivers and streams. Of primary importance is enhancing knowledge of the relationships between nitrate, one of the most reactive forms of inorganic nitrogen in the aquatic environment, and other water-quality variables.

View Article and Find Full Text PDF

SMOTE-CD: SMOTE for compositional data.

Teo Nguyen , Kerrie Mengersen , Damien Sous , Benoit Liquet

PLoS One

July 2023

Compositional data are a special kind of data, represented as a proportion carrying relative information. Although this type of data is widely spread, no solution exists to deal with the cases where the classes are not well balanced. After describing compositional data imbalance, this paper proposes an adaptation of the original Synthetic Minority Oversampling TEchnique (SMOTE) to deal with compositional data imbalance.

View Article and Find Full Text PDF

Leveraging pleiotropic association using sparse group variable selection in genomics data.

Matthew Sutton , Pierre-Emmanuel Sugier , Therese Truong , Benoit Liquet

BMC Med Res Methodol

January 2022

Background: Genome-wide association studies (GWAS) have identified genetic variants associated with multiple complex diseases. We can leverage this phenomenon, known as pleiotropy, to integrate multiple data sources in a joint analysis. Often integrating additional information such as gene pathway knowledge can improve statistical efficiency and biological interpretation.

View Article and Find Full Text PDF

Author Correction: Community evaluation of glycoproteomics informatics solutions reveals high-performance search strategies for serum glycopeptide analysis.

Rebeca Kawahara , Anastasia Chernykh , Kathirvel Alagesan , Marshall Bern , Weiqian Cao , Benoit Liquet

Nat Methods

January 2022

View Article and Find Full Text PDF

Reconstructing Missing and Anomalous Data Collected from High-Frequency In-Situ Sensors in Fresh Waters.

Claire Kermorvant , Benoit Liquet , Guy Litt , Jeremy B Jones , Kerrie Mengersen

Int J Environ Res Public Health

December 2021

In situ sensors that collect high-frequency data are used increasingly to monitor aquatic environments. These sensors are prone to technical errors, resulting in unrecorded observations and/or anomalous values that are subsequently removed and create gaps in time series data. We present a framework based on generalized additive and auto-regressive models to recover these missing data.

View Article and Find Full Text PDF

Community evaluation of glycoproteomics informatics solutions reveals high-performance search strategies for serum glycopeptide analysis.

Rebeca Kawahara , Anastasia Chernykh , Kathirvel Alagesan , Marshall Bern , Weiqian Cao , Benoit Liquet

Nat Methods

November 2021

Article Synopsis

* A community study, part of the HUPO Human Glycoproteomics Initiative, tested various software solutions using the same human serum datasets to see how well they perform in analyzing glycopeptides.
* The study found that while results varied among teams, some software strategies showed high performance, leading to recommendations for improving search solutions in glycoproteomics and guiding future software development.

View Article and Find Full Text PDF

An appraisal of respiratory system compliance in mechanically ventilated covid-19 patients.

Gianluigi Li Bassi , Jacky Y Suen , Heidi J Dalton , Nicole White , Sally Shrapnel , Benoit Liquet

Crit Care

June 2021

Background: Heterogeneous respiratory system static compliance (C) values and levels of hypoxemia in patients with novel coronavirus disease (COVID-19) requiring mechanical ventilation have been reported in previous small-case series or studies conducted at a national level.

Methods: We designed a retrospective observational cohort study with rapid data gathering from the international COVID-19 Critical Care Consortium study to comprehensively describe C-calculated as: tidal volume/[airway plateau pressure-positive end-expiratory pressure (PEEP)]-and its association with ventilatory management and outcomes of COVID-19 patients on mechanical ventilation (MV), admitted to intensive care units (ICU) worldwide.

Results: We studied 745 patients from 22 countries, who required admission to the ICU and MV from January 14 to December 31, 2020, and presented at least one value of C within the first seven days of MV.

View Article and Find Full Text PDF

Penalized partial least squares for pleiotropy.

Camilo Broc , Therese Truong , Benoit Liquet

BMC Bioinformatics

February 2021

Background: The increasing number of genome-wide association studies (GWAS) has revealed several loci that are associated to multiple distinct phenotypes, suggesting the existence of pleiotropic effects. Highlighting these cross-phenotype genetic associations could help to identify and understand common biological mechanisms underlying some diseases. Common approaches test the association between genetic variants and multiple traits at the SNP level.

View Article and Find Full Text PDF

Estimation of semi-Markov multi-state models: a comparison of the sojourn times and transition intensities approaches.

Azam Asanjarani , Benoit Liquet , Yoni Nazarathy

Int J Biostat

January 2021

Semi-Markov models are widely used for survival analysis and reliability analysis. In general, there are two competing parameterizations and each entails its own interpretation and inference properties. On the one hand, a semi-Markov process can be defined based on the distribution of sojourn times, often via hazard rates, together with transition probabilities of an embedded Markov chain.

View Article and Find Full Text PDF

Bayesian meta-analysis models for cross cancer genomic investigation of pleiotropic effects using group structure.

Taban Baghfalaki , Pierre-Emmanuel Sugier , Therese Truong , Anthony N Pettitt , Kerrie Mengersen , Benoit Liquet

Stat Med

March 2021

An increasing number of genome-wide association studies (GWAS) summary statistics is made available to the scientific community. Exploiting these results from multiple phenotypes would permit identification of novel pleiotropic associations. In addition, incorporating prior biological information in GWAS such as group structure information (gene or pathway) has shown some success in classical GWAS approaches.

View Article and Find Full Text PDF

Design and rationale of the COVID-19 Critical Care Consortium international, multicentre, observational study.

Gianluigi Li Bassi , Jacky Suen , Adrian Gerard Barnett , Amanda Corley , Jonathan Millar , Benoit Liquet

BMJ Open

December 2020

Introduction: There is a paucity of data that can be used to guide the management of critically ill patients with COVID-19. In response, a research and data-sharing collaborative-The COVID-19 Critical Care Consortium-has been assembled to harness the cumulative experience of intensive care units (ICUs) worldwide. The resulting observational study provides a platform to rapidly disseminate detailed data and insights crucial to improving outcomes.

View Article and Find Full Text PDF

Detecting Technical Anomalies in High-Frequency Water-Quality Data Using Artificial Neural Networks.

Javier Rodriguez-Perez , Catherine Leigh , Benoit Liquet , Claire Kermorvant , Erin Peterson

Environ Sci Technol

November 2020

Anomaly detection (AD) in high-volume environmental data requires one to tackle a series of challenges associated with the typical low frequency of anomalous events, the broad-range of possible anomaly types, and local nonstationary environmental conditions, suggesting the need for flexible statistical methods that are able to cope with unbalanced high-volume data problems. Here, we aimed to detect anomalies caused by technical errors in water-quality (turbidity and conductivity) data collected by automated in situ sensors deployed in contrasting riverine and estuarine environments. We first applied a range of artificial neural networks that differed in both learning method and hyperparameter values, then calibrated models using a Bayesian multiobjective optimization procedure, and selected and evaluated the "best" model for each water-quality variable, environment, and anomaly type.

View Article and Find Full Text PDF

Classification algorithm for high-dimensional protein markers in time-course data.

Gajendra K Vishwakarma , Atanu Bhattacharjee , Souvik Banerjee , Benoit Liquet

Stat Med

December 2020

Identification of biomarkers is an emerging area in oncology. In this article, we develop an efficient statistical procedure for the classification of protein markers according to their effect on cancer progression. A high-dimensional time-course dataset of protein markers for 80 patients motivates us for developing the model.

View Article and Find Full Text PDF

Forecasting intensifying disturbance effects on coral reefs.

Julie Vercelloni , Benoit Liquet , Emma V Kennedy , Manuel González-Rivero , M Julian Caley

Glob Chang Biol

May 2020

Anticipating future changes of an ecosystem's dynamics requires knowledge of how its key communities respond to current environmental regimes. The Great Barrier Reef (GBR) is under threat, with rapid changes of its reef-building hard coral (HC) community structure already evident across broad spatial scales. While several underlying relationships between HC and multiple disturbances have been documented, responses of other benthic communities to disturbances are not well understood.

View Article and Find Full Text PDF

CPMCGLM: an R package for p-value adjustment when looking for an optimal transformation of a single explanatory variable in generalized linear models.

Benoit Liquet , Jérémie Riou

BMC Med Res Methodol

April 2019

Background: In medical research, explanatory continuous variables are frequently transformed or converted into categorical variables. If the coding is unknown, many tests can be used to identify the "optimal" transformation. This common process, involving the problems of multiple testing, requires a correction of the significance level.

View Article and Find Full Text PDF

Sparse partial least squares with group and subgroup structure.

Matthew Sutton , Rodolphe Thiébaut , Benoît Liquet

Stat Med

October 2018

Integrative analysis of high dimensional omics datasets has been studied by many authors in recent years. By incorporating prior known relationships among the variables, these analyses have been successful in elucidating the relationships between different sets of omics data. In this article, our goal is to identify important relationships between genomic expression and cytokine data from a human immunodeficiency virus vaccine trial.

View Article and Find Full Text PDF