Publications by Alina Selega

Segmentation aware probabilistic phenotyping of single-cell spatial protein expression data.

Yuju Lee, Edward L Y Chen, Darren C H Chan, Anuroopa Dinesh, Somaieh Afiuni-Zadeh, Alina Selega

Nat Commun

January 2025

Spatial protein expression technologies can map cellular content and organization by simultaneously quantifying the expression of >40 proteins at subcellular resolution within intact tissue sections and cell lines. However, the necessary segmentation of images into single cells is challenging and error-prone, easily confounding the interpretation of cellular phenotypes and cell clusters. To address these limitations, we present STARLING, a probabilistic machine learning model designed to quantify cell populations from spatial protein expression data while accounting for segmentation errors.

Beyond benchmarking and towards predictive models of dataset-specific single-cell RNA-seq pipeline performance.

Cindy Fang, Alina Selega, Kieran R Campbell

Genome Biol

June 2024

Background: The advent of single-cell RNA-sequencing (scRNA-seq) has driven significant computational methods development for all steps in the scRNA-seq data analysis pipeline, including filtering, normalization, and clustering. The large number of methods and their resulting parameter combinations has created a combinatorial set of possible pipelines to analyze scRNA-seq data, which leads to the obvious question: which is best? Several benchmarking studies compare methods but frequently find variable performance depending on dataset and pipeline characteristics. Alternatively, the large number of scRNA-seq datasets along with advances in supervised machine learning raise a tantalizing possibility: could the optimal pipeline be predicted for a given dataset?

Results: Here, we begin to answer this question by applying 288 scRNA-seq analysis pipelines to 86 datasets and quantifying pipeline success via a range of measures evaluating cluster purity and biological plausibility.

TrackSigFreq: subclonal reconstructions based on mutation signatures and allele frequencies.

Caitlin F Harrigan, Yulia Rubanova, Quaid Morris, Alina Selega

Pac Symp Biocomput

March 2021

Mutational signatures are patterns of mutation types, many of which are linked to known mutagenic processes. Signature activity represents the proportion of mutations a signature generates. In cancer, cells may gain advantageous phenotypes through mutation accumulation, causing rapid growth of that subpopulation within the tumour.

Kinetic CRAC uncovers a role for Nab3 in determining gene expression profiles during stress.

Rob van Nues, Gabriele Schweikert, Erica de Leau, Alina Selega, Andrew Langford

Nat Commun

April 2017

RNA-binding proteins play a key role in shaping gene expression profiles during stress; however, little is known about the dynamic nature of these interactions and how this influences the kinetics of gene expression. To address this, we developed kinetic cross-linking and analysis of cDNAs (χCRAC), an ultraviolet cross-linking method that enabled us to quantitatively measure the dynamics of protein-RNA interactions in vivo on a minute time-scale. Here, using χCRAC, we measure the global RNA-binding dynamics of the yeast transcription termination factor Nab3 in response to glucose starvation.

Trends and challenges in computational RNA biology.

Alina Selega, Guido Sanguinetti

Genome Biol

December 2016

A report on the Wellcome Trust Conference on Computational RNA Biology, held in Hinxton, UK, on 17-19 October 2016.

Robust statistical modeling improves sensitivity of high-throughput RNA structure probing experiments.

Alina Selega, Christel Sirocchi, Ira Iosub, Sander Granneman, Guido Sanguinetti

Nat Methods

January 2017

Structure probing coupled with high-throughput sequencing could revolutionize our understanding of the role of RNA structure in regulation of gene expression. Despite recent technological advances, intrinsic noise and high sequence coverage requirements greatly limit the applicability of these techniques. Here we describe a probabilistic modeling pipeline that accounts for biological variability and biases in the data, yielding statistically interpretable scores for the probability of nucleotide modification transcriptome wide.
