TruAB Discovery is an approach that integrates cellular immunology, high-throughput immunosequencing, bioinformatics, and computational biology in order to discover naturally occurring human antibodies for prophylactic or therapeutic use. We adapted our previously described pairSEQ technology to pair B cell receptor heavy and light chains of SARS-CoV-2 spike protein-binding antibodies derived from enriched antigen-specific memory B cells and bulk antibody-secreting cells. We identified approximately 60,000 productive, in-frame, paired antibody sequences, from which 2,093 antibodies were selected for functional evaluation based on abundance, isotype and patterns of somatic hypermutation.
Glycerol is a biodiesel byproduct. In the present study, glycerol was used as a co-substrate during biodegradation of dibenzothiophene (DBT) by Paraburkholderia sp. C3.
Purpose: The CLL14 study has established one-year fixed-duration treatment of venetoclax and obinutuzumab (Ven-Obi) for patients with previously untreated chronic lymphocytic leukemia. With all patients off treatment for at least three years, we report a detailed analysis of minimal residual disease (MRD) kinetics and long-term outcome of patients treated in the CLL14 study.
Patients And Methods: Patients were randomly assigned to receive six cycles of obinutuzumab with 12 cycles of venetoclax or 12 cycles of chlorambucil (Clb-Obi).
Background: The clonoSEQ® Assay (Adaptive Biotechnologies Corporation, Seattle, USA) identifies and tracks unique disease-associated immunoglobulin (Ig) sequences by next-generation sequencing of IgH, IgK, and IgL rearrangements and IgH-BCL1/2 translocations in malignant B cells. Here, we describe studies to validate the analytical performance of the assay using patient samples and cell lines.
Methods: Sensitivity and specificity were established by defining the limit of detection (LoD), limit of quantitation (LoQ) and limit of blank (LoB) in genomic DNA (gDNA) from 66 patients with multiple myeloma (MM), acute lymphoblastic leukemia (ALL), or chronic lymphocytic leukemia (CLL), and three cell lines.
Introduction: Preeclampsia is a medical condition characterized by hypertension and proteinuria during pregnancy. While preeclampsia affects approximately 5% of pregnancies, it remains without a cure. In addition, women who had preeclampsia during pregnancy have been reported to have an increased risk for cardiovascular disease later in life.
Alternative splicing (AS) has been shown to participate in prostate cancer development and progression; however, a link between AS and prostate cancer health disparities has been largely unexplored. Here we report on the cloning of a novel splice variant of that is preferentially expressed in African American (AA) prostate cancer. This novel variant () omits exon 14, comprising 123 nucleotides that encode the activation loop in the intracellular split kinase domain.
Despite its popularity, characterization of subpopulations with transcript abundance is subject to a significant amount of noise. We propose to use effective and expressed nucleotide variations (eeSNVs) from scRNA-seq as alternative features for tumor subpopulation identification. We develop a linear modeling framework, SSrGE, to link eeSNVs associated with gene expression.
Although driver genes in hepatocellular carcinoma (HCC) have been investigated in various previous genetic studies, prevalence of key driver genes among heterogeneous populations is unknown. Moreover, the phenotypic associations of these driver genes are poorly understood. This report aims to reveal the phenotypic impacts of a group of consensus driver genes in HCC.
plantGlycoMS is a set of tools, implemented in R, which is used to assess and validate glycopeptide spectrum matches (gPSMs). Validity of gPSMs is based on characteristic fragmentation patterns of glycopeptides (gPSMvalidator), adherence of the glycan moiety to the known N-glycan biosynthesis pathway in plants (pGlycoFilter), and elution of the glycopeptide within the observed retention time window of other glycopeptides sharing the same peptide backbone (rt.Restrict).
PLoS Comput Biol
April 2018
Artificial neural networks (ANN) are computing architectures with many interconnections of simple neural-inspired computing elements, and have been applied to biomedical fields such as imaging analysis and diagnosis. We have developed a new ANN framework called Cox-nnet to predict patient prognosis from high throughput transcriptomics data. In 10 TCGA RNA-Seq data sets, Cox-nnet achieves the same or better predictive accuracy compared to other methods, including Cox-proportional hazards regression (with LASSO, ridge, and minimax concave penalty), Random Forests Survival and CoxBoost.
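The excerpt above does not describe how Cox-nnet is trained, so the following is only an illustrative sketch of the objective that Cox proportional hazards models generally optimize: the negative log partial likelihood. It assumes some model (a neural network or a linear predictor) has already produced one risk score per patient. The function name and the toy data are hypothetical and are not taken from the Cox-nnet software.

```python
import math


def cox_neg_log_partial_likelihood(risk_scores, times, events):
    """Negative log partial likelihood of a Cox model (no special tie handling).

    risk_scores: model output per patient (higher = higher predicted hazard)
    times:       follow-up time per patient
    events:      1 if the event (e.g. death) was observed, 0 if censored
    """
    total = 0.0
    for score_i, time_i, event_i in zip(risk_scores, times, events):
        if not event_i:
            continue  # censored patients contribute only through risk sets
        # Risk set: every patient still under observation at time_i.
        at_risk = [s for s, t in zip(risk_scores, times) if t >= time_i]
        # Log-sum-exp with the maximum subtracted for numerical stability.
        max_s = max(at_risk)
        log_sum = max_s + math.log(sum(math.exp(s - max_s) for s in at_risk))
        total -= score_i - log_sum
    return total


if __name__ == "__main__":
    # Hypothetical toy cohort: three patients, all with observed events.
    times = [1.0, 2.0, 3.0]
    events = [1, 1, 1]
    print(cox_neg_log_partial_likelihood([2.0, 1.0, 0.0], times, events))
    print(cox_neg_log_partial_likelihood([0.0, 1.0, 2.0], times, events))
```

Risk scores that rank patients in the order they actually experienced the event give a lower loss than scores that rank them in reverse, which is the signal a gradient-based learner would follow.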
Deep learning describes a class of machine learning algorithms that are capable of combining raw inputs into layers of intermediate features. These algorithms have recently shown impressive results across a variety of domains. Biology and medicine are data-rich disciplines, but the data are complex and often ill-understood.
Pac Symp Biocomput
August 2018
Long intergenic non-coding RNAs have been shown to play important roles in cancer. However, because lincRNAs are a relatively new class of RNAs compared to protein-coding mRNAs, the mutational landscape of lincRNAs has not been as extensively studied. Here we characterize expressed somatic nucleotide variants within lincRNAs using 12 cancer RNA-Seq datasets in TCGA.
Clinical challenges exist in reducing prostate cancer (PCa) disparities. The RNA splicing landscape of PCa across racial populations has not been fully explored as a potential molecular mechanism contributing to race-related tumour aggressiveness. Here, we identify novel genome-wide, race-specific RNA splicing events as critical drivers of PCa aggressiveness and therapeutic resistance in African American (AA) men.
Single-cell RNA-Sequencing (scRNA-Seq) is a fast-evolving technology that enables the understanding of biological processes at an unprecedentedly high resolution. However, well-suited bioinformatics tools to analyze the data generated from this new technology are still lacking. Here we investigate the performance of the non-negative matrix factorization (NMF) method in analyzing a wide variety of scRNA-Seq datasets, ranging from mouse hematopoietic stem cells to human glioblastoma data.
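To make the NMF idea concrete, here is a minimal sketch using the classic multiplicative update rules for the squared-error objective. It is not the pipeline evaluated in the study, whose details are not given in this excerpt. The toy genes-by-cells matrix and all function names are hypothetical, and the code uses plain Python lists so that it runs without third-party libraries.

```python
import random


def matmul(a, b):
    """Multiply two matrices stored as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]


def transpose(m):
    return [list(col) for col in zip(*m)]


def frobenius_error(v, w, h):
    """Frobenius norm of V - W H."""
    wh = matmul(w, h)
    return sum((x - y) ** 2 for rv, rw in zip(v, wh) for x, y in zip(rv, rw)) ** 0.5


def nmf(v, rank, n_iter=200, seed=0, eps=1e-9):
    """Factor a non-negative matrix V (genes x cells) into W (genes x rank) and H (rank x cells)."""
    rng = random.Random(seed)
    n_rows, n_cols = len(v), len(v[0])
    w = [[rng.random() + eps for _ in range(rank)] for _ in range(n_rows)]
    h = [[rng.random() + eps for _ in range(n_cols)] for _ in range(rank)]
    for _ in range(n_iter):
        # H <- H * (W^T V) / (W^T W H), element-wise
        wt = transpose(w)
        num = matmul(wt, v)
        den = matmul(matmul(wt, w), h)
        h = [[h[i][j] * num[i][j] / (den[i][j] + eps) for j in range(n_cols)]
             for i in range(rank)]
        # W <- W * (V H^T) / (W H H^T), element-wise
        ht = transpose(h)
        num = matmul(v, ht)
        den = matmul(w, matmul(h, ht))
        w = [[w[i][k] * num[i][k] / (den[i][k] + eps) for k in range(rank)]
             for i in range(n_rows)]
    return w, h


if __name__ == "__main__":
    # Hypothetical expression matrix: rows are genes, columns are cells.
    v = [[5, 5, 0, 0], [4, 4, 0, 0], [0, 0, 3, 3], [0, 0, 6, 6]]
    w, h = nmf(v, rank=2)
    # Assign each cell to the factor with the largest coefficient in H.
    labels = [max(range(2), key=lambda k: h[k][j]) for j in range(len(v[0]))]
    print(labels, frobenius_error(v, w, h))
```

Because both factors stay non-negative, each column of H can be read as a soft membership of a cell in each factor, which is one common way NMF output is turned into subpopulation labels.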
To identify which long non-coding RNAs (lncRNAs) are involved in non-small cell lung cancer (NSCLC), we analyzed microarray data on gene expression and methylation. Gene expression chip and HumanMethylation450BeadChip were used to interrogate genome-wide expression and methylation in tumor samples. Differential expression and methylation were analyzed through comparing tumors with adjacent non-tumor tissues.
The emerging single-cell RNA-Seq (scRNA-Seq) technology holds the promise to revolutionize our understanding of diseases and associated biological processes at an unprecedented resolution. It opens the door to reveal intercellular heterogeneity and has been applied in a variety of settings, ranging from characterizing cancer cell subpopulations to elucidating tumor resistance mechanisms. Parallel to improving experimental protocols to deal with technological issues, deriving new analytical methods to interpret the complexity in scRNA-Seq data is just as challenging.
Long intergenic noncoding RNAs (lincRNAs) are a relatively new class of non-coding RNAs that have potential as cancer biomarkers. To seek a panel of lincRNAs as pan-cancer biomarkers, we have analyzed transcriptomes from over 3300 cancer samples with clinical information. Compared to mRNA, lincRNAs exhibit significantly higher tissue specificities that are then diminished in cancer tissues.
Plant secretory (Class III) peroxidases are redox enzymes that rely on N-glycosylation for full enzyme activity and stability. Peroxidases from palm tree leaves comprise the most stable and active plant peroxidases characterized to date. Herein, site-specific glycosylation and microheterogeneity of windmill palm tree (Trachycarpus fortunei) peroxidase are reported.
BioData Min
December 2015
Long intergenic non-coding RNAs (lincRNAs) represent one of the most mysterious RNA species encoded by the human genome. Thanks to next generation sequencing (NGS) technology and its applications, we have recently witnessed a surge in non-coding RNA research, including lincRNA research. Here, we summarize the recent advancement in genomics studies of lincRNAs.
Inflammation is a beneficial host response to infection but can contribute to inflammatory disease if unregulated. The Th17 lineage of T helper (Th) cells can cause severe human inflammatory diseases. These cells exhibit both instability (they can cease to express their signature cytokine, IL-17A) and plasticity (they can start expressing cytokines typical of other lineages) upon in vitro re-stimulation.
Background: Epigenetic alterations are known to correlate with changes in gene expression among various diseases including cancers. However, quantitative models that accurately predict the up- or down-regulation of gene expression are currently lacking.
Methods: A new machine learning-based method of gene expression prediction is developed in the context of lung cancer.
Clin Epigenetics
March 2015
Background: Preeclampsia is one of the leading causes of fetal and maternal morbidity and mortality worldwide. Preterm babies of mothers with early onset preeclampsia (EOPE) are at higher risk for various diseases later in life, including cardiovascular diseases. We hypothesized that genome-wide epigenetic alterations occur in cord blood DNAs in association with EOPE and conducted a case-control study to compare the genome-scale methylome differences in cord blood DNAs between 12 EOPE-associated and 8 normal births.
Genome Biol
October 2015
MiRNAs play important roles in many diseases including cancers. However, computational prediction of miRNA target genes is challenging and the accuracies of existing methods remain poor. We report mirMark, a new machine learning-based method of miRNA target prediction at the site and UTR levels.
It is crucial for researchers to optimize RNA-seq experimental designs for differential expression detection. Currently, the field lacks general methods to estimate power and sample size for RNA-Seq in complex experimental designs, under the assumption of the negative binomial distribution. We simulate RNA-Seq count data based on parameters estimated from six widely different public data sets (including cell line comparison, tissue comparison, and cancer data sets) and calculate the statistical power in paired and unpaired sample experiments.
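The simulation-based approach described above can be illustrated with a small sketch. It is not the authors' procedure: it covers only the unpaired case for a single gene, draws negative binomial counts as a gamma-Poisson mixture, and uses a Welch t-statistic on log2 counts with a normal-approximation cutoff of 1.96 as a stand-in for a proper differential expression test. All function names and parameter values are hypothetical.

```python
import math
import random


def sample_poisson(lam, rng):
    """Knuth's multiplication method; adequate for modest means (well below ~700)."""
    threshold = math.exp(-lam)
    k, p = 0, rng.random()
    while p > threshold:
        k += 1
        p *= rng.random()
    return k


def sample_negative_binomial(mean, dispersion, rng):
    """Gamma-Poisson mixture with variance = mean + dispersion * mean**2."""
    shape = 1.0 / dispersion
    rate = rng.gammavariate(shape, mean * dispersion)  # gamma draw with expectation `mean`
    return sample_poisson(rate, rng)


def welch_t(a, b):
    """Welch t-statistic for two samples; returns 0.0 if both samples have zero variance."""
    mean_a, mean_b = sum(a) / len(a), sum(b) / len(b)
    var_a = sum((x - mean_a) ** 2 for x in a) / (len(a) - 1)
    var_b = sum((x - mean_b) ** 2 for x in b) / (len(b) - 1)
    se = math.sqrt(var_a / len(a) + var_b / len(b))
    return 0.0 if se == 0 else (mean_a - mean_b) / se


def estimate_power(mean, fold_change, dispersion, n_per_group,
                   n_sims=500, critical_t=1.96, seed=0):
    """Fraction of simulated unpaired experiments in which the gene is called significant."""
    rng = random.Random(seed)
    rejections = 0
    for _ in range(n_sims):
        control = [math.log2(sample_negative_binomial(mean, dispersion, rng) + 1)
                   for _ in range(n_per_group)]
        treated = [math.log2(sample_negative_binomial(mean * fold_change, dispersion, rng) + 1)
                   for _ in range(n_per_group)]
        if abs(welch_t(control, treated)) > critical_t:
            rejections += 1
    return rejections / n_sims


if __name__ == "__main__":
    for n in (3, 5, 10):
        print(n, estimate_power(mean=20, fold_change=2, dispersion=0.1, n_per_group=n))
```

Running the estimate across a grid of sample sizes, fold changes, and dispersions gives a rough power curve. The normal cutoff inflates the false positive rate at small sample sizes, so a real analysis would substitute an appropriate test.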