Publications by Valentin Antonescu

Author Correction: Assembly of a pan-genome from deep sequencing of 910 humans of African descent.

Rachel M Sherman, Juliet Forman, Valentin Antonescu, Daniela Puiu, Michelle Daya

Nat Genet

February 2019

In the version of this article initially published, the statement "there are no pan-genomes for any other animal or plant species" was incorrect. The statement has been corrected to "there are no reported pan-genomes for any other animal species, to our knowledge." We thank David Edwards for bringing this error to our attention.

Assembly of a pan-genome from deep sequencing of 910 humans of African descent.

Rachel M Sherman, Juliet Forman, Valentin Antonescu, Daniela Puiu, Michelle Daya

Nat Genet

January 2019

We used a deeply sequenced dataset of 910 individuals, all of African descent, to construct a set of DNA sequences that is present in these individuals but missing from the reference human genome. We aligned 1.19 trillion reads from the 910 individuals to the reference genome (GRCh38), collected all reads that failed to align, and assembled these reads into contiguous sequences (contigs).
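
The "collect all reads that failed to align" step can be illustrated with a minimal sketch. It assumes the aligner output is available as SAM text, where bit 0x4 of the FLAG field marks an unmapped read; the function name and the example records are hypothetical, and this listing does not describe the tooling the authors actually used.

```python
from typing import Iterable, Iterator, Tuple

UNMAPPED_FLAG = 0x4  # SAM FLAG bit meaning "segment unmapped"


def unaligned_reads(sam_lines: Iterable[str]) -> Iterator[Tuple[str, str]]:
    """Yield (read_name, sequence) for SAM records flagged as unmapped."""
    for line in sam_lines:
        # Skip blank lines and header lines, which start with "@".
        if not line.strip() or line.startswith("@"):
            continue
        fields = line.rstrip("\n").split("\t")
        if len(fields) < 11:  # a SAM record has 11 mandatory fields
            continue
        if int(fields[1]) & UNMAPPED_FLAG:
            yield fields[0], fields[9]


# Hypothetical records: r1 is aligned, r2 is unmapped.
example = [
    "@HD\tVN:1.6",
    "r1\t0\tchr1\t100\t60\t4M\t*\t0\t0\tACGT\tIIII",
    "r2\t4\t*\t0\t0\t*\t*\t0\t0\tTTGA\tIIII",
]
print(list(unaligned_reads(example)))
```

The reads collected this way would then be passed to an assembler to build contigs, which is outside the scope of this sketch.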

Scaling read aligners to hundreds of threads on general-purpose processors.

Ben Langmead, Christopher Wilks, Valentin Antonescu, Rone Charles

Bioinformatics

February 2019

Motivation: General-purpose processors can now contain many dozens of processor cores and support hundreds of simultaneous threads of execution. To make best use of these threads, genomics software must contend with new and subtle computer architecture issues. We discuss some of these and propose methods for improving thread scaling in tools that analyze each read independently, such as read aligners.

Germline Mutations in DNA Repair Genes in Lung Adenocarcinoma.

Erin M Parry, Dustin L Gable, Susan E Stanley, Sara E Khalil, Valentin Antonescu

J Thorac Oncol

November 2017

Introduction: Although lung cancer is generally thought to be environmentally provoked, anecdotal familial clustering has been reported, suggesting that there may be genetic susceptibility factors. We systematically tested whether germline mutations in eight candidate genes may be risk factors for lung adenocarcinoma.

Methods: We studied lung adenocarcinoma cases for which germline sequence data had been generated as part of The Cancer Genome Atlas project but had not been previously analyzed.

The novel fusion transcript NR5A2-KLHL29FT is generated by an insertion at the KLHL29 locus.

Zhenguo Sun, Xiquan Ke, Steven L Salzberg, Daehwan Kim, Valentin Antonescu

Cancer

May 2017

Background: Novel fusion transcripts (FTs) caused by chromosomal rearrangement are common factors in the development of cancers. In the current study, the authors used massively parallel RNA sequencing to identify new FTs in colon cancers.

Methods: RNA sequencing (RNA-Seq) and TopHat-Fusion were used to identify new FTs in colon cancers.

POPcorn: An Online Resource Providing Access to Distributed and Diverse Maize Project Data.

Ethalinda K S Cannon, Scott M Birkett, Bremen L Braun, Sateesh Kodavali, Douglas M Jennewein, Valentin Antonescu, Corina Antonescu

Int J Plant Genomics

August 2012

The purpose of the online resource presented here, POPcorn (Project Portal for corn), is to enhance accessibility of maize genetic and genomic resources for plant biologists. Currently, many online locations are difficult to find, some are best searched independently, and individual project websites often degrade over time, sometimes disappearing entirely. The POPcorn site makes available (1) a centralized, web-accessible resource to search and browse descriptions of ongoing maize genomics projects, (2) a single, stand-alone tool that uses web services and minimal data warehousing to search for sequence matches in online resources of diverse offsite projects, and (3) a set of tools that enables researchers to migrate their data to the long-term model organism database for maize genetic and genomic information: MaizeGDB.

Using the DFCI gene index databases for biological discovery.

Corina Antonescu, Valentin Antonescu, Razvan Sultana, John Quackenbush

Curr Protoc Bioinformatics

March 2010

The DFCI Gene Index Web pages provide access to analyses of ESTs and gene sequences for nearly 114 species, as well as a number of resources derived from these. Each species-specific database is presented using a common format with a home page. A variety of methods exist that allow users to search each species-specific database.

TIGR Gene Indices clustering tools (TGICL): a software system for fast clustering of large EST datasets.

Geo Pertea, Xiaoqiu Huang, Feng Liang, Valentin Antonescu, Razvan Sultana

Bioinformatics

March 2003

TGICL is a pipeline for analysis of large Expressed Sequence Tags (EST) and mRNA databases in which the sequences are first clustered based on pairwise sequence similarity, and then assembled by individual clusters (optionally with quality values) to produce longer, more complete consensus sequences. The system can run on multi-CPU architectures including SMP and PVM.
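
The "cluster first by pairwise similarity, then assemble each cluster" idea can be sketched roughly as follows. This is not TGICL's implementation: the k-mer Jaccard score, the threshold, and all function names are assumptions chosen for illustration, and the grouping uses a simple union-find so that any two sequences linked by a chain of similar pairs land in the same cluster.

```python
from itertools import combinations
from typing import List, Set


def kmers(seq: str, k: int = 4) -> Set[str]:
    """Return the set of all length-k substrings of seq."""
    return {seq[i:i + k] for i in range(len(seq) - k + 1)}


def similarity(a: str, b: str, k: int = 4) -> float:
    """Jaccard similarity of k-mer sets (an assumed stand-in for alignment scores)."""
    ka, kb = kmers(a, k), kmers(b, k)
    if not ka or not kb:
        return 0.0
    return len(ka & kb) / len(ka | kb)


def cluster(seqs: List[str], threshold: float = 0.3, k: int = 4) -> List[List[int]]:
    """Group sequence indices whose pairwise similarity meets the threshold."""
    parent = list(range(len(seqs)))

    def find(i: int) -> int:
        # Follow parent links to the root, halving the path as we go.
        while parent[i] != i:
            parent[i] = parent[parent[i]]
            i = parent[i]
        return i

    for i, j in combinations(range(len(seqs)), 2):
        if similarity(seqs[i], seqs[j], k) >= threshold:
            parent[find(i)] = find(j)  # merge the two clusters

    groups = {}
    for i in range(len(seqs)):
        groups.setdefault(find(i), []).append(i)
    return sorted(groups.values())


# Hypothetical sequences: the first two overlap heavily, the third does not.
seqs = ["ACGTACGTAA", "CGTACGTAAC", "TTTTGGGGCC"]
print(cluster(seqs))
```

Each resulting cluster would then be handed to an assembler separately, which keeps the assembly step small and lets clusters be processed in parallel.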

Cross-referencing eukaryotic genomes: TIGR Orthologous Gene Alignments (TOGA).

Yuandan Lee, Razvan Sultana, Geo Pertea, Jennifer Cho, Svetlana Karamycheva, Valentin Antonescu

Genome Res

March 2002

Comparative genomics promises to rapidly accelerate the identification and functional classification of biologically important human genes. We developed the TIGR Orthologous Gene Alignment (TOGA;