Publications by Joshua C Stein

Publications by authors named "Joshua C Stein"

Page 1 of 1

Gene disruption by structural mutations drives selection in US rice breeding over the last century.

Justin N Vaughn , Walid Korani , Joshua C Stein , Jeremy D Edwards , Daniel G Peterson

PLoS Genet

March 2021

Article Synopsis

The study focuses on understanding the genetic factors behind plant vigor, particularly in rice, and highlights the complexity of mapping this trait due to many genes with small effects and their interactions.
Researchers performed a long-read genomic assembly of a tropical japonica rice variety, Carolina Gold, to identify significant structural mutations and understand how these changes affect crop performance.
The findings indicate a history of tandem duplications and transposable element activity that contributed to genomic size variations, with structural mutations affecting gene exons being selected against in rice breeding programs over the last century.

View Article and Find Full Text PDF

Effect of sequence depth and length in long-read assembly of the maize inbred NC358.

Shujun Ou , Jianing Liu , Kapeel M Chougule , Arkarachai Fungtammasan , Arun S Seetharam , Joshua C Stein

Nat Commun

May 2020

Article Synopsis

Advances in long-read data and scaffolding technologies have led to improved reference-quality genome assemblies, particularly for complex genomes like maize.
Critical assessments of sequence depth and read length are essential for effective resource allocation when generating these assemblies.
The study highlights that higher depth and longer subread lengths significantly enhance assembly quality, with high-quality optical maps further improving the contiguity of fragmented assemblies.

View Article and Find Full Text PDF

Publisher Correction: Genomes of 13 domesticated and wild rice relatives highlight genetic conservation, turnover and innovation across the genus Oryza.

Joshua C Stein , Yeisoo Yu , Dario Copetti , Derrick J Zwickl , Li Zhang

Nat Genet

November 2018

This article was not made open access when initially published online, which was corrected before print publication. In addition, ORCID links were missing for 12 authors and have been added to the HTML and PDF versions of the article.

View Article and Find Full Text PDF

Improved RNA-seq Workflows Using CyVerse Cyberinfrastructure.

Kapeel M Chougule , Liya Wang , Joshua C Stein , Xiaofei Wang , Upendra Kumar Devisetty

Curr Protoc Bioinformatics

September 2018

RNA-seq is a vital method for understanding gene structure and expression patterns. Typical RNA-seq analysis protocols use sequencing reads of length 50 to 150 nucleotides for alignment to the reference genome and assembly of transcripts. The resultant transcripts are quantified and used for differential expression and visualization.

View Article and Find Full Text PDF

The maize W22 genome provides a foundation for functional genomics and transposon biology.

Nathan M Springer , Sarah N Anderson , Carson M Andorf , Kevin R Ahern , Fang Bai , Joshua C Stein

Nat Genet

September 2018

The maize W22 inbred has served as a platform for maize genetics since the mid twentieth century. To streamline maize genome analyses, we have sequenced and de novo assembled a W22 reference genome using short-read sequencing technologies. We show that significant structural heterogeneity exists in comparison to the B73 reference genome at multiple scales, from transposon composition and copy number variation to single-nucleotide polymorphisms.

View Article and Find Full Text PDF

Extensive intraspecific gene order and gene structural variations between Mo17 and other maize genomes.

Silong Sun , Yingsi Zhou , Jian Chen , Junpeng Shi , Haiming Zhao , Joshua C Stein

Nat Genet

September 2018

Maize is an important crop with a high level of genome diversity and heterosis. The genome sequence of a typical female line, B73, was previously released. Here, we report a de novo genome assembly of a corresponding male representative line, Mo17.

View Article and Find Full Text PDF

Genomes of 13 domesticated and wild rice relatives highlight genetic conservation, turnover and innovation across the genus Oryza.

Joshua C Stein , Yeisoo Yu , Dario Copetti , Derrick J Zwickl , Li Zhang

Nat Genet

February 2018

Article Synopsis

The genus Oryza serves as an important model for studying molecular evolution, revealing rapid species diversification alongside the emergence of new genetic elements and minimal large-scale chromosomal changes.
The research clarifies the complex phylogenetic history of Oryza, particularly within the 'AA' subclade of domesticated species, highlighting cases of introgression and the presence of disease resistance genes.
This study significantly advances rice research by releasing a comprehensive long-read genome assembly of IR 8 'Miracle Rice,' which played a crucial role in addressing famine during the Green Revolution in Asia.

View Article and Find Full Text PDF

Gramene 2018: unifying comparative genomics and pathway resources for plant research.

Marcela K Tello-Ruiz , Sushma Naithani , Joshua C Stein , Parul Gupta , Michael Campbell , Lincoln D Stein

Nucleic Acids Res

January 2018

Gramene (http://www.gramene.org) is a knowledgebase for comparative functional analysis in major crops and model plant species.

View Article and Find Full Text PDF

Improved maize reference genome with single-molecule technologies.

Yinping Jiao , Paul Peluso , Jinghua Shi , Tiffany Liang , Michelle C Stitzer , Joshua C Stein

Nature

June 2017

Complete and accurate reference genomes and annotations provide fundamental tools for characterization of genetic and functional variation. These resources facilitate the determination of biological processes and support translation of research findings into improved and sustainable agricultural technologies. Many reference genomes for crop plants have been generated over the past decade, but these genomes are often fragmented and missing complex repeat regions.

View Article and Find Full Text PDF

Unveiling the complexity of the maize transcriptome by single-molecule long-read sequencing.

Bo Wang , Elizabeth Tseng , Michael Regulski , Tyson A Clark , Ting Hon , Joshua C Stein

Nat Commun

June 2016

Zea mays is an important genetic model for elucidating transcriptional networks. Uncertainties about the complete structure of mRNA transcripts limit the progress of research in this system. Here, using single-molecule sequencing technology, we produce 111,151 transcripts from 6 tissues capturing ∼70% of the genes annotated in maize RefGen_v3 genome.

View Article and Find Full Text PDF

Whole genome de novo assemblies of three divergent strains of rice, Oryza sativa, document novel gene space of aus and indica.

Michael C Schatz , Lyza G Maron , Joshua C Stein , Alejandro Hernandez Wences , James Gurtowski

Genome Biol

July 2015

Background: The use of high throughput genome-sequencing technologies has uncovered a large extent of structural variation in eukaryotic genomes that makes important contributions to genomic diversity and phenotypic variation. When the genomes of different strains of a given organism are compared, whole genome resequencing data are typically aligned to an established reference sequence. However, when the reference differs in significant structural ways from the individuals under study, the analysis is often incomplete or inaccurate.

View Article and Find Full Text PDF

Automated update, revision, and quality control of the maize genome annotations using MAKER-P improves the B73 RefGen_v3 gene models and identifies new genes.

MeiYee Law , Kevin L Childs , Michael S Campbell , Joshua C Stein , Andrew J Olson

Plant Physiol

January 2015

Article Synopsis

The complexity of plant genomes presents challenges for creating accurate gene annotations, which has prompted the development of the MAKER-P genome annotation engine specifically for plants.
In less than 3 hours, MAKER-P was used to enhance the maize B73 RefGen_v3 annotation, discovering 4,466 new protein-coding genes and revising existing gene models.
The study also highlights the potential for using MAKER-P to train new genome annotations for other grass species, showcasing its effectiveness for managing and improving plant genome annotations.

View Article and Find Full Text PDF

Disentangling methodological and biological sources of gene tree discordance on Oryza (Poaceae) chromosome 3.

Derrick J Zwickl , Joshua C Stein , Rod A Wing , Doreen Ware , Michael J Sanderson

Syst Biol

September 2014

We describe new methods for characterizing gene tree discordance in phylogenomic data sets, which screen for deviations from neutral expectations, summarize variation in statistical support among gene trees, and allow comparison of the patterns of discordance induced by various analysis choices. Using an exceptionally complete set of genome sequences for the short arm of chromosome 3 in Oryza (rice) species, we applied these methods to identify the causes and consequences of differing patterns of discordance in the sets of gene trees inferred using a panel of 20 distinct analysis pipelines. We found that discordance patterns were strongly affected by aspects of data selection, alignment, and alignment masking.

View Article and Find Full Text PDF

MAKER-P: a tool kit for the rapid creation, management, and quality control of plant genome annotations.

Michael S Campbell , MeiYee Law , Carson Holt , Joshua C Stein , Gaurav D Moghe

Plant Physiol

February 2014

We have optimized and extended the widely used annotation engine MAKER in order to better support plant genome annotation efforts. New features include better parallelization for large repeat-rich plant genomes, noncoding RNA annotation capabilities, and support for pseudogene identification. We have benchmarked the resulting software tool kit, MAKER-P, using the Arabidopsis (Arabidopsis thaliana) and maize (Zea mays) genomes.

View Article and Find Full Text PDF

A 4-gigabase physical map unlocks the structure and evolution of the complex genome of Aegilops tauschii, the wheat D-genome progenitor.

Ming-Cheng Luo , Yong Q Gu , Frank M You , Karin R Deal , Yaqin Ma , Joshua C Stein

Proc Natl Acad Sci U S A

May 2013

The current limitations in genome sequencing technology require the construction of physical maps for high-quality draft sequences of large plant genomes, such as that of Aegilops tauschii, the wheat D-genome progenitor. To construct a physical map of the Ae. tauschii genome, we fingerprinted 461,706 bacterial artificial chromosome clones, assembled contigs, designed a 10K Ae.

View Article and Find Full Text PDF

Gramene database in 2010: updates and extensions.

Ken Youens-Clark , Ed Buckler , Terry Casstevens , Charles Chen , Genevieve Declerck , Joshua C Stein

Nucleic Acids Res

January 2011

Now in its 10th year, the Gramene database (http://www.gramene.org) has grown from its primary focus on rice, the first fully-sequenced grass genome, to become a resource for major model and crop plants including Arabidopsis, Brachypodium, maize, sorghum, poplar and grape in addition to several species of rice.

View Article and Find Full Text PDF

Pervasive gene content variation and copy number variation in maize and its undomesticated progenitor.

Ruth A Swanson-Wagner , Steven R Eichten , Sunita Kumari , Peter Tiffin , Joshua C Stein

Genome Res

December 2010

Individuals of the same species are generally thought to have very similar genomes. However, there is growing evidence that structural variation in the form of copy number variation (CNV) and presence-absence variation (PAV) can lead to variation in the genome content of individuals within a species. Array comparative genomic hybridization (CGH) was used to compare gene content and copy number variation among 19 diverse maize inbreds and 14 genotypes of the wild ancestor of maize, teosinte.

View Article and Find Full Text PDF

The B73 maize genome: complexity, diversity, and dynamics.

Patrick S Schnable , Doreen Ware , Robert S Fulton , Joshua C Stein , Fusheng Wei

Science

November 2009

Article Synopsis

An improved draft sequence of the maize genome has been produced, revealing over 32,000 predicted genes, with almost all placed on reference chromosomes.
The genome consists of 85% transposable elements, which influence gene composition and positioning, including the dynamics of centromeres.
The study also highlights the roles of methylation-poor regions, transposon insertions, and gene losses in the evolution of maize, providing insights for future research on its domestication and agricultural enhancements.

View Article and Find Full Text PDF

A genome-wide characterization of microRNA genes in maize.

Lifang Zhang , Jer-Ming Chia , Sunita Kumari , Joshua C Stein , Zhijie Liu

PLoS Genet

November 2009

MicroRNAs (miRNAs) are small, non-coding RNAs that play essential roles in plant growth, development, and stress response. We conducted a genome-wide survey of maize miRNA genes, characterizing their structure, expression, and evolution. Computational approaches based on homology and secondary structure modeling identified 150 high-confidence genes within 26 miRNA families.

View Article and Find Full Text PDF

Detailed analysis of a contiguous 22-Mb region of the maize genome.

Fusheng Wei , Joshua C Stein , Chengzhi Liang , Jianwei Zhang , Robert S Fulton

PLoS Genet

November 2009

Most of our understanding of plant genome structure and evolution has come from the careful annotation of small (e.g., 100 kb) sequenced genomic regions or from automated annotation of complete genome sequences.

View Article and Find Full Text PDF

A new method to compute K-mer frequencies and its application to annotate large repetitive plant genomes.

Stefan Kurtz , Apurva Narechania , Joshua C Stein , Doreen Ware

BMC Genomics

October 2008

Background: The challenges of accurate gene prediction and enumeration are further aggravated in large genomes that contain highly repetitive transposable elements (TEs). Yet TEs play a substantial role in genome evolution and are themselves an important subject of study. Repeat annotation, based on counting occurrences of k-mers, has been previously used to distinguish TEs from low-copy genic regions; but currently available software solutions are impractical due to high memory requirements or specialization for specific user-tasks.

View Article and Find Full Text PDF

Engineering vitamin E content: from Arabidopsis mutant to soy oil.

Alison L Van Eenennaam , Kim Lincoln , Timothy P Durrett , Henry E Valentin , Christine K Shewmaker , Joshua C Stein

Plant Cell

December 2003

We report the identification and biotechnological utility of a plant gene encoding the tocopherol (vitamin E) biosynthetic enzyme 2-methyl-6-phytylbenzoquinol methyltransferase. This gene was identified by map-based cloning of the Arabidopsis mutation vitamin E pathway gene3-1 (vte3-1), which causes increased accumulation of delta-tocopherol and decreased gamma-tocopherol in the seed. Enzyme assays of recombinant protein supported the hypothesis that At-VTE3 encodes a 2-methyl-6-phytylbenzoquinol methyltransferase.

View Article and Find Full Text PDF