Biodivers Data J
June 2025
Background: Metabarcoding can generate large numbers of georeferenced occurrence data from bulk samples at low cost. Its integration into the practice of agricultural invertebrate biomonitoring currently lacks both standard methods and example datasets that allow the identification of potential challenges and uncertainties.
New Information: For this study, we gathered metabarcoding data of terrestrial arthropods from Malaise trap samples across sites in southern Ontario, spanning a gradient from high production, intensely farmed areas to alternative land use farms with varying amounts of natural restoration of marginal lands.
Global biodiversity gradients are generally expected to reflect greater species replacement closer to the equator. However, empirical validation of global biodiversity gradients largely relies on vertebrates, plants, and other less diverse taxa. Here we assess the temporal and spatial dynamics of global arthropod biodiversity dynamics using a beta-diversity framework.
View Article and Find Full Text PDFPhilos Trans R Soc Lond B Biol Sci
June 2024
DNA-based identification is vital for classifying biological specimens, yet methods to quantify the uncertainty of sequence-based taxonomic assignments are scarce. Challenges arise from noisy reference databases, including mislabelled entries and missing taxa. PROTAX addresses these issues with a probabilistic approach to taxonomic classification, advancing on methods that rely solely on sequence similarity.
View Article and Find Full Text PDFMethods Mol Biol
April 2024
BOLD, the Barcode of Life Data System, supports the acquisition, storage, validation, analysis, and publication of DNA barcodes, activities requiring the integration of molecular, morphological, and distributional data. Its pivotal role in curating the reference library of DNA barcodes, coupled with its data management and analysis capabilities, makes it a central resource for biodiversity science. It enables rapid, accurate identification of specimens and also reveals patterns of genetic diversity and evolutionary relationships among taxa.
View Article and Find Full Text PDFThis is a response to a preprint version of "A re-analysis of the data in Sharkey et al.'s (2021) minimalist revision reveals that BINs do not deserve names, but BOLD Systems needs a stronger commitment to open science", https://www.biorxiv.
View Article and Find Full Text PDFThe Atlantic Forest harbors 7% of global biodiversity and possesses high levels of endemism, but many of its component taxa remain unstudied. Due to the importance of tropical forests and the urgency to protect them, there is a compelling need to address this knowledge gap. To provide more information on its arthropod fauna, a Malaise trap was deployed for 12 months in a semi-degraded area of the southern Upper Paraná ecoregion of the Atlantic Forest.
View Article and Find Full Text PDFTwenty-nine species are treated, most of which have host caterpillar and food plant records, and all but one are new to science. The first host record for the agathidine genus is given. Sharkey, is reported as a hyperparasitoid of fly larvae, the first such record for the genus.
View Article and Find Full Text PDFMol Ecol Resour
February 2022
To associate specimens identified by molecular characters to other biological knowledge, we need reference sequences annotated by Linnaean taxonomy. In this study, we (1) report the creation of a comprehensive reference library of DNA barcodes for the arthropods of an entire country (Finland), (2) publish this library, and (3) deliver a new identification tool for insects and spiders, as based on this resource. The reference library contains mtDNA COI barcodes for 11,275 (43%) of 26,437 arthropod species known from Finland, including 10,811 (45%) of 23,956 insect species.
View Article and Find Full Text PDFThree new genera are described: (Proteropinae), (Rogadinae), and (Rogadinae). Keys are given for the New World genera of the following braconid subfamilies: Agathidinae, Braconinae, Cheloninae, Homolobinae, Hormiinae, Ichneutinae, Macrocentrinae, Orgilinae, Proteropinae, Rhysipolinae, and Rogadinae. In these subfamilies 416 species are described or redescribed.
View Article and Find Full Text PDFRecent technical advances combined with novel computational approaches have promised the acceleration of our understanding of the tree of life. However, when it comes to hyperdiverse and poorly known groups of invertebrates, studies are still scarce. As published phylogenies will be rarely challenged by future taxonomists, careful attention must be paid to potential analytical bias.
View Article and Find Full Text PDFAlthough the butterflies of North America have received considerable taxonomic attention, overlooked species and instances of hybridization continue to be revealed. The present study assembles a DNA barcode reference library for this fauna to identify groups whose patterns of sequence variation suggest the need for further taxonomic study. Based on 14,626 records from 814 species, DNA barcodes were obtained for 96% of the fauna.
View Article and Find Full Text PDFBackground: Rickettsia are intracellular bacteria best known as the causative agents of human and animal diseases. Although these medically important Rickettsia are often transmitted via haematophagous arthropods, other Rickettsia, such as those in the Torix group, appear to reside exclusively in invertebrates and protists with no secondary vertebrate host. Importantly, little is known about the diversity or host range of Torix group Rickettsia.
View Article and Find Full Text PDFDNA barcoding and metabarcoding are now widely used to advance species discovery and biodiversity assessments. High-throughput sequencing (HTS) has expanded the volume and scope of these analyses, but elevated error rates introduce noise into sequence records that can inflate estimates of biodiversity. Denoising -the separation of biological signal from instrument (technical) noise-of barcode and metabarcode data currently employs abundance-based methods which do not capitalize on the highly conserved structure of the cytochrome c oxidase subunit I (COI) region employed as the animal barcode.
View Article and Find Full Text PDFBiological conclusions based on DNA barcoding and metabarcoding analyses can be strongly influenced by the methods utilized for data generation and curation, leading to varying levels of success in the separation of biological variation from experimental error. The 5' region of cytochrome oxidase subunit I (COI-5P) is the most common barcode gene for animals, with conserved structure and function that allows for biologically informed error identification. Here, we present coil ( https://CRAN.
View Article and Find Full Text PDFThe reliable taxonomic identification of organisms through DNA sequence data requires a well parameterized library of curated reference sequences. However, it is estimated that just 15% of described animal species are represented in public sequence repositories. To begin to address this deficiency, we provide DNA barcodes for 1,500,003 animal specimens collected from 23 terrestrial and aquatic ecozones at sites across Canada, a nation that comprises 7% of the planet's land surface.
View Article and Find Full Text PDFMol Ecol Resour
May 2019
Although DNA metabarcoding is an attractive approach for monitoring biodiversity, it is often difficult to detect all the species present in a bulk sample. In particular, sequence recovery for a given species depends on its biomass and mitome copy number as well as the primer set employed for PCR. To examine these variables, we constructed a mock community of terrestrial arthropods comprised of 374 species.
View Article and Find Full Text PDFBMC Genomics
March 2018
Background: Although high-throughput sequencers (HTS) have largely displaced their Sanger counterparts, the short read lengths and high error rates of most platforms constrain their utility for amplicon sequencing. The present study tests the capacity of single molecule, real-time (SMRT) sequencing implemented on the SEQUEL platform to overcome these limitations, employing 658 bp amplicons of the mitochondrial cytochrome c oxidase I gene as a model system.
Results: By examining templates from more than 5000 species and 20,000 specimens, the performance of SMRT sequencing was tested with amplicons showing wide variation in GC composition and varied sequence attributes.
Participants in the 7th International Barcode of Life Conference (Kruger National Park, South Africa, 20-24 November 2017) share the latest findings in DNA barcoding research and its increasingly diversified applications. Here, we review prevailing trends synthesized from among 429 invited and contributed abstracts, which are collated in this open-access special issue of Genome. Hosted for the first time on the African continent, the 7th Conference places special emphasis on the evolutionary origins, biogeography, and conservation of African flora and fauna.
View Article and Find Full Text PDFProc Natl Acad Sci U S A
March 2017
IEEE/ACM Trans Comput Biol Bioinform
January 2019
This study presents a machine learning method that increases the number of identified bases in Sanger Sequencing. The system post-processes a KB basecalled chromatogram. It selects a recoverable subset of N-labels in the KB-called chromatogram to replace with basecalls (A,C,G,T).
View Article and Find Full Text PDFPhilos Trans R Soc Lond B Biol Sci
September 2016
Recent estimates suggest that the global insect fauna includes fewer than six million species, but this projection is very uncertain because taxonomic work has been limited on some highly diverse groups. Validation of current estimates minimally requires the investigation of all lineages that are diverse enough to have a substantial impact on the final species count. This study represents a first step in this direction; it employs DNA barcoding to evaluate patterns of species richness in 27 orders of Canadian insects.
View Article and Find Full Text PDFThe proliferation of DNA data is revolutionizing all fields of systematic research. DNA barcode sequences, now available for millions of specimens and several hundred thousand species, are increasingly used in algorithmic species delimitations. This is complicated by occasional incongruences between species and gene genealogies, as indicated by situations where conspecific individuals do not form a monophyletic cluster in a gene tree.
View Article and Find Full Text PDF