The transformer architecture in deep learning has revolutionized protein sequence analysis. Recent advancements in protein language models have paved the way for significant progress across various domains, including protein function and structure prediction, multiple sequence alignments and mutation effect prediction. A protein language model is commonly trained on individual proteins, ignoring the interdependencies between sequences within a genome.
Soybean bradyrhizobia (Bradyrhizobium spp.) are symbiotic root-nodulating bacteria that fix atmospheric nitrogen for the host plant. The University of Delaware Bradyrhizobium Culture Collection (UDBCC; 353 accessions) was created to study the diversity and ecology of soybean bradyrhizobia.
BMC Bioinformatics
April 2024
Background: The annotation of protein sequences in public databases has long posed a challenge in molecular biology. This issue is particularly acute for viral proteins, which demonstrate limited homology to known proteins when using alignment, k-mer, or profile-based homology search approaches. A novel methodology employing Large Language Models (LLMs) addresses this methodological challenge by annotating protein sequences based on embeddings.
ISME Commun
October 2023
Through infection and lysis of their coexisting bacterial hosts, viruses impact the biogeochemical cycles sustaining globally significant pelagic oceanic ecosystems. Currently, little is known of the ecological interactions between lytic viruses and their bacterial hosts underlying these biogeochemical impacts at ecosystem scales. This study focused on populations of lytic viruses carrying the B12-dependent Class II monomeric ribonucleotide reductase (RNR) gene, ribonucleotide-triphosphate reductase (Class II RTPR), documenting seasonal changes in pelagic virioplankton and bacterioplankton using amplicon sequences of Class II RTPR and the 16S rRNA gene, respectively.
The ability of Bradyrhizobium spp. to nodulate and fix atmospheric nitrogen in soybean root nodules is critical to meeting humanity's nutritional needs. The intricacies of soybean bradyrhizobia-plant interactions have been studied extensively; however, bradyrhizobial ecology as influenced by phages has received somewhat less attention, even though these interactions may significantly impact soybean yield.
Viruses are the most abundant and diverse biological entities on the planet and constitute a significant proportion of Earth's genetic diversity. Most of this diversity is not represented by isolated viral-host systems and has only been observed through sequencing of viral metagenomes (viromes) from environmental samples. Viromes provide snapshots of viral genetic potential, and a wealth of information on viral community ecology.
Interactions between marine viruses and microbes are a critical part of the oceanic carbon cycle. The impacts of virus-host interactions range from short-term disruptions in the mobility of microbial biomass carbon to higher trophic levels through cell lysis (i.e.
Nat Rev Microbiol
February 2022
Understanding how phenotypes emerge from genotypes is a foundational goal in biology. As challenging as this task is when considering cellular life, it is further complicated in the case of viruses. During replication, a virus as a discrete entity (the virion) disappears and manifests itself as a metabolic amalgam between the virus and the host (the virocell).
Viral infection exerts selection pressure on marine microbes, as virus-induced cell lysis causes 20 to 50% of cell mortality, resulting in fluxes of biomass into oceanic dissolved organic matter. Archaeal and bacterial populations can defend against viral infection using the clustered regularly interspaced short palindromic repeat (CRISPR)-associated (Cas) system, which relies on specific matching between a spacer sequence and a viral gene. If a CRISPR spacer match to any gene within a viral genome is equally effective in preventing lysis, no viral genes should be preferentially matched by CRISPR spacers.
Shotgun metagenomics, which allows for broad sampling of viral diversity, has uncovered genes that are widely distributed among virioplankton populations and show linkages to important biological features of unknown viruses. Over 25% of known dsDNA phage carry the DNA polymerase I (polA) gene, making it one of the most widely distributed phage genes. Because of its pivotal role in DNA replication, this enzyme is linked to phage lifecycle characteristics.
The fluoropolymer manufacturing industry is moving to alternative polymerization processing aid technologies with more favorable toxicological and environmental profiles as part of a commitment to curtail the use of long-chain perfluoroalkyl acids (PFAAs). To facilitate the environmental product stewardship assessment and premanufacture notification (PMN) process for a candidate replacement chemical, we conducted acute and chronic aquatic toxicity tests to evaluate the toxicity of ammonium 2,3,3,3-tetrafluoro-2-(heptafluoropropoxy)-propanoate (C6HF11O3.H3N) or the acid form of the substance to the cladoceran, Daphnia magna, the green alga, Pseudokirchneriella subcapitata, and a number of freshwater fish species including the rainbow trout, Oncorhynchus mykiss. In addition, testing with the common carp, Cyprinus carpio, was conducted to determine the bioconcentration potential of the acid form of the compound.
Measured rates of intrinsic clearance determined using cryopreserved trout hepatocytes can be extrapolated to the whole animal as a means of improving modeled bioaccumulation predictions for fish. To date, however, the intra- and interlaboratory reliability of this procedure has not been determined. In the present study, three laboratories determined in vitro intrinsic clearance of six reference compounds (benzo[a]pyrene, 4-nonylphenol, di-tert-butyl phenol, fenthion, methoxychlor and o-terphenyl) by conducting substrate depletion experiments with cryopreserved trout hepatocytes from a single source.
Short-term 48-, 72- and 96-h aquatic toxicity tests were conducted to evaluate the acute toxicity of eight fluorinated acids to the cladoceran, Daphnia magna, the green alga, Pseudokirchneriella subcapitata, and the rainbow trout, Oncorhynchus mykiss or the fathead minnow, Pimephales promelas. The eight fluorinated acids studied were tridecafluorohexyl ethanoic acid (6:2 FTCA), heptadecafluorooctyl ethanoic acid (8:2 FTCA), 2H-dodecafluoro-2-octenoic acid (6:2 FTUCA), 2H-hexadecafluoro-2-decenoic acid (8:2 FTUCA), 2H,2H,3H,3H-undecafluoro octanoic acid (5:3 acid), 2H,2H,3H,3H-pentadecafluoro decanoic acid (7:3 acid), n-perfluoropentanoic acid (PFPeA) and n-perfluorodecanoic acid (PFDA). The results of the acute toxicity tests conducted during this study suggest that the polyfluorinated acids, 8:2 FTCA, 8:2 FTUCA, 6:2 FTCA, 6:2 FTUCA, 7:3 acid and 5:3 acid, and the perfluorinated acids PFPeA and PFDA, are generally of low to medium concern based on evaluation of their acute freshwater toxicity (EC/LC50s typically between 1 and >100 mg L⁻¹) using the USEPA TSCA aquatic toxicity evaluation paradigm.