98%
921
2 minutes
20
Protein-protein interaction (PPI) networks are a fundamental resource for modeling cellular and molecular function, and a large and sophisticated toolbox has been developed to leverage their structure and topological organization to predict the functional roles of under-studied genes, proteins, and pathways. However, the overwhelming majority of experimentally-determined interactions from which such networks are constructed come from a small number of well-studied model organisms. Indeed, most species lack even a single experimentally-determined interaction in these databases, much less a network to enable the analysis of cellular function, and methods for computational PPI prediction are too noisy to apply directly. We introduce PHILHARMONIC, a novel computational approach that couples deep learning network inference with robust unsupervised spectral clustering algorithms to uncover functional relationships and high-level organization in non-model organisms. Our clustering approach allows us to de-noise the predicted network, producing highly informative functional modules. We also develop a novel algorithm called ReCIPE, which aims to reconnect disconnected clusters, increasing functional enrichment and biological interpretability. We perform remote homology-based functional annotation by leveraging hmmscan and GODomainMiner to assign initial functions to proteins at large evolutionary distances. Our clusters enable us to newly assign functions to uncharacterized proteins through "function by association." We demonstrate the ability of PHILHARMONIC to recover clusters with significant functional coherence in the reef-building coral , its algal symbiont , and the well-annotated fruit fly . We perform a deeper analysis of the network, where we show that PHILHARMONIC clusters correlate strongly with gene co-expression and investigate several clusters that participate in temperature regulation in the coral, including the first putative functional annotation of several previously uncharacterized proteins. Easy to run end-to-end and requiring only a sequenced proteome, PHILHARMONIC is an engine for biological hypothesis generation and discovery in non-model organisms.
Download full-text PDF |
Source |
---|---|
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11565725 | PMC |
http://dx.doi.org/10.1101/2024.10.25.620267 | DOI Listing |
Brief Bioinform
July 2025
Faculty of Science, University of Melbourne, Grattan Street, Parkville, 3010, VIC, Australia.
Repetitive DNA sequences underpin genome architecture and evolutionary processes, yet they remain challenging to classify accurately. Terrier is a deep learning model designed to overcome these challenges by classifying repetitive DNA sequences using a publicly available, curated repeat sequence library trained under the RepeatMasker schema. Poor representation of taxa within repeat databases often limits the classification accuracy and reproducibility of current repeat annotation methods, limiting our understanding of repeat evolution and function.
View Article and Find Full Text PDFBMC Bioinformatics
August 2025
Department of Anatomy and Structural Biology, University of Yamanashi, Yamanashi, Japan.
With the development of sequencing technologies, chromosome-level genome assemblies have become increasingly common across various organisms, including non-model species. BLAST + is one of the most widely used bioinformatics tools for computing sequence alignments, offering numerous optimizations for speed and scalability. Dot plots, which visualize the similarity between two sequences, are widely used in biological research.
View Article and Find Full Text PDFJ Mol Evol
August 2025
Departamento de Zoologia, Universidade Federal Do Paraná, Curitiba, PR, 81531-980, Brazil.
Molecular data are irreplaceable resources for reconstructing the tree of life. Gene-specific substitution rates are essential for estimating divergence times in the absence of fossil calibration or converting coalescent units into absolute time in phylogeographic approaches, among other uses. However, substitution rate estimates are often derived from limited genomic loci, narrow taxonomic comparisons, and model organisms, hindering their applicability to understudied taxa.
View Article and Find Full Text PDFJ Exp Biol
August 2025
Department of Biological Sciences, Florida International University, FL 33199, USA.
Climate change can influence host-parasite dynamics by altering the abundance and distribution of hosts and their parasites as well as the physiology of both parasite and host. While the physiological effects of hosting parasites have been extensively studied in aquatic and laboratory model systems, these dynamics have been much less studied in wild terrestrial vertebrates, such as ectotherms that live in tropical forests. These organisms are particularly vulnerable to climate change because they have limited scope for behavioral buffering of stressful temperatures while already living at body temperatures close to their heat tolerance limits.
View Article and Find Full Text PDFJ Chromatogr B Analyt Technol Biomed Life Sci
November 2025
Department of Chemical Engineering, Indian Institute of Technology Delhi, Hauz Khas, New Delhi, India. Electronic address:
Preclinical efficacy testing is an essential aspect of evaluating quality of antivenoms (AVs). Recent years have witnessed a surge in development of in vitro methods to replace or reduce reliance on the standard mouse lethality assay. In this study, we propose a novel, reversed phase liquid chromatography-mass spectrometry (RPLC-MS)-based platform for monitoring AV activity on venom components under the WHO recommended in solution AV testing conditions.
View Article and Find Full Text PDF