Publications by Adrian Altenhoff | LitMetric

Publications by authors named "Adrian Altenhoff"

Page 1 of 2

EdgeHOG: a method for fine-grained ancestral gene order inference at large scale.

Charles Bernard , Yannis Nevers , Naga Bhushana Rao Karampudi , Kimberly J Gilbert , Clément Train , Adrian Altenhoff

Nat Ecol Evol

August 2025

Ancestral genomes are essential for studying the diversification of life from the last universal common ancestor to modern organisms. Methods have been proposed to infer ancestral gene order, but they lack scalability, limiting the depth to which gene neighbourhood evolution can be traced back. Here we introduce edgeHOG, a tool designed for accurate ancestral gene order inference with linear time complexity.

View Article and Find Full Text PDF

Annotation matters: the effect of structural gene annotation on orthology inference.

Silvia Prieto-Baños , Yannis Nevers , Adrian Altenhoff , Alex Warwick Vesztrocy , Christophe Dessimoz

Bioinformatics

July 2025

Motivation: In silico gene annotation, the process of identifying the genes present in a genome, remains a challenging task. As genome assemblies rapidly increase, the corresponding gene models and repertoires often fall short in quality. Despite advances in annotation methods, a lack of community standards means that most published gene annotations result from ad hoc pipelines.

View Article and Find Full Text PDF

A large collection of bioinformatics question-query pairs over federated knowledge graphs: methodology and applications.

Jerven Bolleman , Vincent Emonet , Adrian Altenhoff , Amos Bairoch , Marie-Claude Blatter

Gigascience

January 2025

Background: In recent decades, several life science resources have structured data using the same framework and made these accessible using the same query language to facilitate interoperability. Knowledge graphs have seen increased adoption in bioinformatics due to their advantages for representing data in a generic graph format. For example, yummydata.

View Article and Find Full Text PDF

Orthology inference at scale with FastOMA.

Sina Majidian , Yannis Nevers , Ali Yazdizadeh Kharrazi , Alex Warwick Vesztrocy , Stefano Pascarelli , Adrian M Altenhoff

Nat Methods

February 2025

The surge in genome data, with ongoing efforts aiming to sequence 1.5 M eukaryotes in a decade, could revolutionize genomics, revealing the origins, evolution and genetic innovations of biological processes. Yet, traditional genomics methods scale poorly with such large datasets.

View Article and Find Full Text PDF

New developments for the Quest for Orthologs benchmark service.

Adrian Altenhoff , Yannis Nevers , Vinh Tran , Dushyanth Jyothi , Maria Martin

NAR Genom Bioinform

December 2024

The Quest for Orthologs (QfO) orthology benchmark service (https://orthology.benchmarkservice.org) hosts a wide range of standardized benchmarks for orthology inference evaluation.

View Article and Find Full Text PDF

DrosOMA: the Orthologous Matrix browser.

Antonin Thiébaut , Adrian M Altenhoff , Giulia Campli , Natasha Glover , Christophe Dessimoz

F1000Res

March 2024

Background: Comparative genomic analyses to delineate gene evolutionary histories inform the understanding of organismal biology by characterising gene and gene family origins, trajectories, and dynamics, as well as enabling the tracing of speciation, duplication, and loss events, and facilitating the transfer of gene functional information across species. Genomic data are available for an increasing number of species from the genus Drosophila, however, a dedicated resource exploiting these data to provide the research community with browsable results from genus-wide orthology delineation has been lacking.

Methods: Using the OMA Orthologous Matrix orthology inference approach and browser deployment framework, we catalogued orthologues across a selected set of Drosophila species with high-quality annotated genomes.

View Article and Find Full Text PDF

Quality assessment of gene repertoire annotations with OMArk.

Yannis Nevers , Alex Warwick Vesztrocy , Victor Rossier , Clément-Marie Train , Adrian Altenhoff

Nat Biotechnol

January 2025

In the era of biodiversity genomics, it is crucial to ensure that annotations of protein-coding gene repertoires are accurate. State-of-the-art tools to assess genome annotations measure the completeness of a gene repertoire but are blind to other errors, such as gene overprediction or contamination. We introduce OMArk, a software package that relies on fast, alignment-free sequence comparisons between a query proteome and precomputed gene families across the tree of life.

View Article and Find Full Text PDF

OMA orthology in 2024: improved prokaryote coverage, ancestral and extant GO enrichment, a revamped synteny viewer and more in the OMA Ecosystem.

Adrian M Altenhoff , Alex Warwick Vesztrocy , Charles Bernard , Clement-Marie Train , Alina Nicheperovich

Nucleic Acids Res

January 2024

In this update paper, we present the latest developments in the OMA browser knowledgebase, which aims to provide high-quality orthology inferences and facilitate the study of gene families, genomes and their evolution. First, we discuss the addition of new species in the database, particularly an expanded representation of prokaryotic species. The OMA browser now offers Ancestral Genome pages and an Ancestral Gene Order viewer, allowing users to explore the evolutionary history and gene content of ancestral genomes.

View Article and Find Full Text PDF

Inference of phylogenetic trees directly from raw sequencing reads using Read2Tree.

David Dylus , Adrian Altenhoff , Sina Majidian , Fritz J Sedlazeck , Christophe Dessimoz

Nat Biotechnol

January 2024

Current methods for inference of phylogenetic trees require running complex pipelines at substantial computational and labor costs, with additional constraints in sequencing coverage, assembly and annotation quality, especially for large datasets. To overcome these challenges, we present Read2Tree, which directly processes raw sequencing reads into groups of corresponding genes and bypasses traditional steps in phylogeny inference, such as genome assembly, annotation and all-versus-all sequence comparisons, while retaining accuracy. In a benchmark encompassing a broad variety of datasets, Read2Tree is 10-100 times faster than assembly-based approaches and in most cases more accurate-the exception being when sequencing coverage is high and reference species very distant.

View Article and Find Full Text PDF

Read2Tree: scalable and accurate phylogenetic trees from raw reads.

David Dylus , Adrian Altenhoff , Sina Majidian , Fritz J Sedlazeck , Christophe Dessimoz

bioRxiv

December 2022

The inference of phylogenetic trees is foundational to biology. However, state-of-the-art phylogenomics requires running complex pipelines, at significant computational and labour costs, with additional constraints in sequencing coverage, assembly and annotation quality. To overcome these challenges, we present Read2Tree, which directly processes raw sequencing reads into groups of corresponding genes.

View Article and Find Full Text PDF

OMAMO: orthology-based alternative model organism selection.

Alina Nicheperovich , Adrian M Altenhoff , Christophe Dessimoz , Sina Majidian

Bioinformatics

May 2022

Summary: The conservation of pathways and genes across species has allowed scientists to use non-human model organisms to gain a deeper understanding of human biology. However, the use of traditional model systems such as mice, rats and zebrafish is costly, time-consuming and increasingly raises ethical concerns, which highlights the need to search for less complex model organisms. Existing tools only focus on the few well-studied model systems, most of which are complex animals.

View Article and Find Full Text PDF

The Quest for Orthologs orthology benchmark service in 2022.

Yannis Nevers , Tamsin E M Jones , Dushyanth Jyothi , Bethan Yates , Meritxell Ferret , Adrian Altenhoff

Nucleic Acids Res

July 2022

The Orthology Benchmark Service (https://orthology.benchmarkservice.org) is the gold standard for orthology inference evaluation, supported and maintained by the Quest for Orthologs consortium.

View Article and Find Full Text PDF

OMA orthology in 2021: website overhaul, conserved isoforms, ancestral gene order and more.

Adrian M Altenhoff , Clément-Marie Train , Kimberly J Gilbert , Ishita Mediratta , Tarcisio Mendes de Farias

Nucleic Acids Res

January 2021

OMA is an established resource to elucidate evolutionary relationships among genes from currently 2326 genomes covering all domains of life. OMA provides pairwise and groupwise orthologs, functional annotations, local and global gene order conservation (synteny) information, among many other functions. This update paper describes the reorganisation of the database into gene-, group- and genome-centric pages.

View Article and Find Full Text PDF

How to build phylogenetic species trees with OMA.

David Dylus , Yannis Nevers , Adrian M Altenhoff , Antoine Gürtler , Christophe Dessimoz

F1000Res

June 2020

Knowledge of species phylogeny is critical to many fields of biology. In an era of genome data availability, the most common way to make a phylogenetic species tree is by using multiple protein-coding genes, conserved in multiple species. This methodology is composed of several steps: orthology inference, multiple sequence alignment and inference of the phylogeny with dedicated tools.

View Article and Find Full Text PDF

The Quest for Orthologs benchmark service and consensus calls in 2020.

Adrian M Altenhoff , Javier Garrayo-Ventas , Salvatore Cosentino , David Emms , Natasha M Glover

Nucleic Acids Res

July 2020

The identification of orthologs-genes in different species which descended from the same gene in their last common ancestor-is a prerequisite for many analyses in comparative genomics and molecular evolution. Numerous algorithms and resources have been conceived to address this problem, but benchmarking and interpreting them is fraught with difficulties (need to compare them on a common input dataset, absence of ground truth, computational cost of calling orthologs). To address this, the Quest for Orthologs consortium maintains a reference set of proteomes and provides a web server for continuous orthology benchmarking (http://orthology.

View Article and Find Full Text PDF

The CAFA challenge reports improved protein function prediction and new functional annotations for hundreds of genes through experimental screens.

Naihui Zhou , Yuxiang Jiang , Timothy R Bergquist , Alexandra J Lee , Balint Z Kacsoh , Adrian Altenhoff

Genome Biol

November 2019

Background: The Critical Assessment of Functional Annotation (CAFA) is an ongoing, global, community-driven effort to evaluate and improve the computational annotation of protein function.

Results: Here, we report on the results of the third CAFA challenge, CAFA3, that featured an expanded analysis over the previous CAFA rounds, both in terms of volume of data analyzed and the types of analysis performed. In a novel and major new development, computational predictions and assessment goals drove some of the experimental assays, resulting in new functional annotations for more than 1000 genes.

View Article and Find Full Text PDF

Inferring Orthology and Paralogy.

Adrian M Altenhoff , Natasha M Glover , Christophe Dessimoz

Methods Mol Biol

January 2020

The distinction between orthologs and paralogs, genes that started diverging by speciation versus duplication, is relevant in a wide range of contexts, most notably phylogenetic tree inference and protein function annotation. In this chapter, we provide an overview of the methods used to infer orthology and paralogy. We survey both graph-based approaches (and their various grouping strategies) and tree-based approaches, which solve the more general problem of gene/species tree reconciliation.

View Article and Find Full Text PDF

OMA standalone: orthology inference among public and custom genomes and transcriptomes.

Adrian M Altenhoff , Jeremy Levy , Magdalena Zarowiecki , Bartłomiej Tomiczek , Alex Warwick Vesztrocy

Genome Res

July 2019

Article Synopsis

Genomes and transcriptomes are often difficult to analyze; identifying orthologs (corresponding genes across species) is a critical but challenging step in this process.
The Orthologous MAtrix (OMA) database is a key resource for finding orthologs, and the OMA pipeline can be run as a standalone program on Linux and Mac, supporting various job schedulers and scaling up for large data processing.
OMA standalone allows users to integrate their own data with public genomic data and offers applications like phylogenetic analysis and identifying gene family changes or potential drug targets, and is available as open-source software.

View Article and Find Full Text PDF

Expanding the Orthologous Matrix (OMA) programmatic interfaces: REST API and the packages for R and Python.

Klara Kaleb , Alex Warwick Vesztrocy , Adrian Altenhoff , Christophe Dessimoz

F1000Res

June 2020

The Orthologous Matrix (OMA) is a well-established resource to identify orthologs among many genomes. Here, we present two recent additions to its programmatic interface, namely a REST API, and user-friendly R and Python packages called . These should further facilitate the incorporation of OMA data into computational scripts and pipelines.

View Article and Find Full Text PDF

Assigning confidence scores to homoeologs using fuzzy logic.

Natasha M Glover , Adrian Altenhoff , Christophe Dessimoz

PeerJ

January 2019

In polyploid genomes, homoeologs are a specific subtype of homologs, and can be thought of as orthologs between subgenomes. In Orthologous MAtrix, we infer homoeologs in three polyploid plant species: upland cotton (), rapeseed (), and bread wheat (). While we can typically recognize the features of a "good" homoeolog prediction (a consistent evolutionary distance, high synteny, and a one-to-one relationship), none of them is a hard-fast criterion.

View Article and Find Full Text PDF

iHam and pyHam: visualizing and processing hierarchical orthologous groups.

Clément-Marie Train , Miguel Pignatelli , Adrian Altenhoff , Christophe Dessimoz

Bioinformatics

July 2019

Summary: The evolutionary history of gene families can be complex due to duplications and losses. This complexity is compounded by the large number of genomes simultaneously considered in contemporary comparative genomic analyses. As provided by several orthology databases, hierarchical orthologous groups (HOGs) are sets of genes that are inferred to have descended from a common ancestral gene within a species clade.

View Article and Find Full Text PDF

Phylogenetic approaches to identifying fragments of the same gene, with application to the wheat genome.

Ivana Piližota , Clément-Marie Train , Adrian Altenhoff , Henning Redestig , Christophe Dessimoz

Bioinformatics

April 2019

Motivation: As the time and cost of sequencing decrease, the number of available genomes and transcriptomes rapidly increases. Yet the quality of the assemblies and the gene annotations varies considerably and often remains poor, affecting downstream analyses. This is particularly true when fragments of the same gene are annotated as distinct genes, which may cause them to be mistaken as paralogs.

View Article and Find Full Text PDF

The OMA orthology database in 2018: retrieving evolutionary relationships among all domains of life through richer web and programmatic interfaces.

Adrian M Altenhoff , Natasha M Glover , Clément-Marie Train , Klara Kaleb , Alex Warwick Vesztrocy

Nucleic Acids Res

January 2018

Article Synopsis

The Orthologous Matrix (OMA) is a key tool for comparing genes across various species and has received updates to improve its features.
Recent improvements include a new interactive viewer for orthologous gene groups and enhanced protein domain annotations that help link genes.
These updates also expand species coverage, particularly in plants and early eukaryotes, and make it easier to integrate OMA with other bioinformatics resources.

View Article and Find Full Text PDF

Gearing up to handle the mosaic nature of life in the quest for orthologs.

Kristoffer Forslund , Cecile Pereira , Salvador Capella-Gutierrez , Alan Sousa da Silva , Adrian Altenhoff

Bioinformatics

January 2018

The Quest for Orthologs (QfO) is an open collaboration framework for experts in comparative phylogenomics and related research areas who have an interest in highly accurate orthology predictions and their applications. We here report highlights and discussion points from the QfO meeting 2015 held in Barcelona. Achievements in recent years have established a basis to support developments for improved orthology prediction and to explore new approaches.

View Article and Find Full Text PDF

Orthologous Matrix (OMA) algorithm 2.0: more robust to asymmetric evolutionary rates and more scalable hierarchical orthologous group inference.

Clément-Marie Train , Natasha M Glover , Gaston H Gonnet , Adrian M Altenhoff , Christophe Dessimoz

Bioinformatics

July 2017

Motivation: Accurate orthology inference is a fundamental step in many phylogenetics and comparative analysis. Many methods have been proposed, including OMA (Orthologous MAtrix). Yet substantial challenges remain, in particular in coping with fragmented genes or genes evolving at different rates after duplication, and in scaling to large datasets.

View Article and Find Full Text PDF