Publications by authors named "Martin Vingron"

Bats are the only mammals capable of self-powered flight, an evolutionary innovation based on the transformation of forelimbs into wings. The bat wing is characterized by an extreme elongation of the second to fifth digits with a wing membrane called the chiropatagium connecting them. Here we investigated the developmental and cellular origin of this structure by comparing bat and mouse limbs using omics tools and single-cell analyses.

View Article and Find Full Text PDF

PHF13 is a H3K4me3 epigenetic reader that modulates key chromatin processes including transcription, DNA damage response, and chromatin architecture. PHF13 is found aberrantly regulated in different cancers and its misexpression alters the epigenetic landscape of key transcription factors that regulate epithelial-to-mesenchymal transition. In this study, we sought to understand how PHF13's chromatin affinity and diverse chromatin functions are intrinsically regulated.

View Article and Find Full Text PDF

Developmental gene expression is a remarkably conserved process, yet most cis-regulatory elements (CREs) lack sequence conservation, especially at larger evolutionary distances. Some evidence suggests that CREs at the same genomic position remain functionally conserved independent of sequence conservation. However, the extent of such positional conservation remains unclear.

View Article and Find Full Text PDF
Article Synopsis
  • scATAC-seq is a technique that profiles cell accessibility but struggles to accurately annotate cell types due to the absence of known 'marker regions' compared to scRNA-seq.
  • Traditional methods for annotation rely on converting accessibility data to gene expression patterns, which is not always effective.
  • The new method, scATAcat, uses existing bulk ATAC-seq data as prototypes and aggregates similar cells into pseudobulk to improve annotation accuracy, with results supported by multiple datasets and available as a python package.
View Article and Find Full Text PDF

During liver development, bipotential progenitor cells called hepatoblasts differentiate into hepatocytes or cholangiocytes. Hepatocyte differentiation is uniquely associated with multi-axial polarity, enabling the anisotropic expansion of apical lumina between adjacent cells and formation of a three-dimensional network of bile canaliculi. Cholangiocytes, the cells forming the bile ducts, exhibit the vectorial polarity characteristic of epithelial cells.

View Article and Find Full Text PDF

Transcription factors (TFs) control specificity and activity of gene transcription, but whether a relationship between these two features exists is unclear. Here we provide evidence for an evolutionary trade-off between the activity and specificity in human TFs encoded as submaximal dispersion of aromatic residues in their intrinsically disordered protein regions. We identified approximately 500 human TFs that encode short periodic blocks of aromatic residues in their intrinsically disordered regions, resembling imperfect prion-like sequences.

View Article and Find Full Text PDF

A fundamental analysis task for single-cell transcriptomics data is clustering with subsequent visualization of cell clusters. The genes responsible for the clustering are only inferred in a subsequent step. Clustering cells and genes together would be the remit of biclustering algorithms, which are often bogged down by the size of single-cell data.

View Article and Find Full Text PDF

Dormancy is an essential biological process for the propagation of many life forms through generations and stressful conditions. Early embryos of many mammals are preservable for weeks to months within the uterus in a dormant state called diapause, which can be induced in vitro through mTOR inhibition. Cellular strategies that safeguard original cell identity within the silent genomic landscape of dormancy are not known.

View Article and Find Full Text PDF

During cellular processes such as differentiation or response to external stimuli, cells exhibit dynamic changes in their gene expression profiles. Single-cell RNA sequencing (scRNA-seq) can be used to investigate these dynamic changes. To this end, cells are typically ordered along a pseudotemporal trajectory which recapitulates the progression of cells as they transition from one cell state to another.

View Article and Find Full Text PDF

Mammalian developmental timing is adjustable in vivo by preserving pre-implantation embryos in a dormant state called diapause. Inhibition of the growth regulator mTOR (mTORi) pauses mouse development in vitro, yet how embryonic dormancy is maintained is not known. Here we show that mouse embryos in diapause are sustained by using lipids as primary energy source.

View Article and Find Full Text PDF
Article Synopsis
  • The hexaploid sweetpotato has an unclear genetic origin, and this study identifies two main progenitors, a diploid closely related to Ipomoea aequatoriensis and a tetraploid likely of I. batatas 4x.
  • The research used genetic evidence and phylogenetic analysis, revealing that sweetpotato likely developed from crosses between these progenitors, followed by whole-genome duplication.
  • Key genes selected during sweetpotato's evolution are linked to root formation, genome stability, resistance, and nutrient transport, providing insights for future crop improvement.
View Article and Find Full Text PDF

Motivation: Identifying target promoters of active enhancers is a crucial step for realizing gene regulation and deciphering phenotypes and diseases. Up to now, several computational methods were developed to predict enhancer gene interactions, but they require either many epigenomic and transcriptomic experimental assays to generate cell-type (CT)-specific predictions or a single experiment applied to a large cohort of CTs to extract correlations between activities of regulatory elements. Thus, inferring CT-specific enhancer gene interactions in unstudied or poorly annotated CTs becomes a laborious and costly task.

View Article and Find Full Text PDF

Bone morphogenetic protein (BMP) signaling and fluid shear stress (FSS) mediate complementary functions in vascular homeostasis and disease development. It remains to be shown whether altered chromatin accessibility downstream of BMP and FSS offers a crosstalk level to explain changes in SMAD-dependent transcription. Here, we employed ATAC-seq to analyze arterial endothelial cells stimulated with BMP9 and/or FSS.

View Article and Find Full Text PDF
Article Synopsis
  • Changes in gene expression are key to phenotypic innovation, but the details of how these changes occur and affect trait evolution are not well understood.
  • This study investigates the genetic mechanisms behind masculinizing ovotestes in female moles, focusing on the role of SALL1 expression and enhancer activity.
  • Findings reveal that while 3D organization of the SALL1 locus is conserved, there is a notable divergence in enhancer functionality, indicating that modifications in gene expression could explain how new traits evolve.
View Article and Find Full Text PDF
Article Synopsis
  • Long-read transcriptome sequencing (LRTS) helps analyze alternative splicing and needs flexible computational tools for its diverse workflow.
  • IsoTools is a new Python-based framework that allows for transcriptome reconstruction and quantification, incorporating methods for identifying alternative splicing and detecting differential events using a statistical approach.
  • IsoTools can be accessed on GitHub and PyPI, with documentation and tutorials available online to assist users.
View Article and Find Full Text PDF

We present a novel data structure for searching sequences in large databases: the Hierarchical Interleaved Bloom Filter (HIBF). It is extremely fast and space efficient, yet so general that it could serve as the underlying engine for many applications. We show that the HIBF is superior in build time, index size, and search time while achieving a comparable or better accuracy compared to other state-of-the-art tools.

View Article and Find Full Text PDF

Protein-protein interactions (PPIs) play a crucial role in numerous molecular processes. Despite many efforts, mechanisms governing molecular recognition between interacting proteins remain poorly understood and it is particularly challenging to predict from sequence whether two proteins can interact. Here we present a new method to tackle this challenge using intrinsically disordered regions (IDRs).

View Article and Find Full Text PDF

Structural variants are a common cause of disease and contribute to a large extent to inter-individual variability, but their detection and interpretation remain a challenge. Here, we investigate 11 individuals with complex genomic rearrangements including germline chromothripsis by combining short- and long-read genome sequencing (GS) with Hi-C. Large-scale genomic rearrangements are identified in Hi-C interaction maps, allowing for an independent assessment of breakpoint calls derived from the GS methods, resulting in >300 genomic junctions.

View Article and Find Full Text PDF

Regulatory landscapes drive complex developmental gene expression, but it remains unclear how their integrity is maintained when incorporating novel genes and functions during evolution. Here, we investigated how a placental mammal-specific gene, Zfp42, emerged in an ancient vertebrate topologically associated domain (TAD) without adopting or disrupting the conserved expression of its gene, Fat1. In ESCs, physical TAD partitioning separates Zfp42 and Fat1 with distinct local enhancers that drive their independent expression.

View Article and Find Full Text PDF
Article Synopsis
  • * The consortium developed a central repository that brings together over 1,800 genomic data sets to enhance the understanding of zebrafish development.
  • * They identified 140,000 regulatory elements and explored their unique chromatin features, linking zebrafish data to mouse genomics for broader research implications.
View Article and Find Full Text PDF

On the occasion of Dr. Michael Waterman's 80th birthday, we review his major contributions to the field of computational biology and bioinformatics including the famous Smith-Waterman algorithm for sequence alignment, the probability and statistics theory related to sequence alignment, algorithms for sequence assembly, the Lander-Waterman model for genome physical mapping, combinatorics and predictions of ribonucleic acid structures, word counting statistics in molecular sequences, alignment-free sequence comparison, and algorithms for haplotype block partition and tagSNP selection related to the International HapMap Project. His books for graduate students and geared toward undergraduate students played key roles in computational biology and bioinformatics education.

View Article and Find Full Text PDF