[Advances in methods and applications of single-cell Hi-C data analysis].

Haiyan Gong , Fuqiang Ma , Xiaotong Zhang

Sheng Wu Yi Xue Gong Cheng Xue Za Zhi

School of Computer and Communication Engineering, University of Science and Technology Beijing, Beijing 100083, P. R. China.

Published: October 2023

Category Ranking

98%

Total Visits

921

Avg Visit Duration

2 minutes

Citations

Chromatin three-dimensional genome structure plays a key role in cell function and gene regulation. Single-cell Hi-C techniques can capture genomic structure information at the cellular level, which provides an opportunity to study changes in genomic structure between different cell types. Recently, some excellent computational methods have been developed for single-cell Hi-C data analysis. In this paper, the available methods for single-cell Hi-C data analysis were first reviewed, including preprocessing of single-cell Hi-C data, multi-scale structure recognition based on single-cell Hi-C data, bulk-like Hi-C contact matrix generation based on single-cell Hi-C data sets, pseudo-time series analysis, and cell classification. Then the application of single-cell Hi-C data in cell differentiation and structural variation was described. Finally, the future development direction of single-cell Hi-C data analysis was also prospected.

Download full-text PDF	Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC10600426	PMC
http://dx.doi.org/10.7507/1001-5515.202303046	DOI Listing

Publication Analysis

Top Keywords

single-cell hi-c

hi-c data

data analysis

hi-c

single-cell

data

genomic structure

based single-cell

[advances methods

methods applications

Similar Publications

Recent advances in single-cell bioinformatics for inferring higher-order chromatin contact maps.

BMB Rep

September 2025

Department of Systems Biology, College of Life Science and Biotechnology, Yonsei University, Seoul 03722, Korea.

Seung Kyun Noh , Minhyeok Lee , Hyobin Jeong

DNA, a large molecule located in the nucleus, carries essential genetic information, including gene loci and cis-regulatory elements. Despite its extensive length, DNA is compactly stored within the limited space of the nucleus due to its hierarchical three-dimensional (3D) organization. In this structure, DNA is organized into territories known as topologically associated domains (TADs).

View Article and Find Full Text PDF

Similar Publications

Polymer-derived distance penalties improve chromatin interaction predictions from single-cell data across crop genomes.

bioRxiv

August 2025

Plant Epigenomics, TUM School of Life Sciences Weihenstephan, Technical University of Munich, Freising, 85354, Germany.

Luca Schlegel , Fabio Gómez Cano , Alexandre P Marand , Frank Johannes

Scalable proxies of 3D genome interactions, such as from single-cell co-accessibility or Deep Learning, systematically overestimate long-range chromatin contacts. To correct this bias, we introduce a penalty function grounded in polymer physics, derived by fitting a multi-component power-law model to experimental Hi-C data from maize, rice, and soybean. This correction substantially improves concordance with Hi-C, reduces false-positive rates of long-range interactions by up to 95%, and reveals distinct decay exponents corresponding to different scales of chromatin organization.

View Article and Find Full Text PDF

Similar Publications

Novel Spatial-Structural-Zero-Aware Dissimilarity Measures for Subtype Discovery Using Single Cell Hi-C Data.

bioRxiv

July 2025

Yongqi Liu , Victor Jin , Shili Lin

High-throughput single-cell Hi-C (scHi-C) technologies have opened new avenues for investigating cell-to-cell variability in the three-dimensional organization of the genome within individual nuclei. Despite their potential, analyses of scHi-C data are hindered by data sparsity, which varies substantially across cells. To address this challenge, recent methods aim to denoise scHi-C data and differentiate between two types of zero entries: structural zeros (SZs), which reflect true absence of contacts due to biological structure, and dropouts (DOs), which arise from insufficient sequencing depth.

View Article and Find Full Text PDF

Similar Publications

Development and extensive sequencing of a broadly-consented Genome in a Bottle matched tumor-normal pair.

Sci Data

July 2025

Material Measurement Laboratory, National Institute of Standards and Technology, 100 Bureau Dr., Gaithersburg, MD, 20899, USA.

Jennifer H McDaniel , Vaidehi Patel , Nathan D Olson , Hua-Jun He , Zhiyong He

The Genome in a Bottle Consortium (GIAB), hosted by the National Institute of Standards and Technology (NIST), is developing new matched tumor-normal samples, the first explicitly consented for public dissemination of genomic data and cell lines. Here, we describe a comprehensive genomic dataset from the first individual, HG008, including DNA from an adherent, epithelial-like pancreatic ductal adenocarcinoma (PDAC) tumor cell line and matched normal cells from duodenal and pancreatic tissues. Data for the tumor-normal matched samples comes from seventeen distinct state-of-the-art whole genome measurement technologies, including high depth short and long-read bulk whole genome sequencing (WGS), single cell WGS, Hi-C, and karyotyping.

View Article and Find Full Text PDF

Similar Publications

Can Random Walking on a Hi-C Contact Matrix Lead to Data Quality Improvement? An Assessment.

bioRxiv

June 2025

Department of Statistics, The Ohio State University, Columbus, OH 43210.

Yongqi Liu , Shili Lin

Hi-C and single cell Hi-C (scHi-C) data are now routinely generated for studying an array of biological questions of interest, including whole genome chromatin organization to gain a better understanding of the chromosome three-dimensional hierarchical structure: compartments, Topologically Associated Domains (TADs), and long-range interactions. Due to concerns about data quality, especially for scHi-C because of its sparsity, data quality improvement is seen as a necessary step before performing analyses to answer biological questions. As such, methods have been developed accordingly, among them is a set of methods that are "random walk"- based, including random walk with a limited number of steps (RWS) and random walk with restart (RWR).

View Article and Find Full Text PDF

Similar Publications