[Advances in methods and applications of single-cell Hi-C data analysis].

Sheng Wu Yi Xue Gong Cheng Xue Za Zhi

School of Computer and Communication Engineering, University of Science and Technology Beijing, Beijing 100083, P. R. China.

Published: October 2023


Category Ranking

98%

Total Visits

921

Avg Visit Duration

2 minutes

Citations

20

Article Abstract

Chromatin three-dimensional genome structure plays a key role in cell function and gene regulation. Single-cell Hi-C techniques can capture genomic structure information at the cellular level, which provides an opportunity to study changes in genomic structure between different cell types. Recently, some excellent computational methods have been developed for single-cell Hi-C data analysis. In this paper, the available methods for single-cell Hi-C data analysis were first reviewed, including preprocessing of single-cell Hi-C data, multi-scale structure recognition based on single-cell Hi-C data, bulk-like Hi-C contact matrix generation based on single-cell Hi-C data sets, pseudo-time series analysis, and cell classification. Then the application of single-cell Hi-C data in cell differentiation and structural variation was described. Finally, the future development direction of single-cell Hi-C data analysis was also prospected.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC10600426PMC
http://dx.doi.org/10.7507/1001-5515.202303046DOI Listing

Publication Analysis

Top Keywords

single-cell hi-c
36
hi-c data
32
data analysis
12
hi-c
10
single-cell
9
data
8
genomic structure
8
based single-cell
8
[advances methods
4
methods applications
4

Similar Publications

Recent advances in single-cell bioinformatics for inferring higher-order chromatin contact maps.

BMB Rep

September 2025

Department of Systems Biology, College of Life Science and Biotechnology, Yonsei University, Seoul 03722, Korea.

DNA, a large molecule located in the nucleus, carries essential genetic information, including gene loci and cis-regulatory elements. Despite its extensive length, DNA is compactly stored within the limited space of the nucleus due to its hierarchical three-dimensional (3D) organization. In this structure, DNA is organized into territories known as topologically associated domains (TADs).

View Article and Find Full Text PDF

Scalable proxies of 3D genome interactions, such as from single-cell co-accessibility or Deep Learning, systematically overestimate long-range chromatin contacts. To correct this bias, we introduce a penalty function grounded in polymer physics, derived by fitting a multi-component power-law model to experimental Hi-C data from maize, rice, and soybean. This correction substantially improves concordance with Hi-C, reduces false-positive rates of long-range interactions by up to 95%, and reveals distinct decay exponents corresponding to different scales of chromatin organization.

View Article and Find Full Text PDF

High-throughput single-cell Hi-C (scHi-C) technologies have opened new avenues for investigating cell-to-cell variability in the three-dimensional organization of the genome within individual nuclei. Despite their potential, analyses of scHi-C data are hindered by data sparsity, which varies substantially across cells. To address this challenge, recent methods aim to denoise scHi-C data and differentiate between two types of zero entries: structural zeros (SZs), which reflect true absence of contacts due to biological structure, and dropouts (DOs), which arise from insufficient sequencing depth.

View Article and Find Full Text PDF

The Genome in a Bottle Consortium (GIAB), hosted by the National Institute of Standards and Technology (NIST), is developing new matched tumor-normal samples, the first explicitly consented for public dissemination of genomic data and cell lines. Here, we describe a comprehensive genomic dataset from the first individual, HG008, including DNA from an adherent, epithelial-like pancreatic ductal adenocarcinoma (PDAC) tumor cell line and matched normal cells from duodenal and pancreatic tissues. Data for the tumor-normal matched samples comes from seventeen distinct state-of-the-art whole genome measurement technologies, including high depth short and long-read bulk whole genome sequencing (WGS), single cell WGS, Hi-C, and karyotyping.

View Article and Find Full Text PDF

Hi-C and single cell Hi-C (scHi-C) data are now routinely generated for studying an array of biological questions of interest, including whole genome chromatin organization to gain a better understanding of the chromosome three-dimensional hierarchical structure: compartments, Topologically Associated Domains (TADs), and long-range interactions. Due to concerns about data quality, especially for scHi-C because of its sparsity, data quality improvement is seen as a necessary step before performing analyses to answer biological questions. As such, methods have been developed accordingly, among them is a set of methods that are "random walk"- based, including random walk with a limited number of steps (RWS) and random walk with restart (RWR).

View Article and Find Full Text PDF