Replication Data for: Persistent spectral simplicial complex-based machine learning for chromosomal structural analysis in cellular differentiation (doi:10.21979/N9/SBFIZD)

View:

Part 1: Document Description
Part 2: Study Description
Part 5: Other Study-Related Materials
Entire Codebook

(external link) (external link)

Document Description

Citation

Title:

Replication Data for: Persistent spectral simplicial complex-based machine learning for chromosomal structural analysis in cellular differentiation

Identification Number:

doi:10.21979/N9/SBFIZD

Distributor:

DR-NTU (Data)

Date of Distribution:

2023-06-23

Version:

1

Bibliographic Citation:

Gong, Weikang; Wee, JunJie; Wu, Min-Chun; Sun, Xiaohan; Li, Chunhua; Xia, Kelin, 2023, "Replication Data for: Persistent spectral simplicial complex-based machine learning for chromosomal structural analysis in cellular differentiation", https://doi.org/10.21979/N9/SBFIZD, DR-NTU (Data), V1

Study Description

Citation

Title:

Replication Data for: Persistent spectral simplicial complex-based machine learning for chromosomal structural analysis in cellular differentiation

Identification Number:

doi:10.21979/N9/SBFIZD

Authoring Entity:

Gong, Weikang (Beijing University of Technology)

Wee, JunJie (Nanyang Technological University)

Wu, Min-Chun (Nanyang Technological University)

Sun, Xiaohan (Beijing University of Technology)

Li, Chunhua (Beijing University of Technology)

Xia, Kelin (Nanyang Technological University)

Software used in Production:

Python

Software used in Production:

MATLAB

Grant Number:

M4081842.110

Grant Number:

RG109/19

Grant Number:

MOE2018-T2-1-033

Grant Number:

MOE-T2EP20120-0013

Grant Number:

31971180

Grant Number:

201906540026

Distributor:

DR-NTU (Data)

Access Authority:

Gong, Weikang

Access Authority:

Li, Chunhua

Access Authority:

Xia, Kelin

Depositor:

Wee, JunJie

Date of Deposit:

2023-06-23

Holdings Information:

https://doi.org/10.21979/N9/SBFIZD

Study Scope

Keywords:

Mathematical Sciences, Medicine, Health and Life Sciences, Mathematical Sciences, Medicine, Health and Life Sciences, Hi-C data, Hodge Laplacian, persistent spectral simplicial complex, chromosomal featurization, machine learning

Abstract:

The three-dimensional (3D) chromosomal structure plays an essential role in all DNA-templated processes, including gene transcription, DNA replication and other cellular processes. Although developing chromosome conformation capture (3C) methods, such as Hi-C, which can generate chromosomal contact data characterized genome-wide chromosomal structural properties, understanding 3D genomic nature-based on Hi-C data remains lacking. Here, we propose a persistent spectral simplicial complex (PerSpectSC) model to describe Hi-C data for the first time. Specifically, a filtration process is introduced to generate a series of nested simplicial complexes at different scales. For each of these simplicial complexes, its spectral information can be calculated from the corresponding Hodge Laplacian matrix. PerSpectSC model describes the persistence and variation of the spectral information of the nested simplicial complexes during the filtration process. Different from all previous models, our PerSpectSC-based features provide a quantitative global-scale characterization of chromosome structures and topology. Our descriptors can successfully classify cell types and also cellular differentiation stages for all the 24 types of chromosomes simultaneously. In particular, persistent minimum best characterizes cell types and Dim (1) persistent multiplicity best characterizes cellular differentiation. These results demonstrate the great potential of our PerSpectSC-based models in polymeric data analysis.

Kind of Data:

Calculated data and codes

Methodology and Processing

Sources Statement

Data Access

Other Study Description Materials

Related Publications

Citation

Identification Number:

10.1093/bib/bbac168

Bibliographic Citation:

Gong, W., Wee, J., Wu, M. C., Sun, X., Li, C., & Xia, K. (2022). Persistent spectral simplicial complex-based machine learning for chromosomal structural analysis in cellular differentiation. Briefings in Bioinformatics, 23(4), bbac168.

Citation

Identification Number:

10356/168972

Bibliographic Citation:

Gong, W., Wee, J., Wu, M., Sun, X., Li, C. & Xia, K. (2022). Persistent spectral simplicial complex-based machine learning for chromosomal structural analysis in cellular differentiation. Briefings in Bioinformatics, 23(4), bbac168-

Other Study-Related Materials

Label:

PerSpectSC-ML.rar

Notes:

application/x-rar-compressed