Replication data for: Topological Feature Engineering for Machine Learning Based Halide Perovskite Materials Design (doi:10.21979/N9/CVJWZ9)

View:

Part 1: Document Description
Part 2: Study Description
Part 5: Other Study-Related Materials
Entire Codebook

(external link) (external link)

Document Description

Citation

Title:

Replication data for: Topological Feature Engineering for Machine Learning Based Halide Perovskite Materials Design

Identification Number:

doi:10.21979/N9/CVJWZ9

Distributor:

DR-NTU (Data)

Date of Distribution:

2022-12-01

Version:

1

Bibliographic Citation:

Anand, D Vijay; Xu, Qiang; Wee, Junjie; Xia, Kelin; Sum, Tze Chien, 2022, "Replication data for: Topological Feature Engineering for Machine Learning Based Halide Perovskite Materials Design", https://doi.org/10.21979/N9/CVJWZ9, DR-NTU (Data), V1

Study Description

Citation

Title:

Replication data for: Topological Feature Engineering for Machine Learning Based Halide Perovskite Materials Design

Identification Number:

doi:10.21979/N9/CVJWZ9

Authoring Entity:

Anand, D Vijay (Nanyang Technological University)

Xu, Qiang (Nanyang Technological University)

Wee, Junjie (Nanyang Technological University)

Xia, Kelin (Nanyang Technological University)

Sum, Tze Chien (Nanyang Technological University)

Software used in Production:

python

Software used in Production:

matlab

Software used in Production:

vasp

Grant Number:

M4081842.110

Grant Number:

RG109/19

Grant Number:

MOE-T2EP50120-0004

Grant Number:

MOE-T2EP20120-0013

Grant Number:

NRF-NRFI2018-04

Distributor:

DR-NTU (Data)

Access Authority:

Sum, Tze Chien

Depositor:

Xu, Qiang

Date of Deposit:

2022-09-10

Holdings Information:

https://doi.org/10.21979/N9/CVJWZ9

Study Scope

Keywords:

Mathematical Sciences, Physics, Mathematical Sciences, Physics, topology, machine learning, halide perovskite

Abstract:

Accelerated materials development with machine learning (ML) assisted screening and high throughput experimentation for new photovoltaic materials holds the key to addressing our grand energy challenges. Data-driven ML is envisaged as a decisive enabler for new perovskite materials discovery. However, its full potential can be severely curtailed by poorly represented molecular descriptors (or fingerprints). Optimal descriptors are essential for establishing effective mathematical representations of quantitative structure-property relationships. Here we reveal that our persistent functions (PFs) based learning models offer significant accuracy advantages over traditional descriptor based models in organic-inorganic halide perovskite (OIHP) materials design and have similar performance as deep learning models. Our multiscale simplicial complex approach not only provides a more precise representation for OIHP structures and underlying interactions, but also has better transferability to ML models. Our results demonstrate that advanced geometrical and topological invariants are highly efficient feature engineering approaches that can markedly improve the performance of learning models for molecular data analysis. Further, new structure-property relationships can be established between our invariants and bandgaps. {We anticipate that our molecular representations and featurization models will transcend the limitations of conventional approaches and lead to breakthroughs in perovskite materials design and discovery.

Kind of Data:

calculated data

Methodology and Processing

Sources Statement

Data Access

Other Study Description Materials

Related Studies

<a href="https://github.com/ExpectozJJ/PF-OIHP"> https://github.com/ExpectozJJ/PF-OIHP</a>

Related Publications

Citation

Identification Number:

10.1038/s41524-022-00883-8

Bibliographic Citation:

Anand, D. V., Xu, Q., Wee, J., Xia, K., & Sum, T. C. (2022). Topological feature engineering for machine learning based halide perovskite materials design. npj Computational Materials, 8(1), 1-8.

Citation

Identification Number:

10356/165413

Bibliographic Citation:

Anand, D. V., Xu, Q., Wee, J., Xia, K. & Sum, T. C. (2022). Topological feature engineering for machine learning based halide perovskite materials design. Npj Computational Materials, 8(1).

Other Study-Related Materials

Label:

dfpt.tar.gz

Text:

raw data of dielectric constant of perovskites

Notes:

application/x-gzip

Other Study-Related Materials

Label:

fig.1.tar.gz

Text:

Data for replication and high-resolution image for Figure 1.

Notes:

application/x-gzip

Other Study-Related Materials

Label:

fig.2.tar.gz

Text:

Data for replication and high-resolution image for Figure 2.

Notes:

application/x-gzip

Other Study-Related Materials

Label:

fig.3.tar.gz

Text:

Data for replication and high-resolution image for Figure 3.

Notes:

application/x-gzip

Other Study-Related Materials

Label:

fig.4.tar.gz

Text:

Data for replication and high-resolution image for Figure 4.

Notes:

application/x-gzip

Other Study-Related Materials

Label:

fig.5.tar.gz

Text:

Data for replication and high-resolution image for Figure 5.

Notes:

application/x-gzip

Other Study-Related Materials

Label:

fig.SI.tar.gz

Text:

Data for replication and high-resolution images for Supplementary Figures (total of 3 figures).

Notes:

application/x-gzip

Other Study-Related Materials

Label:

HOIP_database_DRYAD.rar

Text:

database of perovskites for Figure 4, Figure 5, and Supplementary Figures (total of 3 figures).

Notes:

application/x-rar-compressed

Other Study-Related Materials

Label:

MAPbX3_sc333_pdb.tar.gz

Text:

raw data of perovskite structures for Figure 4, Figure 5, and Supplementary Figures (total of 3 figures).

Notes:

application/x-gzip

Other Study-Related Materials

Label:

md_structure.tar.gz

Text:

raw data of perovskite MD structures for Figure 3.

Notes:

application/x-gzip