Unveiling the Quantum Chemical Bonding Database for Solid-State Materials

Published in Materials
Unveiling the Quantum Chemical Bonding Database for Solid-State Materials
Like

Share this post

Choose a social network to share with, or copy the URL to share elsewhere

This is a representation of how your post may appear on social media. The actual post will vary between social networks

Understanding the physical and chemical phenomena that define the properties of solid-state materials has long been a quest at the heart of materials science. From a chemist's perspective, the intricate web of chemical bonds could play a major role in dictating material properties.  In our paper "A Quantum Chemical Bonding Database for Solid State Materials," we dive into the world of chemical bonding that is aimed to uncover relationships between material properties and chemical bonding on a larger scale. Through a comprehensive bonding analysis of over 1,500 insulators and semiconductors, we have created a dataset that should be invaluable for researchers and scientists working on materials design, computational chemistry, and machine learning.

Fig. 1. Shows a snapshot of summarized bonding information data for PbTe (mp-19717), including the COHP plot of the most relevant bonds from the database.

In this study, we harnessed the capabilities of the LOBSTER[1–4] software package, which takes modern density functional theory (DFT) data and transforms it into a form that reveals the bonding scenario in the materials. LOBSTER allows one to peer into the bonding world that holds the atoms together in solid-state materials by projecting plane wave-based wave functions onto a local, atomic orbital basis. The critical component necessary for this research is the development of a fully automatic workflow[5] that combines the VASP[6–8] and LOBSTER computations. Another essential component is the LobsterPy[9] package, which automates analyzing a vast number of output files to populate the dataset.

Fig. 2. (a) Exemplar discretized PDOS obtained from LOBSTER, and VASP runs for diamond (mp-66). (b) Distribution of the Tanimoto index comparison from VASP and LOBSTER indicates excellent agreement between the projections for the whole dataset. And Fig. 3. (a) Exemplar plot for PDOS obtained from LOBSTER and VASP runs for diamond (mp-66). BC, BW, BS, and BK denote band center, width, skewness, and kurtosis. (b) A comparison of band centers from VASP and LOBSTER indicates excellent agreement between the band features.

The final database is provided in two forms, both as JSON files. The first contains only summarized information necessary for gaining a quick insight into the chemical bonding scenario of the compound in question. This provides crucial information like, the number of ions, covalent bond strengths, coordination environment for ions, electrostatic charge of the structure, bonding and antibonding contributions to the bonds, and many more based on our quantum-chemical computations. The second database contains data from all important LOBSTER computations, including the calculation settings used, which are essential to reproduce our results. Please check our article and the codes provided for more details on where to find these data and how to access them.

Rigorous testing has ensured that the data we provide are reliable, and it also guarantees that the information contained within the database is of the highest quality, ready to drive further scientific discovery. We also demonstrate the dataset’s potential, where we leveraged the bonding descriptors (from our dataset) to construct a machine learning (ML) model for predicting phononic properties. The results showed a 27% increase in prediction accuracy compared to a benchmark model that did not rely on quantum-chemical bonding features.

Finally, let’s not forget about the critical component of the open-access code contributions that made our work reproducible. Getting them ready to be made openly accessible was filled with unseen hours of coding, extending existing code capabilities, code reviews, and rigorous testing. Thanks to this effort, we can now distribute the dataset and the codes to the data-driven materials research community. We believe that this will further advance the material informatics field.

We also plan to make our data accessible via a database server platform for seamless access. Stay tuned for further updates from us.

Figures 2 and 3 are adapted from our article. Link to our article: https://www.nature.com/articles/s41597-023-02477-5

All figures are licensed under Creative Commons CC BY https://creativecommons.org/licenses/by/4.0/.

References

[1] S. Maintz, V. L. Deringer, A. L. Tchougréeff, R. Dronskowski, J. Comput. Chem. 2016, 37, 1030–1035.

[2] R. Nelson, C. Ertural, J. George, V. L. Deringer, G. Hautier, R. Dronskowski, J. Comput. Chem. 2020, 41, 1931–1940.

[3] V. L. Deringer, A. L. Tchougréeff, R. Dronskowski, J. Phys. Chem. A 2011, 115, 5461–5466.

[4] S. Maintz, V. L. Deringer, A. L. Tchougréeff, R. Dronskowski, J. Comput. Chem. 2013, 34, 2557–2567.

[5] J. George, G. Petretto, A. Naik, M. Esters, A. J. Jackson, R. Nelson, R. Dronskowski, G. Rignanese, G. Hautier, ChemPlusChem 2022, 87.

[6] G. Kresse, J. Furthmüller, Phys. Rev. B 1996, 54, 11169–11186.

[7] G. Kresse, J. Furthmüller, Comput. Mater. Sci. 1996, 6, 15–50.

[8] G. Kresse, J. Hafner, Phys. Rev. B 1993, 47, 558–561.

[9] J. George, G. Petretto, A. Naik, M. Esters, A. J. Jackson, R. Nelson, R. Dronskowski, G.-M. Rignanese, G. Hautier, 2023, 10.5281/ZENODO.7776029.

Please sign in or register for FREE

If you are a registered user on Research Communities by Springer Nature, please sign in

Follow the Topic

Materials Science
Physical Sciences > Materials Science

Related Collections

With collections, you can get published faster and increase your visibility.

Epidemiological data

This Collection presents a series of articles describing epidemiological datasets spanning diverse populations, ecosystems, and disease contexts. Data are presented without hypotheses or significant analyses, and can be derived from population surveys, health registries, electronic health records, field sampling, or other sources.

Publishing Model: Open Access

Deadline: Dec 22, 2024

Metabolomics

This collection presents a series of articles describing metabolomics datasets, covering data from any organism type, collected via any valid metabolomic technique, and for any application.

Publishing Model: Open Access

Deadline: Nov 28, 2024