Abstract
Accurate and explainable artificial-intelligence (AI) models are promising tools for accelerating the discovery of new materials. Recently, symbolic regression has become an increasingly popular tool for explainable AI because it yields models that are relatively simple analytical descriptions of target properties. Due to its deterministic nature, the sure-independence screening and sparsifying operator (SISSO) method is a particularly promising approach for this application. Here, we describe the new advancements of the SISSO algorithm, as implemented into SISSO++, a C++ code with Python bindings. We introduce a new representation of the mathematical expressions found by SISSO. This is a first step toward introducing “grammar” rules into the feature creation step. Importantly, by introducing a controlled nonlinear optimization to the feature creation step, we expand the range of possible descriptors found by the methodology. Finally, we introduce refinements to the solver algorithms for both regression and classification, which drastically increase the reliability and efficiency of SISSO. For all these improvements to the basic SISSO algorithm, we not only illustrate their potential impact but also fully detail how they operate both mathematically and computationally.
| Original language | English (US) |
|---|---|
| Article number | 114110 |
| Journal | Journal of Chemical Physics |
| Volume | 159 |
| Issue number | 11 |
| DOIs | |
| State | Published - Sep 21 2023 |
| Externally published | Yes |
ASJC Scopus subject areas
- General Physics and Astronomy
- Physical and Theoretical Chemistry
Fingerprint
Dive into the research topics of 'Recent advances in the SISSO method and their implementation in the SISSO++ code'. Together they form a unique fingerprint.Cite this
- APA
- Standard
- Harvard
- Vancouver
- Author
- BIBTEX
- RIS