Publications by authors named "Igor Poltavsky"

Atomistic simulations are routinely employed in academia and industry to study the behavior of molecules, materials, and their interfaces. Central to these simulations are force fields (FFs), whose development is challenged by intricate interatomic interactions at different spatio-temporal scales and the vast expanse of chemical space. Machine learning (ML) FFs, trained on quantum-mechanical energies and forces, have shown the capacity to achieve sub-kcal (mol Å) accuracy while maintaining computational efficiency.

View Article and Find Full Text PDF

We present the second part of the rigorous evaluation of modern machine learning force fields (MLFFs) within the TEA Challenge 2023. This study provides an in-depth analysis of the performance of MACE, SO3krates, sGDML, SOAP/GAP, and FCHL19* in modeling molecules, molecule-surface interfaces, and periodic materials. We compare observables obtained from molecular dynamics (MD) simulations using different MLFFs under identical conditions.

View Article and Find Full Text PDF

Machine-learned computational chemistry has led to a paradoxical situation in which molecular properties can be accurately predicted, but they are difficult to interpret. Explainable AI (XAI) tools can be used to analyze complex models, but they are highly dependent on the AI technique and the origin of the reference data. Alternatively, interpretable real-space tools can be employed directly, but they are often expensive to compute.

View Article and Find Full Text PDF

As the sophistication of machine learning force fields (MLFF) increases to match the complexity of extended molecules and materials, so does the need for tools to properly analyze and assess the practical performance of MLFFs. To go beyond average error metrics and into a complete picture of a model's applicability and limitations, we developed FFAST (force field analysis software and tools): a cross-platform software package designed to gain detailed insights into a model's performance and limitations, complete with an easy-to-use graphical user interface. The software allows the user to gauge the performance of any molecular force field,─such as popular state-of-the-art MLFF models, ─ on various popular data set types, providing general prediction error overviews, outlier detection mechanisms, atom-projected errors, and more.

View Article and Find Full Text PDF

Machine learning force fields (MLFFs) are gradually evolving towards enabling molecular dynamics simulations of molecules and materials with ab initio accuracy but at a small fraction of the computational cost. However, several challenges remain to be addressed to enable predictive MLFF simulations of realistic molecules, including: (1) developing efficient descriptors for non-local interatomic interactions, which are essential to capture long-range molecular fluctuations, and (2) reducing the dimensionality of the descriptors to enhance the applicability and interpretability of MLFFs. Here we propose an automatized approach to substantially reduce the number of interatomic descriptor features while preserving the accuracy and increasing the efficiency of MLFFs.

View Article and Find Full Text PDF

In chemistry and physics, machine learning (ML) methods promise transformative impacts by advancing modeling and improving our understanding of complex molecules and materials. Each ML method comprises a mathematically well-defined procedure, and an increasingly larger number of easy-to-use ML packages for modeling atomistic systems are becoming available. In this Perspective, we discuss the general aspects of ML techniques in the context of creating ML force fields.

View Article and Find Full Text PDF

The training set of atomic configurations is key to the performance of any Machine Learning Force Field (MLFF) and, as such, the training set selection determines the applicability of the MLFF model for predictive molecular simulations. However, most atomistic reference datasets are inhomogeneously distributed across configurational space (CS), and thus, choosing the training set randomly or according to the probability distribution of the data leads to models whose accuracy is mainly defined by the most common close-to-equilibrium configurations in the reference data. In this work, we combine unsupervised and supervised ML methods to bypass the inherent bias of the data for common configurations, effectively widening the applicability range of the MLFF to the fullest capabilities of the dataset.

View Article and Find Full Text PDF

In recent years, the use of machine learning (ML) in computational chemistry has enabled numerous advances previously out of reach due to the computational complexity of traditional electronic-structure methods. One of the most promising applications is the construction of ML-based force fields (FFs), with the aim to narrow the gap between the accuracy of methods and the efficiency of classical FFs. The key idea is to learn the statistical relation between chemical structure and potential energy without relying on a preconceived notion of fixed chemical bonds or knowledge about the relevant interactions.

View Article and Find Full Text PDF

Dynamics of flexible molecules are often determined by an interplay between local chemical bond fluctuations and conformational changes driven by long-range electrostatics and van der Waals interactions. This interplay between interactions yields complex potential-energy surfaces (PESs) with multiple minima and transition paths between them. In this work, we assess the performance of the state-of-the-art Machine Learning (ML) models, namely, sGDML, SchNet, Gaussian Approximation Potentials/Smooth Overlap of Atomic Positions (GAPs/SOAPs), and Behler-Parrinello neural networks, for reproducing such PESs, while using limited amounts of reference data.

View Article and Find Full Text PDF

Imaginary time path-integral (PI) simulations that account for nuclear quantum effects (NQE) beyond the harmonic approximation are increasingly employed together with modern electronic-structure calculations. Existing PI methods are applicable to molecules, liquids, and solids; however, the computational cost of such simulations increases dramatically with decreasing temperature. To address this challenge, here, we propose to combine high-order PI factorization with perturbation theory (PT).

View Article and Find Full Text PDF

We present the construction of molecular force fields for small molecules (less than 25 atoms) using the recently developed symmetrized gradient-domain machine learning (sGDML) approach [Chmiela et al., Nat. Commun.

View Article and Find Full Text PDF

Trioxatriangulenium (TOTA) platform molecules were functionalized with methyl, ethyl, ethynyl, propynyl, and hydrogen and sublimated onto Au(111) surfaces. Low-temperature scanning tunneling microscopy data reveal that >99% of ethyl-TOTA and methyl-TOTA remain intact, whereas 60% of H-TOTA and >99% of propynyl-TOTA and ethynyl-TOTA decompose. The observed tendency toward fragmentation on Au(111) is opposite to the sequence of gas-phase stabilities of the molecules.

View Article and Find Full Text PDF

Engineering of atomically thin membranes for hydrogen isotope separation is an actual challenge which has a broad range of applications. Recent experiments [M. Lozada-Hidalgo et al.

View Article and Find Full Text PDF

Nuclear quantum effects (NQE), which include both zero-point motion and tunneling, exhibit quite an impressive range of influence over the equilibrium and dynamical properties of molecules and materials. In this work, we extend our recently proposed perturbed path-integral (PPI) approach for modeling NQE in molecular systems [I. Poltavsky and A.

View Article and Find Full Text PDF

Using conservation of energy-a fundamental property of closed classical and quantum mechanical systems-we develop an efficient gradient-domain machine learning (GDML) approach to construct accurate molecular force fields using a restricted number of samples from ab initio molecular dynamics (AIMD) trajectories. The GDML implementation is able to reproduce global potential energy surfaces of intermediate-sized molecules with an accuracy of 0.3 kcal mol for energies and 1 kcal mol Å̊ for atomic forces using only 1000 conformational geometries for training.

View Article and Find Full Text PDF

We investigate the thermal and electronic collective fluctuations that contribute to the finite-temperature adsorption properties of flexible adsorbates on surfaces on the example of the molecular switch azobenzene C_{12}H_{10}N_{2} on the Ag(111) surface. Using first-principles molecular dynamics simulations, we obtain the free energy of adsorption that accurately accounts for entropic contributions, whereas the inclusion of many-body dispersion interactions accounts for the electronic correlations that govern the adsorbate binding. We find the adsorbate properties to be strongly entropy driven, as can be judged by a kinetic molecular desorption prefactor of 10^{24}  s^{-1} that largely exceeds previously reported estimates.

View Article and Find Full Text PDF

The quantum nature of nuclear motions plays a vital role in the structure, stability, and thermodynamics of molecules and materials. The standard approach to model nuclear quantum fluctuations in chemical and biological systems is to use path-integral molecular dynamics. Unfortunately, conventional path-integral simulations can have an exceedingly large computational cost due to the need to employ an excessive number of coupled classical subsystems (beads) for quantitative accuracy.

View Article and Find Full Text PDF