Physics-based statistical learning approach to mesoscopic model selection

Published in Physical Review E, 2014

Recommended citation: S. Taverniers, T.S. Haut, K. Barros, F.J. Alexander, and T. Lookman. Physics-based statistical learning approach to mesoscopic model selection. Phys. Rev. E 92, 053301 (2015). https://doi.org/10.1103/PhysRevE.92.053301

Abstract

In materials science and many other research areas, models are frequently inferred without considering their generalization to unseen data. We apply statistical learning using cross-validation to obtain an optimally predictive coarse-grained description of a two-dimensional kinetic nearest-neighbor Ising model with Glauber dynamics (GD) based on the stochastic Ginzburg-Landau equation (sGLE). The latter is learned from GD training data using a log-likelihood analysis, and its predictive ability for various complexities of the model is tested on GD test data independent of the data used to train the model on. Using two different error metrics, we perform a detailed analysis of the error between magnetization time trajectories simulated using the learned sGLE coarse-grained description and those obtained using the GD model. We show that both for equilibrium and out-of-equilibrium GD training trajectories, the standard phenomenological description using a quartic free energy does not always yield the most predictive coarse-grained model. Moreover, increasing the amount of training data can shift the optimal model complexity to higher values. Our results are promising in that they pave the way for the use of statistical learning as a general tool for materials modeling and discovery.