Machine Learning for PDEs

Overview

For my machine learning class, we explored how machine learning can recover the underlying parameters of a spring lattice material purely from displacement data observed under forcing. My focus was the PDE-based approach, in which a spring lattice system serves as a discrete approximation to a continuous material. The goal was to train a neural network to learn the material's constitutive law, specifically the stress-strain law, from displacement and boundary reaction data alone, enabling the discovery of physical material properties from observed deformation. The other approach explored was an ODE-based approach, through which the underlying spring constants could be identified directly, rather than a continuum constitutive approximation.

Problem Setup

The domain is a 2D spring lattice with nodes connected by nearest-neighbor springs. Two material models were considered: linear springs and nonlinear Duffing springs, which produce strain-stiffening behavior. The lattice is subjected to three distinct load cases, vertical pull, shear, and biaxial loading, each at five displacement levels, giving 15 training snapshots. Each load case illuminates a different component of the constitutive tensor, making all three necessary to fully identify the material law from the available measurements. The measured boundary reaction forces were recorded, and the displacement fields together with these reaction forces served as the training data for the model.

Approach

Displacement to Strain

Simulated nodal displacements from the lattice were converted to continuous strain fields using Gaussian kernel smoothing, with spatial derivatives computed analytically by differentiating the kernel. This avoids the noise amplification of finite differencing and produces clean strain fields.

Neural Network Architecture

The constitutive law was parameterised as an Input Convex Neural Network (ICNN) that learns the strain energy density as a function of two strain invariants. Stresses are recovered by automatic differentiation of the energy with respect to strain. The ICNN architecture enforces convexity of the energy by construction, a fundamental physical requirement that significantly improves learning.

Physics-Constrained Loss

Training is fully unsupervised with respect to stress. The loss has two terms: an interior equilibrium term that penalises violation of momentum balance across the domain, and a boundary reaction term that matches predicted integrated stresses to measured global reaction forces. The reaction term pins the absolute stress scale, which the equilibrium term alone cannot determine. An adaptive penalty schedule automatically balances the two terms throughout training to prevent either from dominating.

Training

The model was trained using Adam with cosine annealing and gradient clipping, with best-checkpoint tracking across the full training run. An ensemble of 10 independently seeded runs was trained, with the best selected by minimum evaluation loss. This ensemble approach guards against poor local minima and provides confidence in the result.

Results

The model successfully recovered constitutive stress-strain curves consistent with both the linear and nonlinear material models, with boundary reaction predictions closely matching measured values. Interior force balance was satisfied across the domain. The multi-load-case training strategy was essential, as single load cases left components of the constitutive tensor unconstrained, while the combined three-case training produced a fully identified material law. As a final validation, displacements were simulated using FEM with the learned constitutive law and closely matched the original input displacements used for training. This project provided valuable insight into what is required for successful learning for physical material systems and laid the groundwork for learning important material properties from simply observing displacements.

wdoster@mit.edu