headgraphic
loader graphic

Loading content ...

Surface Drifter Trajectory Prediction in the Gulf of Mexico Using Neural Networks

Grossi, M.D., S. Jegelka, P.F.J. Lermusiaux, and T.M. Özgökmen, 2024. Surface Drifter Trajectory Prediction in the Gulf of Mexico Using Neural Networks. Ocean Modelling, sub-judice. Special issue: Machine Learning for Ocean Modelling.

Machine learning techniques are applied to Lagrangian trajectory prediction, which is important in oceanography for providing guidance to search and rescue efforts, forecasting the spread of harmful algal blooms, and tracking pollutants and marine debris. This study evaluates the ability of two types of neural networks for learning ocean trajectories from nearly 250 surface drifters released during the Grand Lagrangian Deployment in the Gulf of Mexico from Jul-Oct 2012. First, simple fully connected neural networks were trained to predict an individual drifter’s trajectory over 24 h and 5 d time windows using only that drifter’s previous velocity time series. These networks, despite having successfully learned modeled trajectories in a previous study, failed to outperform common autoregressive models in any of the tests conducted. This was true even when drifters were pre-sorted into geospatial groups based on past trajectories and different networks were trained on each group to reduce the variability that each network had to learn. In contrast, a more sophisticated social spatio-temporal graph convolutional neural network (SST-GCNN), originally developed for learning pedestrian trajectories, demonstrated greater potential due to two important features: learning spatial and temporal patterns simultaneously, and sharing information between similarly-behaving drifters to facilitate the prediction of any particular drifter. Position forecast errors averaged around 60km at day 5, roughly 20km lower than autoregression, and even better for certain subsets of drifters. The passage of Tropical Cyclone Isaac over the drifter array as a tropical storm and category 1 hurricane provided a unique opportunity to also explore whether these models would benefit from adding wind as a predictor when making short 24 h forecasts. The SST-GCNNs were found to not benefit from wind on average, though certain subsets of drifters (based on deployment) exhibited slightly lower forecast errors at hour 24 with the addition of wind.

Share

Evaluation of Deep Neural Operator Models toward Ocean Forecasting

Rajagopal, E., A.N.S. Babu, T. Ryu, P.J. Haley, Jr., C. Mirabito, and P.F.J. Lermusiaux, 2023. Evaluation of Deep Neural Operator Models toward Ocean Forecasting. In: OCEANS '23 IEEE/MTS Gulf Coast, 25–28 September 2023. doi:10.23919/OCEANS52994.2023.10337380

Data-driven, deep-learning modeling frameworks have been recently developed for forecasting time series data. Such machine learning models may be useful in multiple domains including the atmospheric and oceanic ones, and in general, the larger fluids community. The present work investigates the possible effectiveness of such deep neural operator models for reproducing and predicting classic fluid flows and simulations of realistic ocean dynamics. We first briefly evaluate the capabilities of such deep neural operator models when trained on a simulated two-dimensional fluid flow past a cylinder. We then investigate their application to forecasting ocean surface circulation in the Middle Atlantic Bight and Massachusetts Bay, learning from high-resolution data-assimilative simulations employed for real sea experiments. We confirm that trained deep neural operator models are capable of predicting idealized periodic eddy shedding. For realistic ocean surface flows and our preliminary study, they can predict several of the features and show some skill, providing potential for future research and applications.

Share

Generalized Neural Closure Models with Interpretability

Gupta, A., and P.F.J. Lermusiaux, 2023. Generalized Neural Closure Models with Interpretability. Scientific Reports 13, 10364. doi:10.1038/s41598-023-35319-w

Improving the predictive capability and computational cost of dynamical models is often at the heart of augmenting computational physics with machine learning (ML). However, most learning results are limited in interpretability and generalization over different computational grid resolutions, initial and boundary conditions, domain geometries, and physical or problem-specific parameters. In the present study, we simultaneously address all these challenges by developing the novel and versatile methodology of unified neural partial delay differential equations. We augment existing/low-fidelity dynamical models directly in their partial differential equation (PDE) forms with both Markovian and non-Markovian neural network (NN) closure parameterizations. The melding of the existing models with NNs in the continuous spatiotemporal space followed by numerical discretization automatically allows for the desired generalizability. The Markovian term is designed to enable extraction of its analytical form and thus provides interpretability. The non-Markovian terms allow accounting for inherently missing time delays needed to represent the real world. Our flexible modeling framework provides full autonomy for the design of the unknown closure terms such as using any linear-, shallow-, or deep-NN architectures, selecting the span of the input function libraries, and using either or both Markovian and non-Markovian closure terms, all in accord with prior knowledge. We obtain adjoint PDEs in the continuous form, thus enabling direct implementation across differentiable and non-differentiable computational physics codes, different ML frameworks, and treatment of nonuniformly-spaced spatiotemporal training data. We demonstrate the new generalized neural closure models (gnCMs) framework using four sets of experiments based on advecting nonlinear waves, shocks, and ocean acidification models. Our learned gnCMs discover missing physics, find leading numerical error terms, discriminate among candidate functional forms in an interpretable fashion, achieve generalization, and compensate for the lack of complexity in simpler models. Finally, we analyze the computational advantages of our new framework.

Share

Bayesian Learning of Coupled Biogeochemical-Physical Models

Gupta, A. and P.F.J. Lermusiaux, 2023. Bayesian Learning of Coupled Biogeochemical-Physical Models. Progress in Oceanography 216, 103050. doi:10.1016/j.pocean.2023.103050

Predictive dynamical models for marine ecosystems are used for a variety of needs. Due to the sparse measurements and limited understanding of the myriad of ocean processes, there is however significant uncertainty. There is model uncertainty in the parameter values, functional forms with diverse parameterizations, and level of complexity needed, and thus in the state variable fields. We develop a Bayesian model learning methodology that allows interpolation in the space of candidate dynamical models and discovery of new models from noisy, sparse, and indirect observations, all while estimating state variable fields and parameter values, as well as the joint probability distributions of all learned quantities. We address the challenges of high-dimensional and multidisciplinary dynamics governed by partial differential equations (PDEs) by using state augmentation and the computationally efficient Gaussian Mixture Model – Dynamically Orthogonal filter. Our innovations include stochastic formulation parameters and stochastic complexity parameters to unify candidate models into a single general model as well as stochastic expansion parameters within piecewise function approximations to generate dense candidate model spaces. These innovations allow handling many compatible and embedded candidate models, possibly none of which are accurate, and learning elusive unknown functional forms that augment these models. Our new Bayesian methodology is generalizable and interpretable. It seamlessly and rigorously discriminates among existing models, but also extrapolates out of the space of models to discover new ones. We perform a series of twin experiments based on flows past a ridge coupled with three-to-five component ecosystem models, including flows with chaotic advection. We quantify the learning skill, and evaluate convergence and the sensitivity to hyper-parameters. Our PDE framework successfully discriminates among functional forms and model complexities, and learns in the absence of prior knowledge by searching in dense function spaces. The probabilities of known, uncertain, and unknown model formulations, and of biogeochemical-physical fields and parameters, are updated jointly using Bayes’ law. Non-Gaussian statistics, ambiguity, and biases are captured. The parameter values and the model formulations that best explain the noisy, sparse, and indirect data are identified. When observations are sufficiently informative, model complexity and model functions are discovered.

Share

Toward Dynamic Data-Driven Systems for Rapid Adaptive Interdisciplinary Ocean Forecasting

Patrikalakis, N.M., P.F.J. Lermusiaux, C. Evangelinos, J.J. McCarthy, A.R. Robinson, H. Schmidt, P.J. Haley, S. Lalis, R. Tian, W.G. Leslie, and W. Cho, 2023. Toward Dynamic Data-Driven Systems for Rapid Adaptive Interdisciplinary Ocean Forecasting. Chapter 14, Handbook of Dynamic Data Driven Applications Systems, F. Darema, E.P. Blasch, S. Ravela, and A.J. Aved (Eds.), pp. 377-395. doi:10.1007/978-3-031-27986-7_14

The state of the ocean evolves and its dynamics involves transitions occurring at multiple scales. For efficient and rapid interdisciplinary forecasting, ocean observing and prediction systems must have the same behavior and adapt to the ever-changing dynamics. This chapter sets the basis of a distributed system for real-time interdisciplinary ocean field and uncertainty forecasting with adaptive modeling and adaptive sampling. The scientific goal is to couple physical and biological oceanography with ocean acoustic measurements. The technical goal is to build a dynamic modeling and instrumentation system based on advanced infrastructures, distributed/grid computing, and efficient information retrieval and visualization interfaces, from which all these are incorporated into the Poseidon system. Importantly, the Poseidon system combines a suite of modern legacy physical models, acoustic models, and ocean current monitoring data assimilation schemes with innovative modeling and adaptive sampling methods. The legacy systems are encapsulated at the binary level using software component methodologies. Measurement models are utilized to link the observed data to the dynamical model variables and structures. With adaptive sampling, the data acquisition is dynamic and aims to minimize the predicted uncertainties, maximize the optimized sampling of key dynamics, and maintain overall coverage. With adaptive modeling, model improvements dynamically select the best model structures and parameters among different physical or biogeochemical parameterizations. The dynamic coupling of models and measurements discussed here, and embodied in the Poseidon system, represents a Dynamic Data-Driven Applications Systems (DDDAS). Technical and scientific progress is highlighted based on examples in Massachusetts Bay, Monterey Bay, and the California Current System.

Share

Deep Reinforcement Learning for Adaptive Mesh Refinement

Foucart, C., A. Charous, and P.F.J. Lermusiaux, 2023. Deep Reinforcement Learning for Adaptive Mesh Refinement. Journal of Computational Physics 491, 112381. doi:10.1016/j.jcp.2023.112381

Finite element discretizations of problems in computational physics often rely on adaptive mesh refinement (AMR) to preferentially resolve regions containing important features during simulation. However, these spatial refinement strategies are often heuristic and rely on domain-specific knowledge or trial-and-error. We treat the process of adaptive mesh refinement as a local, sequential decision-making problem under incomplete information, formulating AMR as a partially observable Markov decision process. Using a deep reinforcement learning approach, we train policy networks for AMR strategy directly from numerical simulation. The training process does not require an exact solution or a high-fidelity ground truth to the partial differential equation at hand, nor does it require a pre-computed training dataset. The local nature of our reinforcement learning formulation allows the policy network to be trained inexpensively on much smaller problems than those on which they are deployed. The methodology is not specific to any particular partial differential equation, problem dimension, or numerical discretization, and can flexibly incorporate diverse problem physics. To that end, we apply the approach to a diverse set of partial differential equations, using a variety of high-order discontinuous Galerkin and hybridizable discontinuous Galerkin finite element discretizations. We show that the resultant deep reinforcement learning policies are competitive with common AMR heuristics, generalize well across problem classes, and strike a favorable balance between accuracy and cost such that they often lead to a higher accuracy per problem degree of freedom.

Share

Bayesian Learning of Stochastic Dynamical Models

Lu, P., and P.F.J. Lermusiaux, 2021. Bayesian Learning of Stochastic Dynamical Models. Physica D 427: 133003. doi:10.1016/j.physd.2021.133003

A new methodology for rigorous Bayesian learning of high-dimensional stochastic dynamical models is developed. The methodology performs parallelized computation of marginal likelihoods for multiple candidate models, integrating over all state variable and parameter values, and enabling a principled Bayesian update of model distributions. This is accomplished by leveraging the dynamically orthogonal (DO) evolution equations for uncertainty prediction in a dynamic stochastic subspace and the Gaussian Mixture Model-DO filter for inference of nonlinear state variables and parameters, using reduced-dimension state augmentation to accommodate models featuring uncertain parameters. Overall, the joint Bayesian inference of the state, model equations, geometry, boundary conditions, and initial conditions is performed. Results are exemplified using two high-dimensional, nonlinear simulated fluid and ocean systems. For the first, limited measurements of fluid flow downstream of an obstacle are used to perform joint inference of the obstacle’s shape, the Reynolds number, and the O(105) fluid velocity state variables. For the second, limited measurements of the concentration of a microorganism advected by an uncertain flow are used to perform joint inference of the microorganism’s reaction equation and the O(105) microorganism concentration and ocean velocity state variables. When the observations are sufficiently informative about the learning objectives, we find that our posterior model probabilities correctly identify either the true model or the most plausible models, even in cases where a human would be challenged to do the same.

Share

Neural Closure Models for Dynamical Systems

Gupta, A. and P.F.J. Lermusiaux, 2021. Neural Closure Models for Dynamical Systems. Proceedings of The Royal Society A, 477(2252), pp. 1–29. doi:10.1098/rspa.2020.1004

Complex dynamical systems are used for predictions in many domains. Because of computational costs, models are truncated, coarsened, or aggregated. As the neglected and unresolved terms become important, the utility of model predictions diminishes. We develop a novel, versatile, and rigorous methodology to learn non-Markovian closure parameterizations for known-physics/low-fidelity models using data from high-fidelity simulations. The new neural closure models augment low-fidelity models with neural delay differential equations (nDDEs), motivated by the Mori-Zwanzig formulation and the inherent delays in complex dynamical systems. We demonstrate that neural closures efficiently account for truncated modes in reduced-order-models, capture the effects of subgrid-scale processes in coarse models, and augment the simplification of complex biological and physical-biogeochemical models. We find that using non-Markovian over Markovian closures improves long-term prediction accuracy and requires smaller networks. We derive adjoint equations and network architectures needed to efficiently implement the new discrete and distributed nDDEs, with any time-integration scheme and allowing nonuniformly-spaced temporal training data. The performance of discrete over distributed delays in closure models is explained using information theory, and we find an optimal amount of past information for a specified architecture. Finally, we analyze computational complexity and explain the limited additional cost due to neural closure models.

Share

Towards Bayesian Ocean Physical-Biogeochemical-Acidification Prediction and Learning Systems for Massachusetts Bay

Haley, Jr., P.J., A. Gupta, C. Mirabito, and P. F. J. Lermusiaux, 2020. Towards Bayesian Ocean Physical-Biogeochemical-Acidification Prediction and Learning Systems for Massachusetts Bay. In: OCEANS '20 IEEE/MTS, 5-30 October 2020, pp. 1-9, doi:10.1109/IEEECONF38699.2020.9389210

Better quantitative understanding and accurate data-assimilative predictions of the three-dimensional and time-dependent ocean acidification (OA) processes in coastal regions is urgently needed for the protection and sustainable utilization of ocean resources. In this paper, we extend and showcase the use of our MIT-MSEAS systems for high-resolution coupled physical-biogeochemical-acidification simulations and Bayesian learning of OA models in Massachusetts Bay, starting with simple empirical and equilibrium OA models. Simulations are shown to have reasonable skill when compared to available in situ and remote data. The impacts of wind forcing, internal tides, and solitary waves on water transports and mixing, and OA fields, are explored. Strong wind events are shown to reset circulations and the OA state in the Bay. Internal tides increase vertical mixing of waters in the shallow regions. Solitary waves propagating off Stellwagen Bank coupled with lateral turbulent mixing provide a pathway for exchange of surface and deep waters. Both of these effects are shown to impact biological activity and OA. A mechanism for the creation of multiple subsurface chlorophyll maxima is presented, involving wind-induced upwelling, internal tides, and advection of near surface fields. Due to the measurement sparsity and limited understanding of complex OA processes, the state variables and parameterizations of OA models are very uncertain. We thus present a proof-of-concept study to simultaneously learn and estimate the OA state variables and model parameterizations from sparse observations using our novel dynamics-based Bayesian learning framework for high-dimensional and multi-disciplinary estimation. Results are found to be encouraging for more realistic OA model learning.

Share

Sparse Regression and Adaptive Feature Generation for the Discovery of Dynamical Systems

Kulkarni, C.S., A. Gupta, and P.F.J. Lermusiaux, 2020. Sparse Regression and Adaptive Feature Generation for the Discovery of Dynamical Systems. In: Darema, F., E. Blasch, S. Ravela, and A. Aved (eds.), Dynamic Data Driven Application Systems. DDDAS 2020. Lecture Notes in Computer Science 12312, 208–216. doi:10.1007/978-3-030-61725-7_25

We study the performance of sparse regression methods and propose new techniques to distill the governing equations of nonlinear dynamical systems from data. We start from the recently proposed generic methodology of learning interpretable equation forms from data, followed by performance of least absolute shrinkage and selection operator (LASSO) for this purpose. We first develop an algorithm that uses the dual of LASSO optimization for higher accuracy and stability. We then derive a second algorithm that learns the candidate function library in a dynamic data driven applications systems (DDDAS) manner to distill the governing equations of the dynamical system. This is achieved via sequentially thresholded ridge regression (STRidge) over a orthogonal polynomial space. The performance of the methods is illustrated using the Lorenz 63 system and a marine ecosystem model.

Share

Fish Modeling and Bayesian Learning for the Lakshadweep Islands

Gupta, A., P.J. Haley, D.N. Subramani, and P.F.J. Lermusiaux, 2019. Fish Modeling and Bayesian Learning for the Lakshadweep Islands. In: OCEANS '19 MTS/IEEE Seattle, 27-31 October 2019, doi: 10.23919/OCEANS40490.2019.8962892

In fish modeling, a significant amount of uncertainty exists in the parameter values, parameterizations, functional form of model equations, and even the state variables themselves. This is due to the complexity and lack of understanding of the processes involved, as well as the measurement sparsity. These challenges motivate the present proof-of-concept study to simultaneously learn and estimate the state variables, parameters, and model equations from sparse observations. We employ a novel dynamics-based Bayesian learning framework for high-dimensional, coupled fish-biogeochemical-physical partial-differential equations (PDEs) models, allowing the simultaneous inference of the augmented state variables and parameters. After reviewing the status of ecosystem modeling in the coastal oceans, we first complete a series of PDE-based learning experiments that showcase capabilities for fish-biogeochemical-physical model equations and parameters, using nonhydrostatic Boussinesq flows past a seamount. We then showcase realistic ocean primitive-equation simulations and analyses, using fish catch data  for the Lakshadweep islands in India. These modeling and learning efforts could improve fisheries management from a standpoint of sustainability and efficiency.

Share

Intelligent Systems for Geosciences: An Essential Research Agenda

Gil, Y., S.A. Pierce, H. Babaie, A. Banerjee, K. Borne, G. Bust, M. Cheatham, I. Ebert-Uphoff, C. Gomes, M. Hill, J. Horel, L. Hsu, J. Kinter, C. Knoblock, D. Krum, V. Kumar, P.F.J. Lermusiaux, Y. Liu, C. North, V. Pankratius, S. Peters, B. Plale, A. Pope, S. Ravela, J. Restrepo, A. Ridley, H. Samet, and S. Shekhar, 2019. Intelligent Systems for Geosciences: An Essential Research Agenda. Communications of the ACM, 62(1), 76–84. doi:10.1145/3192335

Many aspects of geosciences pose novel problems for intelligent systems research. Geoscience data is challenging because it tends to be uncertain, intermittent, sparse, multiresolution, and multiscale. Geosciences processes and objects often have amorphous spatiotemporal boundaries. The lack of ground truth makes model evaluation, testing, and comparison difficult. Overcoming these challenges requires breakthroughs that would significantly transform intelligent systems, while greatly benefitting the geosciences in turn. Although there have been significant and beneficial interactions between the intelligent systems and geosciences communities, the potential for synergistic research in intelligent systems for geosciences is largely untapped. A recently launched Research Coordination Network on Intelligent Systems for Geosciences followed a workshop at the National Science Foundation on this topic. This expanding network builds on the momentum of the NSF EarthCube initiative for geosciences, and is driven by practical problems in Earth, ocean, atmospheric, polar, and geospace sciences. Based on discussions and activities within this network, this article presents a research agenda for intelligent systems inspired by geosciences challenges.

Share

Data-driven Learning and Modeling of AUV Operational Characteristics for Optimal Path Planning

Edwards, J., J. Smith, A. Girard, D. Wickman, P.F.J. Lermusiaux, D.N. Subramani, P.J. Haley, Jr., C. Mirabito, C.S. Kulkarni, and, S. Jana, 2017. Data-driven Learning and Modeling of AUV Operational Characteristics for Optimal Path Planning. In: Oceans '17 MTS/IEEE Aberdeen, 1-5, 19-22 June 2017, DOI: 10.1109/OCEANSE.2017.8084779

Autonomous underwater vehicles (AUVs) are used to execute an increasingly challenging set of missions in commercial, environmental and defense industries. The resources available to the AUV in service of these missions are typically a limited power supply and onboard sensing of its local environment. Optimal path planning is needed to maximize the chances that these AUVs will successfully complete long endurance missions within their power budget. A time-optimal path planner has been recently developed to minimize AUV mission time required to traverse a dynamic ocean environment at a specified speed through the water. For many missions, time minimization is appropriate because the AUVs operate at a fixed propeller speed. However, the ultimate limiting constraint on AUV operations is often the onboard power supply, rather than mission time. While an empirical or theoretical relationship between mission time and power could be applied to estimate power usage in the path planner, the real power usage and availability on an AUV varies mission-to-mission, as a result of multiple factors, including vehicle buoyancy, battery charge cycle, fin configuration, and water type or quality. In this work, we use data collected from two mid-size AUVs operating in various conditions to learn the mission-to-mission variability in the power budget so that it could be incorporated into the mission planner.
Share

Adaptive Modeling, Adaptive Data Assimilation and Adaptive Sampling.

Lermusiaux, P.F.J, 2007. Adaptive Modeling, Adaptive Data Assimilation and Adaptive Sampling. Refereed invited manuscript. Special issue on "Mathematical Issues and Challenges in Data Assimilation for Geophysical Systems: Interdisciplinary Perspectives". C.K.R.T. Jones and K. Ide, Eds. Physica D, Vol 230, 172-196, doi: 10.1016/j.physd.2007.02.014.

For efficient progress, model properties and measurement needs can adapt to oceanic events and interactions as they occur. The combination of models and data via data assimilation can also be adaptive. These adaptive concepts are discussed and exemplified within the context of comprehensive real-time ocean observing and prediction systems. Novel adaptive modeling approaches based on simplified maximum likelihood principles are developed and applied to physical and physical-biogeochemical dynamics. In the regional examples shown, they allow the joint calibration of parameter values and model structures. Adaptable components of the Error Subspace Statistical Estimation (ESSE) system are reviewed and illustrated. Results indicate that error estimates, ensemble sizes, error subspace ranks, covariance tapering parameters and stochastic error models can be calibrated by such quantitative adaptation. New adaptive sampling approaches and schemes are outlined. Illustrations suggest that these adaptive schemes can be used in real time with the potential for most efficient sampling.
Share

Adaptive Coupled Physical and Biogeochemical Ocean Predictions: A Conceptual Basis

Lermusiaux, P.F.J, C. Evangelinos, R. Tian, P.J. Haley, J.J. McCarthy, N.M. Patrikalakis, A.R. Robinson and H. Schmidt, 2004. Adaptive Coupled Physical and Biogeochemical Ocean Predictions: A Conceptual Basis. Refereed invited manuscript, F. Darema (Ed.), Lecture Notes in Computer Science, 3038, 685-692.

Physical and biogeochemical ocean dynamics can be intermittent and highly variable, and involve interactions on multiple scales. In general, the oceanic fields, processes and interactions that matter thus vary in time and space. For efficient forecasting, the structures and parameters of models must evolve and respond dynamically to new data injected into the executing prediction system. The conceptual basis of this adaptive modeling and corresponding computational scheme is the subject of this presentation. Specifically, we discuss the process of adaptive modeling for coupled physical and biogeochemical ocean models. The adaptivity is introduced within an interdisciplinary prediction system. Model-data misfits and data assimilation schemes are used to provide feedback from measurements to applications and modify the runtime behavior of the prediction system. Illustrative examples in Massachusetts Bay and Monterey Bay are presented to highlight ongoing progress.
Share

Rapid real-time interdisciplinary ocean forecasting using adaptive sampling and adaptive modeling and legacy codes: Component encapsulation using XML

Evangelinos C., R. Chang, P.F.J. Lermusiaux and N.M. Patrikalakis, 2003. Rapid real-time interdisciplinary ocean forecasting using adaptive sampling and adaptive modeling and legacy codes: Component encapsulation using XML. Lecture Notes in Computer Science, 2660, 375-384.

We present the high level architecture of a real-time interdisciplinary ocean forecasting system that employs adaptive elements in both modeling and sampling. We also discuss an important issue that arises in creating an integrated, web-accessible framework for such a system out of existing stand-alone components: transparent support for handling legacy binaries. Such binaries, that are most common in scientific applications, expect a standard input stream, maybe some command line options, a set of input files and generate a set of output files as well as standard output and error streams. Legacy applications of this form are encapsulated using XML. We present a method that uses XML documents to describe the parameters for executing a binary.
Share