Active Transfer Learning for Ocean Modeling (ATL)

MSEAS Ocean Test Cases

P.F.J. Lermusiaux, P. Lu, M.P. Ueckermann,
P.J. Haley, Jr., W.G. Leslie, T. Lolla

Massachusetts Institute of Technology
Center for Ocean Engineering
Mechanical Engineering
Cambridge, Massachusetts


This research sponsored by the
Office of Naval Research.

1: Learning in Flows behind a Cylinder or Island
2: Learning for 2-D Biogeochemical Dynamical Systems
3: Feature Extraction in 3-D Time-dependent Multivariate Ocean Fields

(Click on any test case to reach the description)


In what follows, we define a set of ocean test cases for Active Transfer Learning. The test cases are intended to be not too simple but not too complicated, so that ideas can be generated and tested, and algorithms and methods evaluated, in fast but relevant enough situations. We provide three types of cases, as well as software to run them and/or actual data sets.

1: Learning in Flows behind a Cylinder or Island

  • Feature extraction
    • Can prominent flow features, such as vortices, be identified and used for flow characterization (regime classification)?
    • Learn the feature dynamics:
      • Given flow fields from a limited set of experiments, can accurate feature field estimates be predicted?
      • Can it be done for flows with arbitrary, possibly time-varying parameters?
  • Dynamics classification
    • Can key global parameters, e.g. Reynolds number, be identified based on sparse observations of flow fields?
      • For ex: learn Re(t), Strouhal number, or other global measures
  • Model learning
    • Can specific model functions be estimated?
      • For ex: Is Coriolis active? What is the form of the mixing terms?


2: Learning for 2-D Biogeochemical Dynamical Systems

  • Model dynamics: Nutrient-Phytoplankton-Zooplankton (NPZ)
    • Various regimes: single stable points and stable limit cycles
    • Biomass: N+P+Z = conserved at every point
  • MSEAS ATL Problems:
    • Dynamics classification:
      • Discover unknown biomass, biomass (time)
      • Discover dynamical regimes, etc.
    • Inductive model learning:
      • Learn specific terms, e.g. unknown functionals, functionals (time), etc.
      • Discover parameters and structures of the 0D biological reaction terms
      • Determine when/where certain terms matter: e.g. when do the physical terms matter?
This test case 2 is concerned with discovering and learning biological-physical dynamics from sparse ocean data and/or approximate model information. Such learning includes quantitative learning of parameter values, model functionals and model structures, or in other words, discovering the model equations for the system as well as the initial and boundary conditions. It also involves learning more qualitative descriptions (but nonetheless scientific) of the system, such as describing dynamical regimes, discovering global properties of the biological system, determining when and why the biological dynamics suddenly change, and providing locations and periods over which certain term or process dominate, etc. Examples of terms include: advection, mixing, light-variation and grazing.

In what follows, we provide a set of examples, with an increasing level of difficulty, that aim to illustrate such applications of learning for biological-physical ocean processes. We note that transfer learning is possible from one example to the next, as well as active learning within or across examples.

All examples consist of Nutrient, Phytoplankton and Zooplankton (NPZ) dynamics and illustrate the effects of topography and other physical factors on the biogeochemical interactions.

For additional information see Ueckermann and Lermusiaux (2010) High Order Schemes for 2D Unsteady Biogeochemical Ocean Models.

    Examples

  • Example 0: the goal is to solve for the biological reaction terms based on sparse data. Three different data sets are provided, for three different dynamics, so that transfer learning and active learning are possible.
  • Example 1: the goals of this example are to:
    • Solve for the biological reaction terms in a two-dimensional in space and unsteady dynamics
    • Determine where and when the physical terms perturb the biological dynamics
    In this example 1, for one of the data sets, we provide more information so that a machine learning scheme can be trained.

  • Example 2: The goal of this example is to apply the methodologies developed in Example 1 to a more complex dynamical situation. Specific questions are:
    • Solve for the biological reaction terms (parameters and structure)
    • Among the three data sets provided, which ones have limit cycles and which ones do not, and what are the limit cycles and where do they occur?
    • Determine what has changed when compared to Example 1.

  • Example 3: The goal of this example is to apply the methodologies developed in Examples 1-2 to a more complex situation with multiple biological dynamics. Specific questions for each data set are:
    • Solve for the biological reaction terms (parameters and structure), possibly using results of Examples 1-2 to accelerate learning
    • Determine which data set contains regime changes if any, where/when do they occur, and what are their properties?

  • Example 4: The goal of this example is to utilize all of the results of Examples 0-3 above and suggest what may have changed in the dynamics, using active and transfer learning.


3: Feature extraction in 3-D time-dependent multivariate ocean fields

  • Feature extraction and discovery
    • Internal tides/waves
    • Eddies
    • Similar to "data-only" learning
  • Learn sub-grid-scale processes
  • Tuning parameters
  • Automated balances of terms, secondary variables, decompositions, etc
    • e.g. system telling user where certain type of instability occurs
  • Learn from data-model misfits and infer their causes (biases, other errors)
  • All the way to "open-ended learning for extreme modeling"


Return to top of document

Return to the MSEAS ATL home page