Neural FMUs in model exchange (ME) mode

Tutorial by Tobias Thummerer

Last edit: 03.09.2024

License

# Copyright (c) 2021 Tobias Thummerer, Lars Mikelsons
# Licensed under the MIT license. 
# See LICENSE (https://github.com/thummeto/FMIFlux.jl/blob/main/LICENSE) file in the project root for details.

Introduction

Functional mock-up units (FMUs) can easily be seen as containers for simulation models.

This example shows how to build a very easy neural FMU by combining a model exchange (ME) FMU and an artificial neural network (ANN). The goal is, to train the hybrid model based on a very simple simulation model.

Packages

First, import the packages needed:

# imports
using FMI                       # for importing and simulating FMUs
using FMIFlux                   # for building neural FMUs
using FMIFlux.Flux              # the default machine learning library in Julia
using FMIZoo                    # a collection of demo FMUs
using DifferentialEquations     # the (O)DE solver suite in Julia
using Plots                     # for plotting some results

import Random                   # for random variables (and random initialization)
Random.seed!(1234)              # makes our program deterministic

Random.TaskLocalRNG()

Code

Next, start and stop time are set for the simulation, as well as some intermediate time points tSave to record simulation results.

tStart = 0.0
tStep = 0.01
tStop = 5.0
tSave = collect(tStart:tStep:tStop)

501-element Vector{Float64}:
 0.0
 0.01
 0.02
 0.03
 0.04
 0.05
 0.06
 0.07
 0.08
 0.09
 0.1
 0.11
 0.12
 ⋮
 4.89
 4.9
 4.91
 4.92
 4.93
 4.94
 4.95
 4.96
 4.97
 4.98
 4.99
 5.0

Complex FMU (ground truth training data)

First, let's load a model from the FMIZoo.jl, an easy pendulum including some friction. We will use that to generate training data.

# let's load the FMU in ME-mode (some FMUs support multiple simulation modes)
fmu_gt = loadFMU("SpringFrictionPendulum1D", "Dymola", "2022x"; type=:ME)  

# and print some info
info(fmu_gt)

#################### Begin information for FMU ####################
	Model name:			SpringFrictionPendulum1D
	FMI-Version:			2.0
	GUID:				{2e178ad3-5e9b-48ec-a7b2-baa5669efc0c}
	Generation tool:		Dymola Version 2022x (64-bit), 2021-10-08
	Generation time:		2022-05-19T06:54:12Z
	Var. naming conv.:		structured
	Event indicators:		24
	Inputs:				0
	Outputs:			0
	States:				2
		33554432 ["mass.s"]
		33554433 ["mass.v", "mass.v_relfric"]
	Parameters:			12
		16777216 ["fricScale"]
		16777217 ["s0"]
		16777218 ["v0"]
		16777219 ["fixed.s0"]
		...
		16777223 ["mass.smin"]
		16777224 ["mass.v_small"]
		16777225 ["mass.L"]
		16777226 ["mass.m"]
		16777227 ["mass.fexp"]


	Supports Co-Simulation:		true
		Model identifier:	SpringFrictionPendulum1D
		Get/Set State:		true
		Serialize State:	true
		Dir. Derivatives:	true
		Var. com. steps:	true
		Input interpol.:	true
		Max order out. der.:	1
	Supports Model-Exchange:	true
		Model identifier:	SpringFrictionPendulum1D
		Get/Set State:		true
		Serialize State:	true
		Dir. Derivatives:	true
##################### End information for FMU #####################

Next, the start state x0 is defined, together with some variables to be recorded vrs (they are identified by the names that where used during export of the FMU). The FMU is simulated and the results are plotted.

# the initial state we start our simulation with, position (0.5 m) and velocity (0.0 m/s) of the pendulum
x0 = [0.5, 0.0] 

# some variables we are interested in, so let's record them: position, velocity and acceleration
vrs = ["mass.s", "mass.v", "mass.a"]  

# simulate the FMU ...
sol_gt = simulate(fmu_gt, (tStart, tStop); recordValues=vrs, saveat=tSave, x0=x0)    

# ... and plot it! (but only the recorded values, not the states)
plot(sol_gt; states=false)

[34mSimulating ME-FMU ...   0%|█                             |  ETA: N/A[39m

[34mSimulating ME-FMU ... 100%|██████████████████████████████| Time: 0:00:16[39m

svg

After the simulation, specific variables can be extracted. We will use them for the later training - as training data!

pos_gt = getValue(sol_gt, "mass.s")

501-element Vector{Float64}:
 0.5
 0.5002131418271644
 0.500854887495059
 0.5019281657516876
 0.5034351795370763
 0.50537742474533
 0.5077556973743648
 0.5105701110158936
 0.5138201163230011
 0.5175045276221266
 0.5216215241870015
 0.5261686593877334
 0.5311429006829193
 ⋮
 1.0616593561654388
 1.0627701713285898
 1.0637521434353139
 1.0646032889175199
 1.0653217119141543
 1.0659056034549985
 1.0663532416435237
 1.0666629937790932
 1.0668333163759727
 1.0668685641881956
 1.0668685641871958
 1.0668685641861957

Now, we can release the FMU again - we don't need it anymore.

unloadFMU(fmu_gt)

Simple FMU

Now, we load an even more simple system, that we use as core for our neural FMU: A pendulum without friction. Again, we load, simulate and plot the FMU and its results.

fmu = loadFMU("SpringPendulum1D", "Dymola", "2022x"; type=:ME)
info(fmu)

sol_fmu = simulate(fmu, (tStart, tStop); recordValues=vrs, saveat=tSave)
plot(sol_fmu)

#################### Begin information for FMU ####################
	Model name:			SpringPendulum1D
	FMI-Version:			2.0
	GUID:				{fc15d8c4-758b-48e6-b00e-5bf47b8b14e5}
	Generation tool:		Dymola Version 2022x (64-bit), 2021-10-08
	Generation time:		2022-05-19T06:54:23Z
	Var. naming conv.:		structured
	Event indicators:		0
	Inputs:				0
	Outputs:			0
	States:				2
		33554432 ["mass.s"]
		33554433 ["mass.v"]
	Parameters:			7
		16777216 ["mass_s0"]
		16777217 ["mass_v0"]
		16777218 ["fixed.s0"]
		16777219 ["spring.c"]
		16777220 ["spring.s_rel0"]
		16777221 ["mass.m"]
		16777222 ["mass.L"]
	Supports Co-Simulation:		true
		Model identifier:	SpringPendulum1D
		Get/Set State:		true
		Serialize State:	true
		Dir. Derivatives:	true
		Var. com. steps:	true
		Input interpol.:	true
		Max order out. der.:	1
	Supports Model-Exchange:	true
		Model identifier:	SpringPendulum1D
		Get/Set State:		true
		Serialize State:	true
		Dir. Derivatives:	true
##################### End information for FMU #####################

svg

Neural FMU

Now the fun begins, let's combine the loaded FMU and the ANN!

# get number of states
numStates = getNumberOfStates(fmu)

net = Chain(x -> fmu(x=x, dx_refs=:all),    # we can use the FMU just like any other neural network layer!
            Dense(numStates, 16, tanh),     # some additional dense layers ...
            Dense(16, 16, tanh),
            Dense(16, numStates))

# the neural FMU is constructed by providing the FMU, the net topology, start and stop time and a solver (here: Tsit5)
neuralFMU = ME_NeuralFMU(fmu, net, (tStart, tStop), Tsit5(); saveat=tSave);

Now, we can check how the neural FMU performs before the actual training!

solutionBefore = neuralFMU(x0)
plot(solutionBefore)

svg

Not that ideal... let's add our ground truth data to compare!

plot!(sol_gt; values=false)

svg

Ufff... only the starting state for position and velocity is correct. Training seems a good idea here!

Loss function

Before we can train the neural FMU, we need to define a loss function. We use the common mean-squared-error (MSE) here.

function loss(p)
    # simulate the neural FMU by calling it
    sol_nfmu = neuralFMU(x0; p=p)

    # we use the first state, because we know that's the position
    pos_nfmu = getState(sol_nfmu, 1; isIndex=true)

    # we could also identify the position state by its name
    #pos_nfmu = getState(solution, "mass.s")
    
    FMIFlux.Losses.mse(pos_gt, pos_nfmu) 
end

loss (generic function with 1 method)

Callback

Further, we define a simple logging function for our training.

global counter = 0
function callback(p)
    global counter += 1
    if counter % 20 == 1
        lossVal = loss(p[1])
        @info "Loss [$(counter)]: $(round(lossVal, digits=6))"
    end
end

callback (generic function with 1 method)

Training

For training, we only need to extract the parameters to optimize and pass it to a pre-build train command FMIFlux.train!.

optim = Adam()

p = FMIFlux.params(neuralFMU)

FMIFlux.train!(
    loss, 
    neuralFMU,
    Iterators.repeated((), 500), 
    optim; 
    cb=()->callback(p)
)

[36m[1m[ [22m[39m[36m[1mInfo: [22m[39mLoss [1]: 0.061175


[36m[1m[ [22m[39m[36m[1mInfo: [22m[39mLoss [21]: 0.040784


[36m[1m[ [22m[39m[36m[1mInfo: [22m[39mLoss [41]: 0.040451


[36m[1m[ [22m[39m[36m[1mInfo: [22m[39mLoss [61]: 0.039743


[36m[1m[ [22m[39m[36m[1mInfo: [22m[39mLoss [81]: 0.039087


[36m[1m[ [22m[39m[36m[1mInfo: [22m[39mLoss [101]: 0.03815


[36m[1m[ [22m[39m[36m[1mInfo: [22m[39mLoss [121]: 0.035558


[36m[1m[ [22m[39m[36m[1mInfo: [22m[39mLoss [141]: 0.021058


[36m[1m[ [22m[39m[36m[1mInfo: [22m[39mLoss [161]: 0.006635


[36m[1m[ [22m[39m[36m[1mInfo: [22m[39mLoss [181]: 0.004409


[36m[1m[ [22m[39m[36m[1mInfo: [22m[39mLoss [201]: 0.003957


[36m[1m[ [22m[39m[36m[1mInfo: [22m[39mLoss [221]: 0.003682


[36m[1m[ [22m[39m[36m[1mInfo: [22m[39mLoss [241]: 0.003449


[36m[1m[ [22m[39m[36m[1mInfo: [22m[39mLoss [261]: 0.00321


[36m[1m[ [22m[39m[36m[1mInfo: [22m[39mLoss [281]: 0.002995


[36m[1m[ [22m[39m[36m[1mInfo: [22m[39mLoss [301]: 0.002806


[36m[1m[ [22m[39m[36m[1mInfo: [22m[39mLoss [321]: 0.002578


[36m[1m[ [22m[39m[36m[1mInfo: [22m[39mLoss [341]: 0.002362


[36m[1m[ [22m[39m[36m[1mInfo: [22m[39mLoss [361]: 0.002152


[36m[1m[ [22m[39m[36m[1mInfo: [22m[39mLoss [381]: 0.001956


[36m[1m[ [22m[39m[36m[1mInfo: [22m[39mLoss [401]: 0.001789


[36m[1m[ [22m[39m[36m[1mInfo: [22m[39mLoss [421]: 0.001617


[36m[1m[ [22m[39m[36m[1mInfo: [22m[39mLoss [441]: 0.001453


[36m[1m[ [22m[39m[36m[1mInfo: [22m[39mLoss [461]: 0.001301


[36m[1m[ [22m[39m[36m[1mInfo: [22m[39mLoss [481]: 0.00116

Results

Finally, we can compare the results before and after training, as well as the ground truth data:

solutionAfter = neuralFMU(x0)

fig = plot(solutionBefore; stateIndices=1:1, label="Neural FMU (before)", ylabel="position [m]")
plot!(fig, solutionAfter; stateIndices=1:1, label="Neural FMU (after)")
plot!(fig, tSave, pos_gt; label="ground truth")
fig

svg

Finally, the FMU is unloaded and memory released.

unloadFMU(fmu)

Source

[1] Tobias Thummerer, Lars Mikelsons and Josef Kircher. 2021. NeuralFMU: towards structural integration of FMUs into neural networks. Martin Sjölund, Lena Buffoni, Adrian Pop and Lennart Ochel (Ed.). Proceedings of 14th Modelica Conference 2021, Linköping, Sweden, September 20-24, 2021. Linköping University Electronic Press, Linköping (Linköping Electronic Conference Proceedings ; 181), 297-306. DOI: 10.3384/ecp21181297

Build information

# check package build information for reproducibility
import Pkg; Pkg.status()

[32m[1mStatus[22m[39m `D:\a\FMIFlux.jl\FMIFlux.jl\examples\Project.toml`
  [90m[0c46a032] [39mDifferentialEquations v7.14.0
  [90m[14a09403] [39mFMI v0.14.1
  [90m[fabad875] [39mFMIFlux v0.13.0 `D:\a\FMIFlux.jl\FMIFlux.jl`
  [90m[9fcbc62e] [39mFMIImport v1.0.8
  [90m[724179cf] [39mFMIZoo v1.1.0
  [90m[587475ba] [39mFlux v0.14.22
  [90m[7073ff75] [39mIJulia v1.25.0
[32m⌃[39m [90m[033835bb] [39mJLD2 v0.4.53
  [90m[b964fa9f] [39mLaTeXStrings v1.4.0
  [90m[f0f68f2c] [39mPlotlyJS v0.18.14
  [90m[91a5bcdd] [39mPlots v1.40.8
  [90m[9a3f8284] [39mRandom
[36m[1mInfo[22m[39m Packages marked with [32m⌃[39m have new versions available and may be upgradable.