First Neural ODE example#

A neural ODE is an ODE whose derivative function is defined by a neural network: \(\dot{u} = NN(u)\).
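Solving the ODE forward in time from the initial state \(u(0)\) yields the model's prediction:

\[
u(t) = u(0) + \int_0^t NN(u(\tau)) \, d\tau
\]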

From: https://docs.sciml.ai/DiffEqFlux/stable/examples/neural_ode/

using Lux, DiffEqFlux, DifferentialEquations, ComponentArrays
using Optimization, OptimizationOptimJL, OptimizationOptimisers
using Random, Plots

rng = Random.default_rng()
TaskLocalRNG()

Define the true ODE that generates the training data.

function trueODEfunc(du, u, p, t)
    true_A = [-0.1 2.0; -2.0 -0.1]
    # Cubic dynamics; equivalent to du .= true_A' * (u .^ 3)
    du .= ((u.^3)'true_A)'
end
trueODEfunc (generic function with 1 method)
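Written out, this computes the cubic system

\[
\dot{u} = A^{\top} u^{3}, \qquad
A = \begin{bmatrix} -0.1 & 2.0 \\ -2.0 & -0.1 \end{bmatrix},
\]

where the cube is applied elementwise.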

The data used for training

u0 = Float32[2.0; 0.0]
datasize = 30
tspan = (0.0f0, 1.5f0)
tsteps = range(tspan[begin], tspan[end], length = datasize)
prob_trueode = ODEProblem(trueODEfunc, u0, tspan)
ode_data = Array(solve(prob_trueode, Tsit5(), saveat = tsteps))
2×30 Matrix{Float32}:
 2.0  1.9465    1.74178  1.23837  0.577127  …  1.40688   1.37023   1.29214
 0.0  0.798832  1.46473  1.80877  1.86465      0.451377  0.728699  0.972102
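To eyeball the training data before fitting, both state components can be plotted with the Plots.jl functions imported above (an optional sketch, not part of the original tutorial):

plt = scatter(tsteps, ode_data[1, :], label = "u1 data")   # first state component
scatter!(plt, tsteps, ode_data[2, :], label = "u2 data")   # second state component
plot(plt)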

Make a NeuralODE problem with a neural network defined by Lux.jl.

dudt2 = Lux.Chain(
    x -> x.^3,               # cubic feature map, mirroring the true dynamics
    Lux.Dense(2, 50, tanh),
    Lux.Dense(50, 2)
)

p, st = Lux.setup(rng, dudt2)
prob_neuralode = NeuralODE(dudt2, tspan, Tsit5(), saveat = tsteps)
NeuralODE(
    model = Chain(
        layer_1 = WrappedFunction(#1),
        layer_2 = Dense(2 => 50, tanh_fast),  # 150 parameters
        layer_3 = Dense(50 => 2),       # 102 parameters
    ),
)         # Total: 252 parameters,
          #        plus 0 states.
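Before training, the NeuralODE can already be evaluated by calling it like a function. A minimal sketch, assuming (as the prediction function below does) that the parameters are passed as the second argument; the first element of the returned tuple is the ODE solution, the second the updated layer states:

pred_untrained, _ = prob_neuralode(u0, ComponentArray(p), st)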

Define the prediction, loss, and callback functions.

function predict_neuralode(p)
    Array(prob_neuralode(u0, p, st)[1])
end

function loss_neuralode(p)
    pred = predict_neuralode(p)
    loss = sum(abs2, ode_data .- pred)
    return loss, pred
end
loss_neuralode (generic function with 1 method)

Plots are not generated by default; set doplot = true to display the figures inside the callback function.

callback = function (p, l, pred; doplot = false)
    println(l)
    # plot current prediction against data
    if doplot
      plt = scatter(tsteps, ode_data[1,:], label = "data")
      scatter!(plt, tsteps, pred[1,:], label = "prediction")
      plot(plt)
    end
    return false
end
#3 (generic function with 1 method)

Test the callback function with the initial parameters.

pinit = ComponentArray(p)
callback(pinit, loss_neuralode(pinit)...; doplot=true)
118.93152
false

Use Optimization.jl to set up and solve the problem:

  • Zygote for automatic differentiation (AD)

  • loss_neuralode as the function to be optimized

  • an OptimizationProblem combining the two

adtype = Optimization.AutoZygote()
optf = Optimization.OptimizationFunction((x, p) -> loss_neuralode(x), adtype)
optprob = Optimization.OptimizationProblem(optf, pinit)
OptimizationProblem. In-place: true
u0: ComponentVector{Float32}(layer_1 = Float32[], layer_2 = (weight = Float32[-0.13173558 -0.26862946; -0.21219468 -0.29113472; … ; 0.23871331 0.32163706; 0.2673218 -0.17830189], bias = Float32[0.0; 0.0; … ; 0.0; 0.0;;]), layer_3 = (weight = Float32[0.039003026 0.28549364 … -0.24841379 0.102247044; 0.23711038 -0.06729417 … 0.070725866 0.0989487], bias = Float32[0.0; 0.0;;]))

Solve the OptimizationProblem.

result_neuralode = Optimization.solve(
    optprob,
    OptimizationOptimisers.ADAM(0.05),
    callback = callback,
    maxiters = 300
)
118.93152
107.588326
106.10397
⋮    (one loss value printed per iteration, 300 iterations in total)
0.2833498
0.27903017
0.27903017
retcode: Default
u: ComponentVector{Float32}(layer_1 = Float32[], layer_2 = (weight = Float32[0.2608399 -0.7103775; -0.16067556 -1.1602527; … ; 0.15277196 1.2721673; 0.21492538 0.2944236], bias = Float32[0.36572495; 0.44362512; … ; -0.44807923; -0.53865325;;]), layer_3 = (weight = Float32[0.6236109 0.5328911 … -0.50796896 0.35644993; 0.051700193 -0.3421964 … 0.34809616 0.63404644], bias = Float32[-0.21957095; 0.017598046;;]))

Use another optimization algorithm, Optim.BFGS(), starting from where the ADAM() run stopped.

optprob2 = remake(optprob, u0 = result_neuralode.u)

result_neuralode2 = Optimization.solve(
    optprob2,
    Optim.BFGS(initial_stepnorm=0.01),
    callback=callback,
    allow_f_increases = false
)
0.27903017
0.27872407
0.27863026
⋮    (loss printed at each BFGS iteration)
0.0026384164
0.0026384164
0.0026384164
retcode: Success
u: ComponentVector{Float32}(layer_1 = Float32[], layer_2 = (weight = Float32[0.08741521 -0.7041453; -0.19451857 -0.99229836; … ; 0.19416788 1.120761; 0.31404698 0.026229795], bias = Float32[0.3758842; 0.6552975; … ; -0.6631577; -1.0059776;;]), layer_3 = (weight = Float32[0.665346 0.35397014 … -0.31324288 0.48092267; 0.0966155 -0.45088494 … 0.44247928 1.1319721], bias = Float32[-0.1843225; 0.8225856;;]))

Plot the solution to see if it matches the provided data.

callback(result_neuralode2.u, loss_neuralode(result_neuralode2.u)...; doplot=true)
0.0026384164
false
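For a fuller check, both state components can be compared against the data; a sketch reusing the functions defined above (final_pred is a name introduced here for illustration):

final_pred = predict_neuralode(result_neuralode2.u)
plt = scatter(tsteps, ode_data[1, :], label = "u1 data")
scatter!(plt, tsteps, final_pred[1, :], label = "u1 prediction")
scatter!(plt, tsteps, ode_data[2, :], label = "u2 data")
scatter!(plt, tsteps, final_pred[2, :], label = "u2 prediction")
plot(plt)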

Animated solving process#

Let’s reset the problem and visualize the training process.

rng = Random.default_rng()
u0 = Float32[2.0; 0.0]
datasize = 30
tspan = (0.0f0, 1.5f0)
tsteps = range(tspan[begin], tspan[end], length = datasize)
0.0f0:0.05172414f0:1.5f0

Set up the ground-truth values for validation.

true_A = Float32[-0.1 2.0; -2.0 -0.1]

function trueODEfunc!(du, u, p, t)
    # Same cubic dynamics as above: du .= true_A' * (u .^ 3)
    du .= ((u.^3)'true_A)'
end
trueODEfunc! (generic function with 1 method)

prob_trueode = ODEProblem(trueODEfunc!, u0, tspan)
ode_data = Array(solve(prob_trueode, Tsit5(), saveat = tsteps))
2×30 Matrix{Float32}:
 2.0  1.9465    1.74178  1.23837  0.577126  …  1.40688   1.37023   1.29215
 0.0  0.798832  1.46473  1.80877  1.86465      0.451358  0.728681  0.972087

Define the neural network with Lux.jl, as before.

nodeFunc = Lux.Chain(
    x -> x.^3,
    Lux.Dense(2, 50, tanh),
    Lux.Dense(50, 2)
)

p, st = Lux.setup(rng, nodeFunc)
((layer_1 = NamedTuple(), layer_2 = (weight = Float32[-0.2273688 -0.20968895; 0.054255627 0.20530364; … ; 0.15356942 0.24286065; -0.11287057 0.30330524], bias = Float32[0.0; 0.0; … ; 0.0; 0.0;;]), layer_3 = (weight = Float32[-0.18407616 0.17299011 … -0.22141114 -0.07978176; 0.16965075 0.03569014 … 0.15526423 -0.3377969], bias = Float32[0.0; 0.0;;])), (layer_1 = NamedTuple(), layer_2 = NamedTuple(), layer_3 = NamedTuple()))

Parameters in the neural network:

p
(layer_1 = NamedTuple(), layer_2 = (weight = Float32[-0.2273688 -0.20968895; 0.054255627 0.20530364; … ; 0.15356942 0.24286065; -0.11287057 0.30330524], bias = Float32[0.0; 0.0; … ; 0.0; 0.0;;]), layer_3 = (weight = Float32[-0.18407616 0.17299011 … -0.22141114 -0.07978176; 0.16965075 0.03569014 … 0.15526423 -0.3377969], bias = Float32[0.0; 0.0;;]))
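The parameter count can be confirmed programmatically with the Lux API (parameterlength is exported by Lux):

Lux.parameterlength(nodeFunc)  # 252 = (2*50 + 50) + (50*2 + 2)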

Use NeuralODE() to construct the problem.

prob_node = NeuralODE(nodeFunc, tspan, Tsit5(), saveat = tsteps)
NeuralODE(
    model = Chain(
        layer_1 = WrappedFunction(#8),
        layer_2 = Dense(2 => 50, tanh_fast),  # 150 parameters
        layer_3 = Dense(50 => 2),       # 102 parameters
    ),
)         # Total: 252 parameters,
          #        plus 0 states.

The prediction function.

function predict_neuralode(p)
    Array(prob_node(u0, p, st)[1])
end
predict_neuralode (generic function with 1 method)

The loss function.

function loss_neuralode(p)
    pred = predict_neuralode(p)
    loss = sum(abs2, ode_data .- pred)
    return loss, pred
end
loss_neuralode (generic function with 1 method)

The callback function records one animation frame per iteration to visualize the training process.

anim = Animation()
callback = function (p, l, pred; doplot = true)
    if doplot
        plt = scatter(tsteps, ode_data[1,:], label = "data")
        scatter!(plt, tsteps, pred[1,:], label = "prediction")
        frame(anim)
    end
    return false
end
#10 (generic function with 1 method)

adtype = Optimization.AutoZygote()
optf = Optimization.OptimizationFunction((x, p) -> loss_neuralode(x), adtype)
optprob = Optimization.OptimizationProblem(optf, ComponentArray(p))
OptimizationProblem. In-place: true
u0: ComponentVector{Float32}(layer_1 = Float32[], layer_2 = (weight = Float32[-0.2273688 -0.20968895; 0.054255627 0.20530364; … ; 0.15356942 0.24286065; -0.11287057 0.30330524], bias = Float32[0.0; 0.0; … ; 0.0; 0.0;;]), layer_3 = (weight = Float32[-0.18407616 0.17299011 … -0.22141114 -0.07978176; 0.16965075 0.03569014 … 0.15526423 -0.3377969], bias = Float32[0.0; 0.0;;]))

Solve the problem using the ADAM optimizer

result_neuralode = Optimization.solve(
    optprob,
    OptimizationOptimisers.ADAM(0.05),
    callback = callback,
    maxiters = 300
)
retcode: Default
u: ComponentVector{Float32}(layer_1 = Float32[], layer_2 = (weight = Float32[1.1471213 0.074072815; -0.33308297 0.38328815; … ; 0.10337194 1.6461912; 0.116916105 0.70201075], bias = Float32[-0.027155323; -0.05846334; … ; -0.2410494; 0.23793979;;]), layer_3 = (weight = Float32[0.39916682 -0.42204866 … -0.39229235 0.077795036; 0.6139497 -0.46670642 … 0.24933042 0.09802464], bias = Float32[-0.4860557; -0.10765326;;]))

Then refine with the LBFGS optimizer, again warm-starting from the ADAM result.

optprob2 = remake(optprob, u0 = result_neuralode.u)

result_neuralode2 = Optimization.solve(
    optprob2,
    Optim.LBFGS(),
    callback = callback,
    allow_f_increases = false
)
retcode: Success
u: ComponentVector{Float32}(layer_1 = Float32[], layer_2 = (weight = Float32[1.1471213 0.07407282; -0.33308297 0.38328815; … ; 0.10337194 1.6461912; 0.116916105 0.70201075], bias = Float32[-0.027155323; -0.05846334; … ; -0.2410494; 0.23793979;;]), layer_3 = (weight = Float32[0.39916682 -0.42204866 … -0.39229235 0.07779504; 0.6139497 -0.46670642 … 0.24933042 0.09802464], bias = Float32[-0.4860557; -0.10765325;;]))

Visualize the fitting process.

mp4(anim, fps=15)
[ Info: Saved animation to /tmp/docs/ude/tmp.mp4
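The animation can also be saved as a GIF instead of an mp4; gif is likewise exported by Plots.jl:

gif(anim, fps = 15)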