PiecewiseInference.jl

PiecewiseInference.PiecewiseInferenceModule

A suite for inverse modelling of dynamical systems characterised by complex dynamics.

PiecewiseInference.jl implements a segmentation method that improves the convergence of local search methods by regularizing the inference problem.

Installation

PiecewiseInference.jl depends on ParametricModels.jl, a non-registered package. To install PiecewiseInference.jl, you therefore first need to add a custom registry to your Julia installation that tracks both ParametricModels.jl and PiecewiseInference.jl.

To proceed, open Julia and type the following

using Pkg
pkg"registry add https://github.com/vboussange/VBoussangeRegistry.git"

Then install the package:

pkg"add PiecewiseInference"

That's it! This will download the latest version of PiecewiseInference.jl from its git repository, along with all its dependencies.

Getting started

See the documentation and the test folder for up-to-date examples.

Related packages

DiffEqFlux is a package with goals similar to those of PiecewiseInference. It provides the method DiffEqFlux.multiple_shooting, which is close to PiecewiseInference.inference but does not infer initial conditions. PiecewiseInference further provides several utility methods for model selection.

Reference

  • Boussange, V., Vilimelis-Aceituno, P., Pellissier, L. Mini-batching ecological data to improve ecosystem models with machine learning. bioRxiv, 46 pages.
source
PiecewiseInference.InferenceProblemMethod
InferenceProblem(
    model,
    p0;
    p_bij,
    u0_bij,
    loss_param_prior,
    loss_u0_prior,
    loss_likelihood
)

Args

  • model: a model of type AbstractModel.
  • p0: initial guess of the parameters. Should be a named tuple.

Optional

  • p_bij: a tuple of bijectors, used to constrain the parameter values.
  • u0_bij: a bijector used to constrain the initial conditions.
  • loss_param_prior: a function with argument p::NamedTuple, corresponding to the parameter priors. By default, loss_param_prior(p) = 0.
  • loss_u0_prior: a function with arguments u0_pred, u0_data, corresponding to the initial condition priors. By default, it corresponds to the RSS between u0_pred and the corresponding data point.
  • loss_likelihood: a function matching the predictions against the data, with arguments data, pred, rng. By default, it corresponds to the RSS.
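
A minimal sketch of defining a problem with custom constraints and priors, assuming model is an AbstractModel constructed as in the inference example below (the bijector and prior choices are illustrative):

using PiecewiseInference, Distributions, Bijectors

p0 = (b = [1., 2.],)
# constrain `b` to [1e-3, 5e0] and initial conditions to [1e-3, 5.]
p_bij = (bijector(Uniform(1e-3, 5e0)),)
u0_bij = bijector(Uniform(1e-3, 5.))
# illustrative quadratic penalty, playing the role of a Gaussian prior on `b`
loss_param_prior(p) = sum(abs2, p.b .- 1.)
infprob = InferenceProblem(model, p0; p_bij, u0_bij, loss_param_prior)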

source
PiecewiseInference.AICMethod
AIC(RSS, k, m)

Computes the AIC of a model given its RSS, its number of parameters k, and the number of observations m.

source
PiecewiseInference.AICMethod
AIC(data, tsteps, infres)

Computes the AIC of the inference result infres, given the data and the corresponding time steps tsteps.

source
PiecewiseInference.AICcMethod
AICc(aic, k, m)

Computes the corrected AIC (AICc) of a model given its aic, its number of parameters k, and the number of observations m (for d variables observed over T time steps, m = d * T). The formula is taken from Mangan2017.
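
For reference, a common least-squares convention for these criteria reads as follows; this is a sketch, and the exact expressions implemented in the package may differ:

# hypothetical helpers illustrating the usual conventions
aic_ls(RSS, k, m) = m * log(RSS / m) + 2 * k              # least-squares AIC
aicc_ls(aic, k, m) = aic + 2 * k * (k + 1) / (m - k - 1)  # small-sample correction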

source
PiecewiseInference.AICc_TREEMethod

AICc_TREE(RSS, k, m)

Computes the AICc of a model given its RSS, its number of parameters k, and the number of observations m. The formula is taken from the TREE article.

source
PiecewiseInference.FIM_strouwenMethod
FIM_strouwen(predict, θ, Σ)

Returns the Fisher information matrix (FIM) associated with predict, evaluated at θ. Adapted from https://arnostrouwen.com/post/dynamic-experimental-design/.

  • predict: a function that takes θ as a parameter and returns an array, with dim = 1 corresponding to the state variables and dim = 2 to the time steps
  • θ: the parameter vector at which to evaluate the FIM
  • Σ: the variance-covariance matrix of the observation errors
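
A minimal usage sketch, with a hypothetical closed-form logistic prediction function (note that predict returns the state variables along dim = 1 and the time steps along dim = 2):

using PiecewiseInference, LinearAlgebra

tsteps = 0.:1.:10.
# hypothetical prediction function: logistic growth with θ = [r, K] and u0 = 1
predict(θ) = reshape([θ[2] / (1. + (θ[2] - 1.) * exp(-θ[1] * t)) for t in tsteps], 1, :)
θ = [0.5, 2.0]
Σ = Matrix(0.1^2 * I, 1, 1)  # variance-covariance matrix of the observation errors
fim = FIM_strouwen(predict, θ, Σ)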
source
PiecewiseInference.FIM_yazdaniMethod
FIM_yazdani(dudt, u0_true, tspan, p_true, Σ)

Returns the Fisher information matrix (FIM) associated with the problem defined by prob = ODEProblem(dudt, u0_true, tspan, p_true). Σ is the variance-covariance matrix of the observation errors.
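
A minimal usage sketch, with a hypothetical logistic ODE:

using PiecewiseInference, LinearAlgebra

# hypothetical logistic model, in the in-place form expected by ODEProblem
dudt(du, u, p, t) = du .= p[1] .* u .* (1 .- u ./ p[2])
u0_true = [0.5]
tspan = (0., 10.)
p_true = [0.5, 2.0]
Σ = Matrix(0.1^2 * I, 1, 1)  # variance-covariance matrix of the observation errors
fim = FIM_yazdani(dudt, u0_true, tspan, p_true, Σ)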

source
PiecewiseInference.R2Method
R2(odedata, pred, _)

We distinguish between the R2 for log-transformed values (with dist::MvLogNormal as the last argument) and the standard R2, so as to discard non-positive values in the former case.

source
PiecewiseInference.RSSMethod
RSS(res, data_set, noisedistrib)

Computes the RSS of res given data_set.

Note:

  • internally, residuals are computed through a transformation fn, set to identity by default, but which can be e.g. log if a lognormal loss is used.
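
Conceptually, the RSS amounts to the following (a sketch with hypothetical data and pred arrays):

data = rand(2, 10) .+ 1.
pred = rand(2, 10) .+ 1.
rss = sum(abs2, data .- pred)                  # Gaussian noise: residuals on raw values
rss_log = sum(abs2, log.(data) .- log.(pred))  # lognormal noise: residuals on log values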
source
PiecewiseInference.get_evidenceMethod
get_evidence(data, tsteps, infprob, p, u0s, ranges)

Provides the model evidence p(data|M) = ∫ p(data|M, θ) p(θ) dθ for the model M stored in infprob, given the data and the MAP estimates p and u0s. Here it is assumed that p(M) = 1, so that p(M|data) ∝ p(data|M).

Relies on Laplace's method, which approximates the integral through a second-order expansion of the log posterior around the MAP estimate.

Note

For now, we do not integrate over initial conditions u0s, but this may be considered.

source
PiecewiseInference.group_ranges_gsMethod
group_ranges_gs(datasize, group_size)

Get ranges that partition data of length datasize into groups of group_size observations. If datasize is not perfectly divisible by group_size, the last group contains the remaining observations. Adapted from https://github.com/SciML/DiffEqFlux.jl/blob/80c4247c19860d0422211d6a65283d896eeaa831/src/multiple_shooting.jl#L273-L303.

Arguments:

  • datasize: number of data points to be partitioned
  • group_size: maximum number of observations in each group

Example:

julia> group_ranges_gs(10, 5)
3-element Vector{UnitRange{Int64}}:
 1:5
 5:9
 9:10
source
PiecewiseInference.inferenceMethod
inference(infprob; group_size, group_nb, kwargs...)

Piecewise inference. Loops through the optimizers in optimizers. Returns an InferenceResult.

Arguments

  • infprob: the InferenceProblem
  • group_size: the size of the segments
  • group_nb: alternatively to group_size, the number of segments can be specified
  • data: the data
  • tsteps: the time steps corresponding to data

Optional

  • u0_init: initial conditions; if not provided, initialised from data
  • optimizers: array of optimizers, e.g. [Adam(0.01)]
  • epochs: array of numbers of epochs, whose length should match that of optimizers
  • batchsizes: array of batch sizes, whose length should match that of optimizers
  • verbose_loss: whether to display the loss during training
  • info_per_its = 50: frequency (in iterations) at which information is displayed
  • plotting: whether to plot the convergence of the loss
  • cb: callback function, which must be of the form cb(θs, p_trained, losses, pred, ranges)
  • threshold: defaults to 1e-6
  • save_pred = true: saves the predictions
  • save_losses = true: saves the losses
  • adtype = Optimization.AutoForwardDiff(): the AD type to be used. Can be Optimization.AutoForwardDiff() for forward AD, or Optimization.AutoZygote() for reverse AD.
  • multi_threading = true: if true, the segments of the piecewise loss are computed in parallel. Currently not supported with adtype = Optimization.AutoZygote().

Examples

using SciMLSensitivity # provides diffential equation sensitivity methods
using UnPack # provides the utility macro @unpack 
using OptimizationOptimisers, OptimizationFlux # provide the optimizers
using LinearAlgebra
using ParametricModels
using PiecewiseInference
using OrdinaryDiffEq
using Distributions, Bijectors # used to constrain parameters and initial conditions
@model MyModel
function (m::MyModel)(du, u, p, t)
    @unpack b = p
    du .=  0.1 .* u .* ( 1. .- b .* u) 
end
tsteps = 1.:0.5:100.5
tspan = (tsteps[1], tsteps[end])
p_true = (b = [0.23, 0.5],)
p_init= (b = [1., 2.],)
# Defining the model
# Pay attention to the semicolon before the keyword arguments of `ModelParams`
u0 = ones(2)
mp = ModelParams(;p = p_true, 
                tspan,
                u0, 
                alg = BS3(),
                sensealg = ForwardDiffSensitivity(),
                saveat = tsteps, 
                )
model = MyModel(mp)
sol_data = ParametricModels.simulate(model)
ode_data = Array(sol_data)
# adding some normally distributed noise
σ_noise = 0.1
ode_data_wnoise = ode_data .+ randn(size(ode_data)) .* σ_noise
# Define the `InferenceProblem`
# First, specify which values the parameters can take using bijectors
# here, `b` is constrained to be ∈ [1e-3, 5e0] and `u0` ∈ [1e-3, 5.]
p_bij = (bijector(Uniform(1e-3, 5e0)),)
u0_bij = bijector(Uniform(1e-3,5.))
distrib_noise = MvNormal(zeros(2), σ_noise^2 * I)
# defining `loss_likelihood` as the negative log likelihood of the residuals
loss_likelihood(data, pred, rng) = -sum(logpdf(distrib_noise, data .- pred))
infprob = InferenceProblem(model, p_init; p_bij, u0_bij, loss_likelihood)
optimizers = [ADAM(0.001)]
epochs = [5000]
group_nb = 2
batchsizes = [1] # batch size used for each optimizer in optimizers (here only one)
# you could also have `batchsizes = [group_nb]`
res = inference(infprob,
                    group_nb = group_nb, 
                    data = ode_data_wnoise, 
                    tsteps = tsteps, 
                    epochs = epochs, 
                    optimizers = optimizers,
                    batchsizes = batchsizes,
                    )

p_trained = get_p_trained(res)
pred = res.pred
source
PiecewiseInference.iterative_inferenceMethod
iterative_inference(
    infprob;
    group_sizes,
    group_nbs,
    optimizers_array,
    threshold,
    kwargs...
)

Performs an iterative piecewise MLE, iterating over group_sizes. Stops the iteration when the loss function increases between two iterations.

Returns an array with all the InferenceResults obtained during the iteration. For kwargs, see inference.

Note

  • for now, does not support independent time series (piecewise_ML_indep_TS).
  • at every iteration, initial conditions are initialised from the predictions of the previous iteration

Specific arguments

  • group_sizes : array of group sizes to test
  • optimizers_array: optimizers_array[i] is the array of optimizers used for the training process with group_sizes[i]
source
PiecewiseInference.loss_param_prior_from_dictMethod
loss_param_prior_from_dict(params, param_distrib)

Returns the log-likelihood of params given the prior distribution of the parameters, param_distrib.

  • params: the parameters, in the form of a NamedTuple
  • param_distrib: a Dictionary or NamedTuple, with entries of the form p::String => d::Distribution
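
A minimal sketch (the distribution choice is illustrative):

using PiecewiseInference, Distributions, LinearAlgebra

param_distrib = Dict("b" => MvNormal(zeros(2), I))  # illustrative prior on parameter b
p = (b = [0.1, 0.2],)
lp = loss_param_prior_from_dict(p, param_distrib)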
source
PiecewiseInference.moments!Method
moments!(xx, x)

Returns the moments of x, stored in the vector xx.

Args

  • xx: the covariate vector
  • x: the state variable vector
source
PiecewiseInference.piecewise_ML_indep_TSMethod
piecewise_ML_indep_TS(
    infprob;
    data,
    group_size,
    group_nb,
    tsteps,
    save_pred,
    save_losses,
    kwargs...
)

Similar to inference but for independent time series, where data is a vector containing the independent arrays corresponding to the time series, and tsteps is a vector where each entry contains the time steps of the corresponding time series.
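
A minimal calling sketch, assuming infprob is defined as in the inference example above and ts1, ts2 are two independent time series observed at time steps tsteps1 and tsteps2 (hypothetical names):

res = piecewise_ML_indep_TS(infprob;
                            data = [ts1, ts2],
                            tsteps = [tsteps1, tsteps2],
                            group_size = 10)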

source
PiecewiseInference.piecewise_lossMethod
piecewise_loss(
    infprob,
    θ,
    ode_data,
    tsteps,
    ranges,
    idx_rngs
)
piecewise_loss(
    infprob,
    θ,
    ode_data,
    tsteps,
    ranges,
    idx_rngs,
    multi_threading
)

Returns a tuple (loss, pred) based on the segmentation of ode_data into segments, with time steps given by tsteps[ranges[i]]. The initial conditions are assumed free parameters for each segment.

Arguments:

  • infprob: the inference problem
  • θ: [u0, p] where p corresponds to the parameters of the ODE function in the optimization space.
  • ode_data: original data to be modelled.
  • tsteps: time steps on which ode_data was calculated.
  • ranges: vector containing the range of each segment.
  • idx_rngs: Vector containing the indices of the segments to be included in the loss
source
PiecewiseInference.pretty_printFunction
pretty_print(p_1)
pretty_print(p_1, p_2)

Prints the NamedTuple or Dict p_1 in a nice format. If p_2 is provided, also displays its values for comparison.

source