TMB Documentation
v1.9.11
|
When building models in TMB it is generally recommended to test the implementation on simulated data. Obviously, data can be simulated from R and passed to the C++ template. In practice this amounts to implementing the model twice and is thus a strong way to validate the implementation of the model. However, with increased model complexity it becomes inconvenient to maintain two separate implementations. Therefore, TMB allows the the user to write the simulation code as an integrated part of the C++ model template.
The TMB simulation routines use the same naming convention as the R simulators. For instance rnorm()
is used to simulate from a normal distribution. However, the argument convention is slightly different:
rnorm(n, mu, sd)
draws n
simulations from a normal distribution. Unlike R this works for scalar parameters only.rnorm(mu, sd)
is a TMB specific variant that works for mixed scalar and vector input. Output length follows the length of the longest input (no re-cycling) hence is consistent with dnorm(mu, sd)
.Currently the following simulators are implemented:
rnorm()
,rpois()
,runif()
,rbinom()
,rgamma()
,rexp()
,rbeta()
,rf()
,rlogis()
,rt()
,rweibull()
,rcompois()
,rtweedie()
,rnbinom()
,rnbinom2()
Objects from the density namespace have their own simulate()
method. Taking the multivariate normal distribution as example we have the following ways to draw a simulation:
MVNORM(Sigma).simulate()
returns a vector with a simulation from the multivariate normal distribution. The void argument version is only available when there is no ambiguity in the dimension of the output. In the MVNORM
case the dimension of the output is known from the dimension of Sigma
. In other cases e.g. AR1(phi)
the dimension of the output is not known hence the void argument version is not available.MVNORM(Sigma).simulate(x)
pass x
by reference and writes the simulation directly to x
without returning anything. This version is available for all the classes because the dimension of the simulation can always be deduced from x
.All TMB simulation methods are based on R's random number generator. It follows that the random seed can be controlled from R the usual way using set.seed
even though the simulation is performed on the C++ side.
Simulation functions can be called from anywhere in the C++ program. However, usually one should put the simulation code inside specialized simulation blocks that allows the code to only be executed when requested from R.
A complete example extending the example linreg.cpp with simulation code is:
The
SIMULATE
block marks the simulation and is not executed by default.
We compile the C++-file and the model object is constructed as usual:
Now a simulation can be generated with
This only includes the simulated response - not the rest of the data. A complete dataset can be generated by:
Here we did not explicitely state the parameter values to use with the simulation. The simulate
method takes an additional argument par
that can be used for this.
The default parameter values used for the simulation is
obj$env$last.par
.
Simulating datasets from known parameters and re-estimationg those parameters can be done generically by:
We reshape and plot the result:
Compare with the true parameter values of the simulation:
The examples sam.cpp and ar1_4D.cpp includes more advanced simulation code. The latter demonstrates how to simulate from the density objects:
In this example the 4D-array eta
is passed to the simulator by reference. Thereby the simulator knows the dimension of eta
and can fill eta
with a simulation.
The above example only used one simulation block. In general there is no limitation on the number of simulation blocks that can be used in a model and simulation blocks can use temporaries calculated outside the blocks (as demonstrated in the linear regression example). For clarity reasons, it is often a good idea to add a simulation block after each likelihood contribution. However, note that simulation blocks are in general not commutative (unlike likelihood accumulation). It is therefore further recommended to add likelihood contributions of random effects in the natural hierarchical order.
Adding simulation code to the model template