
Estimate mobility flows based on different trip distribution models
Source:R/run_model.R
run_model.RdThis function estimates mobility flows using different distribution laws and models. As described in Lenormand et al. (2016), the function uses a two-step approach to generate mobility flows by separating the trip distribution law (gravity or intervening opportunities) from the modeling approach used to generate the flows based on this law. This function only uses the second step to generate mobility flow based on a matrix of probabilities using different models.
Usage
run_model(
proba,
model = "UM",
nb_trips = 1000,
out_trips = NULL,
in_trips = out_trips,
average = FALSE,
nbrep = 3,
maxiter = 50,
mindiff = 0.01,
check_names = FALSE
)Arguments
- proba
A squared
matrixof probability. The sum of the matrix element must be equal to 1. It will be normalized automatically if it is not the case.- model
A
characterindicating which model to use.- nb_trips
A
numericvalue indicating the total number of trips. Must be anintegerifaverage = FALSE(see Details).- out_trips
A
numericvector representing the number of outgoing trips per location. Must be a vector of integers ifaverage = FALSE(see Details).- in_trips
A
numericvector representing the number of incoming trips per location. Must be a vector of integers ifaverage = FALSE(see Details).- average
A
booleanindicating if the average mobility flow matrix should be generated instead of thenbrepmatrices based on random draws (see Details).- nbrep
An
integerindicating the number of replications associated with the model run. Note thatnbrep = 1ifaverage = TRUE(see Details).- maxiter
An
integerindicating the maximal number of iterations for adjusting the Doubly Constrained Model (see Details).- mindiff
A
numericstrictly positive value indicating the stopping criterion for adjusting the Doubly Constrained Model (see Details).- check_names
A
booleanindicating whether the location IDs used as matrix rownames and colnames should be checked for consistency (see Note).
Details
We propose four constrained models to generate the flows from these
distribution of probability as described in Lenromand et al. (2016).
These models respect different level of constraints. These constraints can
preserve the total number of trips (argument nb_trips) OR the number of
out-going trips (argument out_trips) AND/OR the number of in-coming
(argument in_trips) according to the model. The sum of out-going trips
should be equal to the sum of in-coming trips.
Unconstrained model (
model = "UM"). Onlynb_tripswill be preserved (argumentsout_tripsandin_tripswill not be used).Production constrained model (
model = "PCM"). Onlyout_tripswill be preserved (argumentsnb_tripsandin_tripswill not be used).Attraction constrained model (
model = "ACM"). Onlyin_tripswill be preserved (argumentsnb_tripsandout_tripswill not be used).Doubly constrained model (
model = "DCM"). Bothout_tripsandin_tripswill be preserved (argumentsnb_tripswill not be used). The doubly constrained model is based on an Iterative Proportional Fitting process (Deming & Stephan, 1940). The argumentsmaxiter(50 by default) andmindiff(0.01 by default) can be used to tune the model.mindiffis the minimal tolerated relative error between the simulated and observed marginals.maxiterensures that the algorithm stops even if it has not converged toward themindiffwanted value.
By default, when average = FALSE, nbrep matrices are generated from
proba with multinomial random draws that will take different forms
according to the model used. In this case, the models will deal with positive
integers as inputs and outputs. Nevertheless, it is also possible to generate
an average matrix based on a multinomial distribution (based on an infinite
number of drawings). In this case, the models' inputs can be either positive
integer or real numbers and the output (nbrep = 1 in this case) will be a
matrix of positive real numbers.
Note
All inputs should be based on the same number of
locations, sorted in the same order. It is recommended to use the location ID
as matrix rownames and matrix colnames and to set
check_names = TRUE to verify that everything is consistent before running
this function (check_names = FALSE by default). Note that the function
check_format_names() can be used to validate all inputs
before running the main package's functions.
References
Deming WE & Stephan FF (1940) On a Least Squares Adjustment of a Sample Frequency Table When the Expected Marginal Totals Are Known. Annals of Mathematical Statistics 11, 427-444.
Lenormand M, Bassolas A, Ramasco JJ (2016) Systematic comparison of trip distribution laws and models. Journal of Transport Geography 51, 158-169.
See also
For more details illustrated with a practical example, see the vignette: https://rtdlm.github.io/TDLM/articles/TDLM.html#run-functions.
Associated functions:
run_law_model(), run_law(), gof().
Author
Maxime Lenormand (maxime.lenormand@inrae.fr)
Examples
data(mass)
data(od)
proba <- od / sum(od)
Oi <- as.numeric(mass[, 2])
Dj <- as.numeric(mass[, 3])
res <- run_model(
proba = proba,
model = "DCM", nb_trips = NULL, out_trips = Oi, in_trips = Dj,
average = FALSE, nbrep = 3, maxiter = 50, mindiff = 0.01,
check_names = FALSE
)
# print(res)