R/mcmc-functions.R
mcmc_potential_scale_reduction.Rd
Given N > 1
states from each of C > 1
independent chains, the potential
scale reduction factor, commonly referred to as R-hat, measures convergence of
the chains (to the same target) by testing for equality of means.
mcmc_potential_scale_reduction( chains_states, independent_chain_ndims = 1, name = NULL )
chains_states |
|
---|---|
independent_chain_ndims | Integer type |
name | name to prepend to created tf. Default: |
Tensor
or list
of Tensor
s representing the R-hat statistic for
the state(s). Same dtype
as state
, and shape equal to
state$shape[1 + independent_chain_ndims:]
.
Specifically, R-hat measures the degree to which variance (of the means) between chains exceeds what one would expect if the chains were identically distributed. See Gelman and Rubin (1992), Brooks and Gelman (1998)].
Some guidelines:
The initial state of the chains should be drawn from a distribution overdispersed with respect to the target.
If all chains converge to the target, then as N --> infinity
, R-hat --> 1.
Before that, R-hat > 1 (except in pathological cases, e.g. if the chain paths were identical).
The above holds for any number of chains C > 1
. Increasing C
improves effectiveness of the diagnostic.
Sometimes, R-hat < 1.2 is used to indicate approximate convergence, but of course this is problem dependent. See Brooks and Gelman (1998).
R-hat only measures non-convergence of the mean. If higher moments, or other statistics are desired, a different diagnostic should be used. See Brooks and Gelman (1998).
To see why R-hat is reasonable, let X
be a random variable drawn uniformly
from the combined states (combined over all chains). Then, in the limit
N, C --> infinity
, with E
, Var
denoting expectation and variance,
R-hat = ( E[Var[X | chain]] + Var[E[X | chain]] ) / E[Var[X | chain]].
Using the law of total variance, the numerator is the variance of the combined
states, and the denominator is the total variance minus the variance of the
the individual chain means. If the chains are all drawing from the same
distribution, they will have the same mean, and thus the ratio should be one.
Stephen P. Brooks and Andrew Gelman. General Methods for Monitoring Convergence of Iterative Simulations. Journal of Computational and Graphical Statistics, 7(4), 1998.
Andrew Gelman and Donald B. Rubin. Inference from Iterative Simulation Using Multiple Sequences. Statistical Science, 7(4):457-472, 1992.
Other mcmc_functions:
mcmc_effective_sample_size()
,
mcmc_sample_annealed_importance_chain()
,
mcmc_sample_chain()
,
mcmc_sample_halton_sequence()