|
A prior probability is a marginal
probability, interpreted as a description of what is known about a variable in the absence of some evidence. The posterior
probability is then the conditional
probability of the variable taking the evidence into account. The posterior probability is computed from the prior and the
likelihood function via Bayes' theorem.
As prior and posterior are not terms used in frequentist analyses, this article uses the vocabulary of Bayesian probability and Bayesian
inference.
Throughout this article, for the sake of brevity the term variable encompasses observable variables, latent
(unobserved) variables, parameters, and hypotheses.
Informative priors
An informative prior expresses specific, definite information about a variable. An example is a prior distribution
for the temperature at noon tomorrow. A reasonable approach is to make the prior a normal distribution with expected value equal
to today's noontime temperature, with variance equal to the day-to-day variance of
atmospheric temperature.
This example has a property in common with many priors, namely, that the posterior from one problem (today's temperature)
becomes the prior for another problem (tomorrow's temperature); pre-existing evidence which has already been taken into account
is part of the prior and as more evidence accumulates the prior is largely by the evidence rather than any original assumption,
provided that the original assumption admitted the possibility of what the evidence is suggesting. The terms "prior" and
"posterior" are generally relative to a specific datum or observation.
Uninformative priors
An uninformative prior expresses vague or general information about a variable. The term "uninformative prior" is a
misnomer; such a prior might be called a not very informative prior. Uninformative priors can express information such
as "the variable is positive" or "the variable is less than some limit".
The use of an uninformative prior typically yields results which are not too different from conventional statistical analysis,
as the likelihood function often yields more information than the uninformative prior.
Improper priors
If Bayes' therorem is written as
-
then it is clear that it would remain true if all the prior probabilities P(Ai) and
P(Aj) were multiplied by a given constant; the same would be true for a continuous random variable. The posterior probabilites
will still sum (or integrate) to 1 even if the prior values do not, and so the priors only need be specified in the correct
proportion.
Taking this idea further, in many cases the sum or integral of the prior values may not even need to be finite to get sensible
answers for the posterior probabilities. When this is the case, the prior is called an improper prior. Some
statisticians use improper priors as uninformative priors. For example, if they need a prior distribution for the mean and
variance of a random variable, they may assume p(m,v)~1/v (for v>0)
which would suggest that any value for the mean is equally likely and that a value for the positive variance becomes less likely
in inverse proportion to its value. Since
-
this would be an improper prior both for the mean and for the variance.
References
- Andrew Gelman, John B. Carlin, Hal S. Stern, and Donald B. Rubin. Bayesian Data Analysis, 2nd edition. CRC Press,
2003.
|