terrycojones/gist:2b18f23247903fd5213b

## gistfile1.md

      
Bayes is cool!

From the introduction to
An Intuitive Explanation of Bayes' Theorem:

Your friends and colleagues are talking about something called "Bayes'
Theorem" or "Bayes' Rule", or something called Bayesian reasoning.  They
sound really enthusiastic about it, too, so you google and find a webpage
about Bayes' Theorem and...

It's this equation.  That's all.  Just one equation.  The page you found
gives a definition of it, but it doesn't say what it is, or why it's
useful, or why your friends would be interested in it.  It looks like
this random statistics thing.

So you came here.  Maybe you don't understand what the equation says.
Maybe you understand it in theory, but every time you try to apply it in
practice you get mixed up trying to remember the difference between
p(a|x) and p(x|a), and whether p(a)*p(x|a) belongs in the numerator or
the denominator.  Maybe you see the theorem, and you understand the
theorem, and you can use the theorem, but you can't understand why your
friends and/or research colleagues seem to think it's the secret of the
universe.  Maybe your friends are all wearing Bayes' Theorem T-shirts,
and you're feeling left out.  Maybe you're a girl looking for a
boyfriend, but the boy you're interested in refuses to date anyone who
"isn't Bayesian".  What matters is that Bayes is cool, and if you don't
know Bayes, you aren't cool.

Why does a mathematical concept generate this strange enthusiasm in its
students?  What is the so-called Bayesian Revolution now sweeping through
the sciences, which claims to subsume even the experimental method itself
as a special case?  What is the secret that the adherents of Bayes know?
What is the light that they have seen?

Soon you will know.  Soon you will be one of us.
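
For reference, here is the equation the passage alludes to, written in its own p(a|x) notation. The second form expands p(x) over a and its negation, written ¬a here (a symbol the quoted text does not use). It shows that p(a)*p(x|a) appears in both the numerator and the denominator:

```latex
p(a \mid x) = \frac{p(x \mid a)\,p(a)}{p(x)}
            = \frac{p(x \mid a)\,p(a)}{p(x \mid a)\,p(a) + p(x \mid \neg a)\,p(\neg a)}
```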

It's hard to grasp, but powerful

From
Probability Theory: The Logic of Science
page xi:

... it became clear gradually that the outstanding difficulties of
conventional "statistical inference" are easily understood and overcome.
But the rules which now took their place were quite subtle conceptually,
and it required some deep thinking to see how to apply them correctly.
Past difficulties which had led to rejection of Laplace's work, were seen
finally as only misapplications, arising usually from failure to define
the problem unambiguously or to appreciate the cogency of seemingly
trivial side information, and easy to correct once this is recognized.

and

Often, the things which are most familiar to us turn out to be the
hardest to understand. Phenomena whose very existence is unknown to the
vast majority of the human race (such as the difference in ultraviolet
spectra of Iron and Nickel) can be explained in exhaustive mathematical
detail, but all of modern science is practically helpless when faced
with the complications of such a commonplace fact as growth of a blade of
grass. Accordingly, we must not expect too much of our models; we must
be prepared to find that some of the most familiar features of mental
activity may be ones for which we have the greatest difficulty in
constructing any adequate model.

and

The writer has learned from much experience that this primary emphasis on
the logic of the problem, rather than the mathematics, is necessary in
the early stages. For modern students, the mathematics is the easy part;
once a problem has been reduced to a definite mathematical exercise, most
students can solve it effortlessly and extend it endlessly, without
further help from any book or teacher. It is in the conceptual matters
(how to make the initial connection between the real-world problem and
the abstract mathematics) that they are perplexed and unsure how to
proceed.

and

A scientist who has learned how to use probability theory directly as
extended logic, has a great advantage in power and versatility over one
who has learned only a collection of unrelated ad hoc devices. As the
complexity of our problems increases, so does this relative advantage.
Therefore we think that in the future, workers in all the quantitative
sciences will be obliged, as a matter of practical necessity, to use
probability theory in the manner expounded here. This trend is already
well under way in several fields, ranging from econometrics to astronomy
to magnetic resonance spectroscopy; but to make progress in a new area it
is necessary to develop a healthy disrespect for tradition and authority,
which have retarded progress throughout the 20th century.

It's non-intuitive

Why does Bayes' theorem seem hard or non-intuitive? See the discussion of
natural frequencies in
An Intuitive Explanation of Bayes' Theorem.

From An Intuitive Explanation of Bayes' Theorem:

While there are a few existing online explanations of Bayes' Theorem, my
experience with trying to introduce people to Bayesian reasoning is that
the existing online explanations are too abstract.  Bayesian reasoning is
very counterintuitive.  People do not employ Bayesian reasoning
intuitively, find it very difficult to learn Bayesian reasoning when
tutored, and rapidly forget Bayesian methods once the tutoring is over.
This holds equally true for novice students and highly trained
professionals in a field. Bayesian reasoning is apparently one of those
things which, like quantum mechanics or the Wason Selection Test, is
inherently difficult for humans to grasp with our built-in mental
faculties.
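
The natural-frequency framing mentioned at the start of this section replaces percentages with counts of people in an imagined population, which many readers find easier to follow. Below is a minimal sketch in Python. It uses the 1% prevalence and 80% true-positive rate from the mammography example quoted next. The 9.6% false-positive rate and the population of 10,000 are assumptions not stated in this document, and the function name is purely illustrative.

```python
from fractions import Fraction


def natural_frequencies(population, prevalence, true_positive_rate, false_positive_rate):
    """Return (true positives, false positives) as counts in the given population."""
    with_condition = population * prevalence
    without_condition = population - with_condition
    true_positives = with_condition * true_positive_rate
    false_positives = without_condition * false_positive_rate
    return true_positives, false_positives


# 1% and 80% come from the quoted example; 9.6% and 10,000 are ASSUMED here.
tp, fp = natural_frequencies(
    population=10_000,
    prevalence=Fraction(1, 100),
    true_positive_rate=Fraction(80, 100),
    false_positive_rate=Fraction(96, 1000),
)

# The share of positive tests that belong to women who actually have cancer.
share = tp / (tp + fp)
print(f"women with cancer who test positive:    {float(tp):.1f}")
print(f"women without cancer who test positive: {float(fp):.1f}")
print(f"share of positives that are real:       {float(share):.1%}")
```

Counting this way makes it visible that, under these assumed numbers, the false positives from the large healthy group outnumber the true positives from the small group with cancer.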

and

Studies of clinical reasoning show that most doctors carry out the mental
operation of replacing the original 1% probability with the 80%
probability that a woman with cancer would get a positive mammography.
Similarly, on the pearl-egg problem, most respondents unfamiliar with
Bayesian reasoning would probably respond that the probability a blue egg
contains a pearl is 30%, or perhaps 20% (the 30% chance of a true
positive minus the 10% chance of a false positive).  Even if this mental
operation seems like a good idea at the time, it makes no sense in terms
of the question asked.  It's like the experiment in which you ask a
second-grader: "If eighteen people get on a bus, and then seven more
people get on the bus, how old is the bus driver?"  Many second-graders
will respond: "Twenty-five."  They understand when they're being prompted
to carry out a particular mental procedure, but they haven't quite
connected the procedure to reality.  Similarly, to find the probability
that a woman with a positive mammography has breast cancer, it makes no
sense whatsoever to replace the original probability that the woman has
cancer with the probability that a woman with breast cancer gets a
positive mammography.  Neither can you subtract the probability of a
false positive from the probability of the true positive.  These
operations are as wildly irrelevant as adding the number of people on the
bus to find the age of the bus driver.
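
The correct calculation that the passage contrasts with these shortcuts can be sketched in a few lines of Python. The 1% and 80% figures (mammography) and the 30% and 10% figures (pearl-egg) come from the quoted text. The 9.6% false-positive rate for mammography and the 40% prior for pearls are assumptions not given in this document, and the function and parameter names are illustrative only.

```python
def posterior(prior, p_pos_given_true, p_pos_given_false):
    """Bayes' theorem for a yes/no hypothesis after one positive observation."""
    true_pos = prior * p_pos_given_true          # hypothesis true and observation positive
    false_pos = (1 - prior) * p_pos_given_false  # hypothesis false but observation positive
    return true_pos / (true_pos + false_pos)


# Mammography: prior and true-positive rate are from the text; 0.096 is ASSUMED.
mammography = posterior(prior=0.01, p_pos_given_true=0.80, p_pos_given_false=0.096)

# Pearl-egg: 0.30 and 0.10 are from the text; the 0.40 prior is ASSUMED.
pearl_egg = posterior(prior=0.40, p_pos_given_true=0.30, p_pos_given_false=0.10)

print(f"P(cancer | positive mammography) = {mammography:.3f}")
print(f"P(pearl | blue egg)              = {pearl_egg:.3f}")
```

Under these assumptions the results are roughly 7.8% and 66.7%. Neither matches the 80%, 30%, or 20% answers produced by substituting or subtracting the conditional probabilities.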

Reading on Bayes


- Wikipedia on Bayes' Theorem, Stanford Encyclopedia of Philosophy on Bayes' Theorem.
- This article on Visualizing Bayes' theorem is simple and clear, with pictures.
- An Intuitive Explanation of Bayes' Theorem includes some simple Java applets for interactively playing with Bayes' theorem parameters.

Probability interpretations

Frequentist vs Subjectivist argument. From the preface of
Probability Theory: The Logic of Science
(page xi):

For many years there has been controversy over "frequentist" versus
"Bayesian" methods of inference, in which the writer has been an
outspoken partisan on the Bayesian side.  The record of this up to 1981
is given in an earlier book (Jaynes, 1983). In these old works there was
a strong tendency, on both sides, to argue on the level of philosophy or
ideology. We can now hold ourselves somewhat aloof from this because,
thanks to recent work, there is no longer any need to appeal to such
arguments. We are now in possession of proven theorems and masses of
worked-out numerical examples. As a result, the superiority of Bayesian
methods is now a thoroughly demonstrated fact in a hundred different
areas. One can argue with a philosophy; it is not so easy to argue with a
computer printout, which says to us: "Independently of all your
philosophy, here are the facts of actual performance." We point this out
in some detail whenever there is a substantial difference in the final
results. Thus we continue to argue vigorously for the Bayesian methods;
but we ask the reader to note that our arguments now proceed by citing
facts rather than proclaiming a philosophical or ideological position.

See Wikipedia on
Probability Interpretations
and the Stanford Encyclopedia of Philosophy on
Interpretations of Probability.

Bayesian inference

Does Bayesian inference include the Scientific Method as a special case?
From the Wikipedia page on Bayesian Inference:

The use of Bayes' theorem by jurors is controversial. In the United
Kingdom, a defence expert witness explained Bayes' theorem to the jury in
R v Adams. The jury convicted, but the case went to appeal on the basis
that no means of accumulating evidence had been provided for jurors who
did not wish to use Bayes' theorem. The Court of Appeal upheld the
conviction, but it also gave the opinion that "To introduce Bayes'
Theorem, or any similar method, into a criminal trial plunges the jury
into inappropriate and unnecessary realms of theory and complexity,
deflecting them from their proper task."

Further reading


- The preface to Probability Theory: The Logic of Science is excellent (via Gene).
- This Udacity Intro to statistics course Bayes' rule lesson has some good diagrams (via Eu).

Misc

From Probability Theory: The Logic of Science, page xi:

There are two more, even stronger reasons for placing our primary
emphasis on logic and clarity. Firstly, no argument is stronger than the
premises that go into it, and as Harold Jeffreys noted, those who lay the
greatest stress on mathematical rigor are just the ones who, lacking a
sure sense of the real world, tie their arguments to unrealistic premises
and thus destroy their relevance.  Jeffreys likened this to trying to
strengthen a building by anchoring steel beams into plaster. An argument
which makes it clear intuitively why a result is correct, is actually
more trustworthy and more likely of a permanent place in science, than is
one that makes a great overt show of mathematical rigor unaccompanied by
understanding.


Secondly, we have to recognize that there are no really trustworthy
standards of rigor in a mathematics that has embraced the theory of
infinite sets. Morris Kline (1980, p. 351) came close to the Jeffreys
simile: "Should one design a bridge using theory involving infinite sets
or the axiom of choice? Might not the bridge collapse?" The only real
rigor we have today is in the operations of elementary arithmetic on
finite sets of finite integers, and our own bridge will be safest from
collapse if we keep this in mind.