Skip to content

Instantly share code, notes, and snippets.

@dakridas
Created April 28, 2017 18:42
Show Gist options
  • Star 2 You must be signed in to star a gist
  • Fork 1 You must be signed in to fork a gist
  • Save dakridas/9c377ac38f60c22c71be9885a3bd1203 to your computer and use it in GitHub Desktop.
Save dakridas/9c377ac38f60c22c71be9885a3bd1203 to your computer and use it in GitHub Desktop.
Probability vs Statistical model

Contents

Probability model

A probability model is a mathematical representation of a random phenomenon. It is defined by its sample space,
events within the sample space, and probabilities associated with each event.

  • The sample space S for a probability model is the set of all possible outcomes.
  • An event A is a subset of the sample space S.
  • A probability is a numerical value assigned to a given event A. The probability of an event is written P(A), and describes the long-run relative frequency of the event.

The first two basic rules of probability are the following:

  • Any probability P(A) is a number between 0 and 1 (0 < P(A) < 1).
  • The probability of the sample space S is equal to 1 (P(S) = 1).

Statistical model

A statistical model is a special class of mathematical model. What distinguishes a statistical model from other
mathematical models is that a statistical model is non-deterministic. Thus, in a statistical model specified via
mathematical equations, some of the variables do not have specific values, but instead have probability distributions;
i.e. some of the variables are stochastic. In the example above, ε is a stochastic variable; without that variable, the
model would be deterministic.

Statistical models are often used even when the physical process being modeled is deterministic. For instance, coin
tossing is, in principle, a deterministic process; yet it is commonly modeled as stochastic (via a Bernoulli process).

There are three purposes for a statistical model, according to Konishi & Kitagawa.[4]

  • Predictions
  • Extraction of information
  • Description of stochastic structures

Differences between a statistical model and a probability model

The main difference is that a probability model is only one (known) distribution, while a statistical model is a set of
probability models; the data is used to select a model from this set or a smaller subset of models that better (in a certain sense) describe the phenomenon (in the light of the data).

Data Science

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment