MVJ
12 April, 2018
Now that the reports are done, you need to pick hypotheses for your second report.
These hypotheses must follow the sentence types on the Hypotheses page on the course website.
Example:
mean(datasubset$Variable) is [not equal to] / [larger than] / [smaller than] M, for some specific value M.
A specific hypothesis of this type could be: mean(iris$Sepal.Length) is larger than 6.
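To see the sample estimate such a hypothesis refers to, you can compute it directly (iris ships with R). This is only the point estimate; whether the data support the hypothesis is a separate inferential question.

```r
# Sample mean of the variable named in the example hypothesis
mean(iris$Sepal.Length)
## [1] 5.843333
```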
If you are estimating values from your data, you must use the same subset that you used for the first report.
 | Frequentist | Bayesian |
---|---|---|
Origin | Analyzing gambling | Analyzing evidence |
Interpretation | Expected long-term proportions after many repetitions | Quantified degree of belief |
Both approaches follow the same basic algebraic laws.
A phenomenon is random if individual outcomes are uncertain, but there is a regular distribution of outcomes in a large number of repetitions.
The probability of any outcome of a random phenomenon is the proportion of times that outcome would occur in a very long series of repetitions.
Note that the book uses a frequentist perspective.
Toss a fair coin. We expect heads & tails to come up approximately equally often.
Roll a fair 6-sided die. We expect each number to come up approximately equally often.
Both the coin toss and the die roll are examples of discrete random distributions:
To each of a finite (countable) number of outcomes is assigned a probability, with the sum of probabilities being equal to 1.
Coin toss | Probability | Die roll | Probability |
---|---|---|---|
Heads | 0.5 | 1 | 1/6 |
Tails | 0.5 | 2 | 1/6 |
 | | 3 | 1/6 |
 | | 4 | 1/6 |
 | | 5 | 1/6 |
 | | 6 | 1/6 |
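The "regular distribution of outcomes in a large number of repetitions" can be illustrated by simulation; a small sketch in R:

```r
set.seed(1)  # for a reproducible illustration
rolls <- sample(1:6, size = 10000, replace = TRUE)  # 10000 fair die rolls
table(rolls) / 10000  # empirical proportions, each close to 1/6
```

With 10000 rolls, every proportion lands near 1/6 ≈ 0.167, as the definition of probability above suggests.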
To completely specify a model for a random phenomenon, we need:
The sample space of a random phenomenon is the set of all possible outcomes.
An event is a set of possible outcomes.
Probability is a function that takes an event \(A\) and produces a number \(0\leq\mathbb{P}(A)\leq1\).
Discuss in pairs:
What is the sample space for…
Benford’s Law: in many “naturally occurring” collections of numbers (tax returns, payment records, expense account claims, …) the first digit follows a distinct probability distribution:
Two events are independent if knowing that one occurs does not change the probability of the other one occurring.
If \(A\) and \(B\) are independent events, then \[ \mathbb{P}(A\text{ and }B) = \mathbb{P}(A)\cdot\mathbb{P}(B) \]
Coins don’t have memory: so subsequent coin tosses can be considered independent.
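The multiplication rule for independent events can be checked by simulation; a minimal sketch in R for two coin tosses:

```r
# P(heads on toss 1 AND heads on toss 2) = 0.5 * 0.5 = 0.25
set.seed(1)
toss1 <- sample(c("H", "T"), 10000, replace = TRUE)
toss2 <- sample(c("H", "T"), 10000, replace = TRUE)
mean(toss1 == "H" & toss2 == "H")  # proportion of double heads, close to 0.25
```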
Once cards are removed from a deck, the proportions of the remaining cards change: the probability of the first card being red is \(26/52\), while the probability of the second card being red is \(25/51\) if the first card was red and \(26/51\) if it was black. The two draws are therefore not independent.
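The dependence between the two draws shows up in simulation as well; a sketch in R:

```r
# Draw two cards without replacement from a 26-red / 26-black deck, many times
set.seed(1)
draws <- replicate(10000, {
  deck <- rep(c("red", "black"), each = 26)
  sample(deck, 2)  # two cards without replacement
})
mean(draws[2, ] == "red")                     # unconditional: about 26/52 = 0.5
mean(draws[2, draws[1, ] == "red"] == "red")  # given first red: about 25/51
```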
Sample spaces are collections of possible outcomes. A numeric value assigned to each outcome produces a random variable.
Example: A craps roll has as its sample space the 36 possible pairs of dice outcomes.
The payout for a particular craps bet is a random variable.
A discrete random variable has a finite (countable) set of possible values.
A discrete random variable can be specified by giving a probability to each possible value.
A continuous random variable takes values in a continuous range of numbers, such as an interval.
A continuous random variable has probability 0 for any single specific value. Instead, for continuous variables, probabilities are assigned to ranges.
The probability of a range is the area under the density curve for that range.
We have already seen the normal distribution. This is determined by a mean \(\mu\) and a standard deviation \(\sigma\).
The uniform distribution has a constant density curve. This is determined by the range of the constant density.
The binomial distribution counts the number of successes in \(n\) repeated trials of constant success probability \(p\).
The Poisson distribution counts the number of events in a constant rate process with an average count of \(\lambda\) events per time unit.
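Each of these four distributions has a built-in density (or probability mass) function in R; the parameter values below are arbitrary illustrations:

```r
dnorm(0, mean = 0, sd = 1)        # normal density at 0
dunif(0.3, min = 0, max = 1)      # uniform density, constant 1 on [0, 1]
dbinom(3, size = 10, prob = 0.5)  # P(3 successes in 10 trials)
dpois(2, lambda = 1.5)            # P(2 events) at rate 1.5 per time unit
```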
Suppose you gamble with a consistent bet: every time you play, you have a 1% probability of winning $ 100 and a 99% probability of losing $ 5.
After playing 1000 times, you expect to have won 10 times and lost 990 times.
This produces an overall gain of $ 100 \(\times\) 10 = $ 1000 and a loss of $ 5 \(\times\) 990 = $ 4950, for a net loss of $ 3950.
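The same calculation, done per play and scaled up, is a direct application of the expected value defined below; in R:

```r
p <- c(0.01, 0.99)   # probabilities of winning and losing
x <- c(100, -5)      # corresponding payouts in dollars
sum(x * p)           # expected value per play: -3.95
sum(x * p) * 1000    # expected total after 1000 plays: -3950
```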
We define the expected value or mean of a discrete random variable to be \[ \mu_X = \mathbb{E}X = \sum_x x\cdot\mathbb{P}(x) \]
For a continuous random variable, the sum becomes an integral and the expected value is \[ \mathbb{E}X = \int x\cdot p(x)dx \]
 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 |
---|---|---|---|---|---|---|---|---|---|
Uniform | 1/9 | 1/9 | 1/9 | 1/9 | 1/9 | 1/9 | 1/9 | 1/9 | 1/9 |
Benford | .301 | .176 | .125 | .097 | .079 | .067 | .058 | .051 | .046 |
Uniform mean is \[ 1/9 + 2/9 + 3/9 + 4/9 + 5/9 + 6/9 + 7/9 + 8/9 + 9/9 = 45/9 = 5 \]
Mean first digit in Benford’s law is \[\begin{multline*} 1\cdot.301+2\cdot.176+3\cdot.125+4\cdot.097+5\cdot.079+\\ 6\cdot.067+7\cdot.058+8\cdot.051+9\cdot.046 \approx 3.441 \end{multline*}\]
The Law of Large Numbers:
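The Benford probabilities in the table follow the formula \(\mathbb{P}(d)=\log_{10}(1+1/d)\), so the mean first digit can be computed directly; a sketch in R:

```r
d <- 1:9
benford <- log10(1 + 1/d)  # .301, .176, .125, ... as in the table
sum(d * benford)           # mean first digit, approximately 3.44
```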
As the sample size grows larger, the sample mean \(\overline x\) gets closer to the distribution mean \(\mu\).
This holds for any distribution (as long as the mean and standard deviation are finite) and we can calculate how many samples we need to reach the precision we want.
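A simulation makes the convergence visible; a sketch in R using fair die rolls, where \(\mu = 3.5\):

```r
set.seed(1)
rolls <- sample(1:6, size = 100000, replace = TRUE)
running_mean <- cumsum(rolls) / seq_along(rolls)  # sample mean after each roll
running_mean[c(10, 100, 10000, 100000)]  # drifts toward mu = 3.5 as n grows
```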
Just like the mean can be defined and used for random variables, the standard deviation and the variance can too.
The variance \(\sigma^2_X\) of a random variable \(X\) is the mean square deviation from the mean.
\[ \sigma^2_X = \mathbb{E}[(X-\mu_X)^2] \]
The standard deviation \(\sigma_X\) is the square root of the variance.
X | \(\mathbb{P}\) | \(X\cdot\mathbb{P}\) | \((X-\mu_X)^2\mathbb{P}\) |
---|---|---|---|
1 | 1/6 | 0.167 | 1.042 |
2 | 1/6 | 0.333 | 0.375 |
3 | 1/6 | 0.500 | 0.042 |
4 | 1/6 | 0.667 | 0.042 |
5 | 1/6 | 0.833 | 0.375 |
6 | 1/6 | 1.000 | 1.042 |
\(\mu_X = 3.5\)
\(\sigma^2_X \approx 2.917\)
\(\sigma_X \approx 1.708\)
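These die-roll values can be verified directly from the definitions; in R:

```r
x <- 1:6
p <- rep(1/6, 6)              # fair die: each face has probability 1/6
mu <- sum(x * p)              # mean: 3.5
sigma2 <- sum((x - mu)^2 * p) # variance: 35/12, approximately 2.917
sqrt(sigma2)                  # standard deviation: approximately 1.708
```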
The correlation between random variables controls how variances combine.