Review for Test 2

Test 2 will be November 20th, and cover the following sections from the book: 4.1, 4.2, 4.3, 4.4; 5.2, 5.3; 6.1, 6.2; and 7.1.

Here are some sample questions. There are many more potential questions that could appear on the test.

Question

An experiment consists of flipping a coin and tossing one die. Describe the sample space; what outcomes are in the event ``a heads and even roll’’. What is the probability of this event? What assumptions did you make to compute the probability?

Question

In computing the normal, we use \(P(Z > z) = 1 - P(Z \leq z)\). What ``rule’’ allows us to do this? Compute \(P(Z > 2.3.4)\).

Question

If we toss two coins (a penny and a nickel); is the result on the penny disjoint from the result on the nickel? If not, what is their relationship (using probability).

Question

Suppose we have a probability distribution:

k     1    2    3   4
-------------------------
P(k) .43  .25  .17  x

What is x? What is the mean of the probability distribution?

Question

Let \(X\) be a random variable with distribution given by:

k       0    1    2    3    4
-----------------------------
P(X=k) .10  .10  .20   x   .30

Find x. Compute \(\mu\). Compute \(\sigma^2\),

Question

Let \(X\) be a continuous random variable with density given by \(f(x) = 1/2\) for \(0 \leq x \leq 2\) and \(0\) otherwise. What is \(P(1/2 < X < 7/8)\)? Using symmetry, what is \(\mu\)?

Question

Let \(X\) have a distribution:

k         -1   0   1
---------------------
P(X=k)    1/4 1/2 1/4

Compute \(\mu\) and \(\sigma\) for this random variable.

Suppose, we have \(n\) independent random variables, \(X_1\), \(X_2\), … with this same distribution. Let \(S = X_1 + X_2 + \dots + X_n\). What is \(\mu_S\), \(\sigma_S\)?

Question

The law of large numbers states if \(\mu\) is the population mean and \(X_1, X_2, \dots X_n\) is an i.i.d sample from this population, the \(\bar{x}\) approaches the mean of \(\mu\) as closely as you have specified and then stays that close.

Suppose you toss a fair coin 100 times and the you have 30 heads and 70 tails. Does the law of large numbers say in the next 100 coin tosses you will have 70 heads and 30 tails?

Question

At the beginning of the term, we used the terms ‘shape’, ‘center’, and ‘spread’ to describe a distribution. For large \(n\), describe the distribution of \(\bar{x}_n\) using these terms. Be as specific as you can be. Did you make any assumptions? How do you know you are right?

Question

In Chapter 3, a principle of experimental design is ``Repeat’’ (each treatment on many units to reduce chance variation in the results). Give an example where you have a formula that demonstrates this.

Question

A population has mean \(\mu{}\), standard deviation \(\sigma{}\) and median \(M\).

Let \(X_1\), \(X_2\), …, \(X_n\) be an i.i.d. sample of size \(n\) from this population. Assume \(n\) is ``large enough’’ for the central limit to apply.

Which is true: \(P(X_1 > \mu) = 1/2\) or \(P(X_1 > M) = 1/2\)?

Which is true \(P(\bar{x}_n > \mu = 1/2)\) or \(P(\bar{x}_n > M) = 1/2\)?

Which is true \(\mu_{\bar{x}} = \mu{}\) or \(\mu_{\bar{x}} = \mu/\sqrt{n}\)?

Which is true \(\sigma_{\bar{x}} = \sigma{}\) or \(\sigma_{\bar{x}} = \sigma/\sqrt{n}\)? (or both?)

Question

A population has mean \(\mu=0\), standard deviation \(\sigma=6.74\) and \(X_1\), \(X_2\), , \(X_n\) is an i.i.d sample with \(n\) large enough for the central limit theorem to apply.

How large must \(n\) be so that \(P(-1 < \bar{x} < 1) = 1/2\)? (What must the \(z\)-score of \(1\) be?)

Question

Assume the mean number of skittles in a standard size bag is 15 and the standard deviation is 2. Assume the population of skittles in a bag is ``bell shaped’’ (whatever that means for a discrete distribution).

What is the probability that there are \(21\) or more skittles in a given bag?

Now consider 10 bags. What is the probability the average number of skittles in the bags is 18 or more?

Now consider 100 bags. What is the probability the average number of skittles in the bags is 16 or more?

Question

The binomial distribution is used to count the number of ``successes’’ in a fixed number (\(n\)) of trials. What scenarios below will it apply to:

  • Toss a coin times, let \(X\) be the number of heads
  • Roll a die 100 times, let \(X\) be the number of twos.
  • Survey 100 Americans at random (with replacement), let \(X\) be the number who have been to Staten Island
  • Ask your friends if they can drive you home. Let \(X\) be the number of friends you ask until you find a ride.

Question

Let \(p\) be a population proportion and \(\hat{p}\) be a sample proportion from a SRS of size \(n\) drawn from the large population.

Let \(n=123\) and \(p = 1/23\). What is \(\mu_{\hat{p}}\)? What is \(\sigma_{\hat{p}}\)?

Is \(np > 10\)?

Use the normal approximation to compute \(P(\hat{p} > 0.10)\).

Question

Toss a fair coin 250 times. What is the probability you have 150 or more heads? (Use the normal approximation to give an answer).

Question

A survey of 1000 people answering yes and no is taken. If the population proportion of ``yes’’ answerers is assumed to be \(0.60\), what is the probability that 62% or more in the survey will answer yes? (Use the normal approximation to give an answer).

Question

Mary finds a 90% confidence interval for some \(\mu\) by taking a sample and John finds his own 90% confidence interval for the same \(\mu\) by taking a survey. It turns out they did not overlap on any values. Can they both have computed their values properly? Explain.

Question

How much sleep do CSI students get, on average? To investigate, a student researcher interviewed 12 students at random and found this summary data:

xbar  s   n
-----------
 6.7 1.2  12
  • If the student assumes the population standard deviation is \(\sigma=1\), find the \(90\%\) confidence interval based on this data.

  • If the student does not assume the population standard deviation is \(\sigma=1\), but does assume the data is from a normal population, find the \(90\%\) confidence interval based on this data.

Question

A researcher knows that a population of interest is normally distributed with unknown mean \(\mu\), but known standard deviation, \(\sigma=10\). To estimate \(\mu\), she will take a random sample. How large a random sample is needed so that a 90% confidence interval has a margin of error of \(1\). Repeat with a 99% confidence interval.

Question

A CSI student research wants to know if students get less sleep around final exam period. From earlier work, she is confident that the population mean for amount of sleep in normal class periods is 7 hours with a population standard deviation of 1.25. During finals, she also assumes a normal distribution of sleep times, and she takes a survey of 25 students, randomly selected, and computes this sample data:

xbar   s   n
-------------
6.8   1.1  25
  • Write out a null and alternative hypothesis corresponding to the question of whether students get less sleep.

  • Assuming \(\sigma=1.25\) applies to finals week sleep times, what test statistic would you use for a significance test? What would be its sampling distribution under the null hypothesis?

  • Assuming \(\sigma=1.25\) does not apply to finals week sleep times, what test statistic would you use for a significance test? What would be its sampling distribution under the null hypothesis?

  • Assuming \(\sigma=1.25\) applies to finals week sleep times, is the computed \(p\)-value less than \(\alpha=0.05\)?

  • Assuming \(\sigma=1.25\) does not apply to finals week sleep times, is the computed \(p\)-value less than \(\alpha=0.05\)?

  • Were you a student researcher, would you think assuming \(\sigma\) is the same for finals period and the regular class period to be a reasonable or unreasonable assumption?

Question

A student researcher wants to know if men and women play computer games the same amount of time per week on average. She has limited manpower and knows data may be quite variable, so she constructs a matched sample experiment, where she takes college aged men and women who are dating and finds their respective times. The data she finds is given here:

Partnership  1    2   3   4   5   6 |  xbar  s     n
----------------------------------------------------
Male         3    13  12  0   4  25 |   9.5  9.18  6
Female       0     9   9  5   5   4 |   5.3  3.38  6
----------------------------------------------------
MF           3     4   3 -5  -1  21 |   4.2  8.91  6
  • The sample average for males is more than that of females, but is the difference statistically significant. Construct a one-sided test that the difference of means is greater than \(0\) (\(\mu_{male}\) \(>\) \(\mu_{female}\)). That is what is \(H_0\) and \(H_a\)?

  • What test statistic will you use, what is its sampling distribution (assuming normal populations, which isn’t really such a good idea).

  • What \(p\)-value was found?