Definitions
Goal
Use trends in a sample to make inference about a population.
| Size | Proportion | Mean | Standard Deviation | |
|---|---|---|---|---|
| Population | N | \(\pi\) | \(\mu\) | \(\sigma\) |
| Sample | n | \(p\) or \(\hat{\pi}\) | \(\bar{x}\) | \(s\) |
Let \(X_1,...,X_n\) be a sample of data points from a population of \(N\) data points, then:
\[s = \sqrt{\frac{1}{n-1}\sum_{i=1}^n (X_i-\bar{X})^2}\] \[\sigma = \sqrt{\frac{1}{N}\sum_{i=1}^n (X_i-\mu)^2}\]
In a survey of 1,500 parents in the United States, 73% said they wanted to resume in person school for their children for 2021.
What is the population?
What is the sample?
What “notation” is used to represent 73%
Random Sampling: how you draw the observations from the population
Random Assignment: how you assign the sample into treatment/control groups
![]()
You want to evaluate the effectiveness of the Pfizer vaccine on COVID. You ask for 1,000 volunteers and randomly give half of the volunteers the vaccine and the other half a placebo dose. These people get tested for COVID every week for a year and we record who tests positive for COVID during this time and their symptoms.
What type of conclusions can we draw?
![]()
Northwestern University is deciding which undergraduate students to test for COVID this week and due to limited resources they cannot test the entire undergraduate student body at once.
The university divides the population into 4 groups {1st year, sophomore, junior, seniors} and randomly selects 200 students from each group to test.
What type of sampling is this?
The university alphabetizes the population by last name and selects every \(25^{th}\) student to get tested.
What type of sampling is this?
The university alphabetizes the population by last name and selects the first 200 students to get tested.
What type of sampling is this?
