L2 & L3 Social Science Statistics
By the end of lesson 2, you should be able to:
- State the five steps of the Statistical Process
- Distinguish between an observational study and an experiment.
- Distinguish between a population and a sample.
- Distinguish between a categorical and a quantitative variable.
- Distinguish and give an example of each of the following sampling schemes:
- Simple random sampling
- Systematic sampling
- Cluster sampling
- Stratified sampling
- Convenience sampling
- Explain the significance of using a random sample
>simple random sample (SRS) is the best method for obtaining a sample from a population.
>stratified sample is when the items to be sampled are organized in groups of homogeneous (similar) items called strata
By the end of lesson 3, you should be able to:
- Determine the shape of a distribution using a histogram.
- Interpret the mean, median, and mode for quantitative data.
- Apply the Excel functions AVERAGE, MEDIAN, and MODE when working with quantitative data in Excel.
- Determine the location of the mean relative to the median of left-skewed, right-skewed, or bell-shaped distributions visually using a histogram.
- Interpret a histogram.
>the purpose of the histogram is to identify the shape of the data
>a histogram w/ more than 2 peaks is called multimodal
- Distinguish between a parameter and a statistic
*Graphing helps you to see:
>oultliers
>shape, center, and spread of distribution
REMEMBER:
*Left skewed resembles your left foot with your big toe being the peak of the graph
* Right skewed resembles your right foot with your big toe being the peak of the graph.
>If your histogram has 30-40 and then 40-50, you can represent which bar of the graph contains the number 40 by using a square bracket for the bar that includes the number 40 and then using a rounded bracket to indicate that that bar contains everything after 40.
*parameter: numbers that describe a population. They are typically the unknown quantities that we are trying to estimate
*Statistics: numbers that are computed from the sample data
>If no numbers are repeated then there will be no mode.
>There can be more than 1 mode
*If distribution is symmetric use mean
*If distribution is skewed use median because the data will be less affected by outliers.
>In a right skewed distribution the mean will always be greater than the median.
>In a left skewed distribution the mean will always be less than the median.
Comments
Post a Comment