![]() |
VOOZH | about |
Sampling distribution is essential in various aspects of real life, essential in inferential statistics. A sampling distribution represents the probability distribution of a statistic (such as the mean or standard deviation) that is calculated from multiple samples of a population. It helps us to understand how a statistic varies across different samples and is crucial for making inferences about the population.
Sampling distribution is the probability distribution of a statistic based on random samples of a given population. It is also know as finite distribution.
In this article, we will discuss the Sampling Distribution in detail and its types, along with examples, and go through some practice questions, too.
Some important terminologies related to sampling distribution are given below:
The variability of a sampling distribution is measured by standard error or population variance, depending on the context and the type of inference required. Both measure how spread out the data is around the mean.
Main factors influencing the variability of a sampling distribution are:
3 main types of sampling distributions are:
The sampling distribution of the mean refers to the probability distribution of sample means that you get by repeatedly taking samples (of the same size) from a population and calculating the mean of each sample.
For any population with mean µ and standard deviation σ:
= µ
There is no tendency for a sample mean to fall systematically above or below µ, even if the distribution of the raw data is skewed. Thus, the mean of the sampling distribution is an unbiased estimate of the population mean µ.
= σ/√n
Where
Sampling distribution of a proportion focuses on proportions in a population. Here, you select samples and calculate their corresponding proportions. The means of the sample proportions from each group represent the proportion of the entire population.
The formula for the sampling distribution of a proportion (often denoted as p̂) is:
p̂ = x/n
Where:
This formula calculates the proportion of occurrences of a certain event (e.g., success, positive outcome) within a sample.
Sampling distribution involves a small population or a population about which you don't know much. It is used to estimate the mean of the population and other statistics such as confidence intervals, statistical differences, and linear regression. T-distribution uses a t-score to evaluate data that wouldn't be appropriate for a normal distribution.
The formula for the t-score, denoted as t, is:
t = [x - μ] / [s /√(n)]
Where:
This formula calculates the difference between the sample mean and the population mean, scaled by the standard error of the sample mean. The t-score helps to assess whether the observed difference between the sample and population means is statistically significant.
Example 1: Mean and standard deviation of the tax value of all vehicles registered in a certain state are μ=$13,525 and σ=$4,180. Suppose random samples of size 100 are drawn from the population of vehicles.
Find
Solution:
Since n = 100, the formulas yield
μx̄ = μ = $13,525
σx̄ = σ / √n = $4180 / √100
σx̄ =$418
Example 2: A prototype automotive tire has a design life of 38,500 miles with a standard deviation of 2,500 miles. Five such tires are manufactured and tested. On the assumption that the actual population mean is 38,500 miles and the actual population standard deviation is 2,500 miles, find the probability that the sample mean will be less than 36,000 miles. Assume that the distribution of lifetimes of such tires is normal.
Solution:
Here, we will assume and use units of thousands of miles.
Then sample mean x̄ has
- Mean: μx̄ = μ = 38.5
- Standard Deviation: σx̄ = σ/√n = 2.5/√5 = 1.11803
Since the population is normally distributed, so is x̄, hence,
P (X < 36) = P(Z < {36 - μx̄}/σx̄)
P (X < 36) = P(Z < {36 - 38.5}/1.11803)
P (X < 36) = P(Z < -2.24)
P(X < 36) = 0.0125
Therefore, if the tires perform as designed then there is only about a 1.25% chance that the average of a sample of this size would be so low.
Question 1: Random samples of size 225 are drawn from a population with a mean of 100 and a standard deviation of 20. Find the mean and standard deviation of the sample mean.
Question 2: Random samples of size 64 are drawn from a population with a mean of 32 and a standard deviation of 5. Find the mean and standard deviation of the sample mean.
Question 3: A population has a mean of 75 and a standard deviation of 12.
Question 4: A population has a mean of 5.75 and a standard deviation of 1.02.
Question 5: The Numerical population of grade point averages at a college has a mean of 2.61 and a standard deviation of 0.5. If a random sample of size 100 is taken from the population, what is the probability that the sample mean will be between 2.51 and 2.71?
Question 6: Random samples of size 1,600 are drawn from a population in which the proportion with the characteristic of interest is 0.05. Decide whether or not the sample size is large enough to assume that the sample proportion is normally distributed.