We are given a dataset of pre-test scores ($X_1$) and post-test scores ($X_2$) for a group of students. We want to determine if there was a statistically significant increase in scores from the pre-test to the post-test at a significance level of $\alpha = 0.05$, and if so, by how much did the scores increase. We need to conduct a paired t-test.

Probability and StatisticsPaired t-testStatistical SignificanceHypothesis TestingMeanStandard DeviationP-value
2025/4/14

1. Problem Description

We are given a dataset of pre-test scores (X1X_1) and post-test scores (X2X_2) for a group of students. We want to determine if there was a statistically significant increase in scores from the pre-test to the post-test at a significance level of α=0.05\alpha = 0.05, and if so, by how much did the scores increase. We need to conduct a paired t-test.

2. Solution Steps

First, calculate the difference D=X2X1D = X_2 - X_1 for each student. Then calculate the mean of the differences Dˉ\bar{D} and the standard deviation of the differences sDs_D. After that, calculate the t-statistic and finally the p-value.
| Pretest (X1X_1) | Posttest (X2X_2) | D = X2X1X_2 - X_1 |
|---|---|---|
| 90 | 88 | -2 |
| 60 | 67 | 7 |
| 95 | 99 | 4 |
| 93 | 94 | 1 |
| 94 | 100 | 6 |
| 68 | 64 | -4 |
| 88 | 91 | 3 |
| 91 | 95 | 4 |
| 93 | 95 | 2 |
| 84 | 89 | 5 |
| 76 | 82 | 6 |
| 86 | 92 | 6 |
| 83 | 83 | 0 |
| 81 | 85 | 4 |
| 89 | 93 | 4 |
| 65 | 69 | 4 |
| 91 | 90 | -1 |
| 92 | 100 | 8 |
| 87 | 95 | 8 |
| 92 | 96 | 4 |
n=20n = 20
Calculate the mean difference Dˉ\bar{D}:
Dˉ=Dn=2+7+4+1+64+3+4+2+5+6+6+0+4+4+41+8+8+420=7320=3.65\bar{D} = \frac{\sum D}{n} = \frac{-2 + 7 + 4 + 1 + 6 - 4 + 3 + 4 + 2 + 5 + 6 + 6 + 0 + 4 + 4 + 4 - 1 + 8 + 8 + 4}{20} = \frac{73}{20} = 3.65
Calculate the standard deviation of the differences sDs_D:
First calculate D2=(2)2+72+42+12+62+(4)2+32+42+22+52+62+62+02+42+42+42+(1)2+82+82+42=4+49+16+1+36+16+9+16+4+25+36+36+0+16+16+16+1+64+64+16=439 \sum D^2 = (-2)^2 + 7^2 + 4^2 + 1^2 + 6^2 + (-4)^2 + 3^2 + 4^2 + 2^2 + 5^2 + 6^2 + 6^2 + 0^2 + 4^2 + 4^2 + 4^2 + (-1)^2 + 8^2 + 8^2 + 4^2 = 4 + 49 + 16 + 1 + 36 + 16 + 9 + 16 + 4 + 25 + 36 + 36 + 0 + 16 + 16 + 16 + 1 + 64 + 64 + 16 = 439
sD=D2nDˉ2n1=43920(3.65)2201=43920(13.3225)19=439266.4519=172.5519=9.08157893.0135658s_D = \sqrt{\frac{\sum D^2 - n\bar{D}^2}{n-1}} = \sqrt{\frac{439 - 20(3.65)^2}{20-1}} = \sqrt{\frac{439 - 20(13.3225)}{19}} = \sqrt{\frac{439 - 266.45}{19}} = \sqrt{\frac{172.55}{19}} = \sqrt{9.0815789} \approx 3.0135658
Calculate the t-statistic:
t=DˉsD/n=3.653.0135658/20=3.653.0135658/4.4721359=3.650.67375.418t = \frac{\bar{D}}{s_D / \sqrt{n}} = \frac{3.65}{3.0135658 / \sqrt{20}} = \frac{3.65}{3.0135658 / 4.4721359} = \frac{3.65}{0.6737} \approx 5.418
Degrees of freedom df=n1=201=19df = n - 1 = 20 - 1 = 19
Significance level α=0.05\alpha = 0.05
Since we are testing for an improvement, this is a one-tailed test. Looking up the t-critical value for a one-tailed test with df=19df = 19 and α=0.05\alpha = 0.05, we find tcritical=1.729t_{critical} = 1.729.
Since our calculated t-statistic (5.4185.418) is greater than the t-critical value (1.7291.729), we reject the null hypothesis that there is no improvement. Therefore, the scores significantly increased.
Alternatively, we can calculate the p-value. Using a t-distribution calculator with t=5.418t = 5.418 and df=19df = 19, we find the one-tailed p-value to be approximately 0.000010.00001. Since this p-value is less than α=0.05\alpha = 0.05, we reject the null hypothesis and conclude that the scores significantly increased.
The estimated increase in scores is the mean difference Dˉ=3.65\bar{D} = 3.65.

3. Final Answer

Yes, scores increased. The scores increased by 3.65 on average.

Related problems in "Probability and Statistics"

We are given two sets of scores, Time 1 ($X_1$) and Time 2 ($X_2$), for 12 subjects. We need to cal...

Hypothesis TestingPaired t-testStatistical SignificanceMeanStandard Deviation
2025/4/14

The problem provides a table showing the performance of 10 students in Chemistry ($x$) and Physics (...

Descriptive StatisticsMeanScatter DiagramLinear Regression
2025/4/13

The problem asks to construct a 99% confidence interval for the average land temperature from 1981-2...

Confidence IntervalT-distributionSample MeanSample Standard DeviationStatistical Inference
2025/4/13

We are given the marks obtained by 8 students in Mathematics and Physics tests. We are asked to calc...

Spearman's Rank CorrelationCorrelationStatistics
2025/4/13

The problem consists of two parts. Part (a) requires drawing a Venn diagram based on two given state...

Venn DiagramsMeanVarianceFrequency Distribution
2025/4/13

Two fair dice, A and B, each with faces numbered 1 to 6, are thrown together. We need to: a) Constru...

ProbabilityDiscrete ProbabilityDiceSample SpaceEvents
2025/4/13

The problem provides a frequency table showing the distribution of heights of seedlings in a nursery...

MeanVarianceFrequency DistributionDescriptive Statistics
2025/4/13

Researchers are interested in the mean age of a certain population. A random sample of 10 individual...

Hypothesis TestingP-valueStatistical SignificanceNormal DistributionMeanVariance
2025/4/13

The problem provides a frequency distribution of companies in the automotive sector based on their r...

Descriptive StatisticsFrequency DistributionHistogramsFrequency PolygonsCumulative Frequency CurvesMeasures of Central TendencyMeasures of DispersionSkewnessKurtosisGini IndexLorenz Curve
2025/4/13

The problem provides a frequency distribution table of the annual local taxes paid by the residents ...

Descriptive StatisticsFrequency DistributionMedianQuartilesMeanVarianceCoefficient of VariationSkewnessGini Index
2025/4/13