We are given a dataset of pre-test scores ($X_1$) and post-test scores ($X_2$) for a group of students. We want to determine if there was a statistically significant increase in scores from the pre-test to the post-test at a significance level of $\alpha = 0.05$, and if so, by how much did the scores increase. We need to conduct a paired t-test.

Probability and StatisticsPaired t-testStatistical SignificanceHypothesis TestingMeanStandard DeviationP-value
2025/4/14

1. Problem Description

We are given a dataset of pre-test scores (X1X_1) and post-test scores (X2X_2) for a group of students. We want to determine if there was a statistically significant increase in scores from the pre-test to the post-test at a significance level of α=0.05\alpha = 0.05, and if so, by how much did the scores increase. We need to conduct a paired t-test.

2. Solution Steps

First, calculate the difference D=X2X1D = X_2 - X_1 for each student. Then calculate the mean of the differences Dˉ\bar{D} and the standard deviation of the differences sDs_D. After that, calculate the t-statistic and finally the p-value.
| Pretest (X1X_1) | Posttest (X2X_2) | D = X2X1X_2 - X_1 |
|---|---|---|
| 90 | 88 | -2 |
| 60 | 67 | 7 |
| 95 | 99 | 4 |
| 93 | 94 | 1 |
| 94 | 100 | 6 |
| 68 | 64 | -4 |
| 88 | 91 | 3 |
| 91 | 95 | 4 |
| 93 | 95 | 2 |
| 84 | 89 | 5 |
| 76 | 82 | 6 |
| 86 | 92 | 6 |
| 83 | 83 | 0 |
| 81 | 85 | 4 |
| 89 | 93 | 4 |
| 65 | 69 | 4 |
| 91 | 90 | -1 |
| 92 | 100 | 8 |
| 87 | 95 | 8 |
| 92 | 96 | 4 |
n=20n = 20
Calculate the mean difference Dˉ\bar{D}:
Dˉ=Dn=2+7+4+1+64+3+4+2+5+6+6+0+4+4+41+8+8+420=7320=3.65\bar{D} = \frac{\sum D}{n} = \frac{-2 + 7 + 4 + 1 + 6 - 4 + 3 + 4 + 2 + 5 + 6 + 6 + 0 + 4 + 4 + 4 - 1 + 8 + 8 + 4}{20} = \frac{73}{20} = 3.65
Calculate the standard deviation of the differences sDs_D:
First calculate D2=(2)2+72+42+12+62+(4)2+32+42+22+52+62+62+02+42+42+42+(1)2+82+82+42=4+49+16+1+36+16+9+16+4+25+36+36+0+16+16+16+1+64+64+16=439 \sum D^2 = (-2)^2 + 7^2 + 4^2 + 1^2 + 6^2 + (-4)^2 + 3^2 + 4^2 + 2^2 + 5^2 + 6^2 + 6^2 + 0^2 + 4^2 + 4^2 + 4^2 + (-1)^2 + 8^2 + 8^2 + 4^2 = 4 + 49 + 16 + 1 + 36 + 16 + 9 + 16 + 4 + 25 + 36 + 36 + 0 + 16 + 16 + 16 + 1 + 64 + 64 + 16 = 439
sD=D2nDˉ2n1=43920(3.65)2201=43920(13.3225)19=439266.4519=172.5519=9.08157893.0135658s_D = \sqrt{\frac{\sum D^2 - n\bar{D}^2}{n-1}} = \sqrt{\frac{439 - 20(3.65)^2}{20-1}} = \sqrt{\frac{439 - 20(13.3225)}{19}} = \sqrt{\frac{439 - 266.45}{19}} = \sqrt{\frac{172.55}{19}} = \sqrt{9.0815789} \approx 3.0135658
Calculate the t-statistic:
t=DˉsD/n=3.653.0135658/20=3.653.0135658/4.4721359=3.650.67375.418t = \frac{\bar{D}}{s_D / \sqrt{n}} = \frac{3.65}{3.0135658 / \sqrt{20}} = \frac{3.65}{3.0135658 / 4.4721359} = \frac{3.65}{0.6737} \approx 5.418
Degrees of freedom df=n1=201=19df = n - 1 = 20 - 1 = 19
Significance level α=0.05\alpha = 0.05
Since we are testing for an improvement, this is a one-tailed test. Looking up the t-critical value for a one-tailed test with df=19df = 19 and α=0.05\alpha = 0.05, we find tcritical=1.729t_{critical} = 1.729.
Since our calculated t-statistic (5.4185.418) is greater than the t-critical value (1.7291.729), we reject the null hypothesis that there is no improvement. Therefore, the scores significantly increased.
Alternatively, we can calculate the p-value. Using a t-distribution calculator with t=5.418t = 5.418 and df=19df = 19, we find the one-tailed p-value to be approximately 0.000010.00001. Since this p-value is less than α=0.05\alpha = 0.05, we reject the null hypothesis and conclude that the scores significantly increased.
The estimated increase in scores is the mean difference Dˉ=3.65\bar{D} = 3.65.

3. Final Answer

Yes, scores increased. The scores increased by 3.65 on average.

Related problems in "Probability and Statistics"

The problem provides a frequency distribution table of marks obtained by students. Part (a) requires...

ProbabilityConditional ProbabilityWithout ReplacementCombinations
2025/6/5

The problem is divided into two questions, question 10 and question 11. Question 10 is about the fre...

Frequency DistributionCumulative FrequencyOgivePercentileProbabilityConditional ProbabilityCombinations
2025/6/5

A number is selected at random from the integers 30 to 48 inclusive. We want to find the probability...

ProbabilityPrime NumbersDivisibility
2025/6/3

The problem describes a survey where 30 people answered about their favorite book genres. The result...

PercentagesData InterpretationPie ChartFractions
2025/6/1

The problem asks us to determine if there is a statistically significant difference in promotion rat...

Hypothesis TestingChi-Square TestContingency TableStatistical SignificanceIndependence
2025/6/1

We are given a contingency table showing the number of students from different majors (Psychology, B...

Chi-Square TestContingency TableStatistical InferenceHypothesis Testing
2025/6/1

The problem describes a scenario where a pizza company wants to determine if the number of different...

Chi-Square TestGoodness-of-Fit TestHypothesis TestingFrequency DistributionP-value
2025/6/1

The problem asks to test the significance of three chi-square tests given the sample size $N$, numbe...

Chi-square testStatistical SignificanceDegrees of FreedomEffect SizeCramer's VHypothesis Testing
2025/5/29

The problem asks us to compute the expected frequencies for the given contingency table. The conting...

Contingency TableExpected FrequenciesChi-squared Test
2025/5/29

The problem asks us to estimate the chi-square value when $n=23$ and $p=99$, given a table of chi-sq...

Chi-square distributionStatistical estimationInterpolation
2025/5/27