The problem provides a dataset of average phone call durations. It requires converting this data into a grouped frequency table using Sturges' rule, plotting the less than and more than cumulative frequency curves on the same graph, calculating the geometric mean, finding the median, and determining the mode of the distribution.

Probability and StatisticsDescriptive StatisticsFrequency DistributionCumulative FrequencyGeometric MeanMedianModeSturges' RuleGrouped Data
2025/3/14

1. Problem Description

The problem provides a dataset of average phone call durations. It requires converting this data into a grouped frequency table using Sturges' rule, plotting the less than and more than cumulative frequency curves on the same graph, calculating the geometric mean, finding the median, and determining the mode of the distribution.

2. Solution Steps

(i) Convert the data into a grouped frequency table using Sturges' rule.
First, we need to find the number of classes (kk) using Sturges' rule:
k=1+3.322log10(n)k = 1 + 3.322 * log_{10}(n)
where nn is the number of observations. In this case, n=50n = 50 (10 rows and 5 columns).
k=1+3.322log10(50)k = 1 + 3.322 * log_{10}(50)
k=1+3.3221.69897k = 1 + 3.322 * 1.69897
k=1+5.6438k = 1 + 5.6438
k6.6438k \approx 6.6438
Since the number of classes must be an integer, we can round kk to

7. So we have 7 classes.

Next, we need to find the class width (ww):
w=Rangekw = \frac{Range}{k}
The range is the difference between the maximum and minimum values in the dataset.
Maximum value = 135
Minimum value = 12
Range = 135 - 12 = 123
w=123717.57w = \frac{123}{7} \approx 17.57
We can round the class width to
1
8.
Now, let's create the grouped frequency table:
| Class Interval | Frequency |
|----------------|-----------|
| 12 - 29 | 10 |
| 30 - 47 | 8 |
| 48 - 65 | 9 |
| 66 - 83 | 4 |
| 84 - 101 | 5 |
| 102 - 119 | 8 |
| 120 - 137 | 6 |
(ii) Plot the less than and more than cumulative frequency curves on the same graph from the grouped table.
This step would require plotting the points of the grouped frequency distribution, which cannot be done in text format.
(iii) Compute the geometric mean.
GM=x1x2...xnnGM = \sqrt[n]{x_1 * x_2 * ... * x_n}
For grouped data, the geometric mean is calculated as:
GM=x1f1x2f2...xkfkNGM = \sqrt[N]{x_1^{f_1} * x_2^{f_2} * ... * x_k^{f_k}}
Where xix_i is the midpoint of the class interval and fif_i is the frequency of that class. NN is the total frequency.
Class midpoints: (20.5, 38.5, 56.5, 74.5, 92.5, 110.5, 128.5)
Frequencies: (10, 8, 9, 4, 5, 8, 6)
N = 50
log(GM)=filog(xi)Nlog(GM) = \frac{\sum f_i * log(x_i)}{N}
log(GM)=10log(20.5)+8log(38.5)+9log(56.5)+4log(74.5)+5log(92.5)+8log(110.5)+6log(128.5)50log(GM) = \frac{10*log(20.5) + 8*log(38.5) + 9*log(56.5) + 4*log(74.5) + 5*log(92.5) + 8*log(110.5) + 6*log(128.5)}{50}
log(GM)=10(1.31175)+8(1.58546)+9(1.75205)+4(1.87216)+5(1.96614)+8(2.04345)+6(2.10891)50log(GM) = \frac{10*(1.31175) + 8*(1.58546) + 9*(1.75205) + 4*(1.87216) + 5*(1.96614) + 8*(2.04345) + 6*(2.10891)}{50}
log(GM)=13.1175+12.68368+15.76845+7.48864+9.8307+16.3476+12.6534650log(GM) = \frac{13.1175 + 12.68368 + 15.76845 + 7.48864 + 9.8307 + 16.3476 + 12.65346}{50}
log(GM)=87.8950log(GM) = \frac{87.89}{50}
log(GM)=1.7578log(GM) = 1.7578
GM=101.7578GM = 10^{1.7578}
GM57.25GM \approx 57.25
(iv) Compute the median.
Median =L+(N2cff)w = L + (\frac{\frac{N}{2} - cf}{f}) * w
Where LL is the lower boundary of the median class, NN is the total frequency (50), cfcf is the cumulative frequency of the class before the median class, ff is the frequency of the median class, and ww is the class width (18).
N/2 =
2
5.
Cumulative frequencies: (10, 18, 27, 31, 36, 44, 50)
The median class is 48 - 65, since the cumulative frequency just exceeds
2
5.
L=47.5L = 47.5 (lower boundary of the median class)
cf=18cf = 18 (cumulative frequency before the median class)
f=9f = 9 (frequency of the median class)
w=18w = 18
Median =47.5+(25189)18 = 47.5 + (\frac{25 - 18}{9}) * 18
Median =47.5+(79)18 = 47.5 + (\frac{7}{9}) * 18
Median =47.5+14 = 47.5 + 14
Median =61.5 = 61.5
(v) Compute the mode of the distribution.
Mode =L+(fmf12fmf1f2)w = L + (\frac{f_m - f_1}{2f_m - f_1 - f_2}) * w
Where LL is the lower boundary of the modal class, fmf_m is the frequency of the modal class, f1f_1 is the frequency of the class before the modal class, f2f_2 is the frequency of the class after the modal class, and ww is the class width.
The modal class is the class with the highest frequency.
In this case, there are several classes with the highest frequency 10, 8, 9, 4, 5, 8, 6, therefore multiple modes exist.
Since there are two classes with frequencies equal to 8, one way of approximating the mode is using the class with the overall maximum frequency. The class with the maximum frequency is the first class (12-29), therefore, we approximate that class as the mode.
L=11.5L = 11.5
fm=10f_m = 10
f1=0f_1 = 0
f2=8f_2 = 8
w=18w = 18
Mode =11.5+(10021008)18 = 11.5 + (\frac{10 - 0}{2*10 - 0 - 8}) * 18
Mode =11.5+(1012)18 = 11.5 + (\frac{10}{12}) * 18
Mode =11.5+15 = 11.5 + 15
Mode =26.5 = 26.5

3. Final Answer

(i) Grouped frequency table:
| Class Interval | Frequency |
|----------------|-----------|
| 12 - 29 | 10 |
| 30 - 47 | 8 |
| 48 - 65 | 9 |
| 66 - 83 | 4 |
| 84 - 101 | 5 |
| 102 - 119 | 8 |
| 120 - 137 | 6 |
(ii) Plot the less than and more than cumulative frequency curves on the same graph from the grouped table (graphical representation needed).
(iii) Geometric Mean: approximately 57.25
(iv) Median: 61.5
(v) Mode: 26.5

Related problems in "Probability and Statistics"

The problem provides a table of observed prices ($Y$) and available quantities ($X$) of a product in...

Regression AnalysisLinear RegressionCorrelation CoefficientCoefficient of DeterminationScatter Plot
2025/6/7

The problem provides a frequency distribution table of marks obtained by students. Part (a) requires...

ProbabilityConditional ProbabilityWithout ReplacementCombinations
2025/6/5

The problem is divided into two questions, question 10 and question 11. Question 10 is about the fre...

Frequency DistributionCumulative FrequencyOgivePercentileProbabilityConditional ProbabilityCombinations
2025/6/5

A number is selected at random from the integers 30 to 48 inclusive. We want to find the probability...

ProbabilityPrime NumbersDivisibility
2025/6/3

The problem describes a survey where 30 people answered about their favorite book genres. The result...

PercentagesData InterpretationPie ChartFractions
2025/6/1

The problem asks us to determine if there is a statistically significant difference in promotion rat...

Hypothesis TestingChi-Square TestContingency TableStatistical SignificanceIndependence
2025/6/1

We are given a contingency table showing the number of students from different majors (Psychology, B...

Chi-Square TestContingency TableStatistical InferenceHypothesis Testing
2025/6/1

The problem describes a scenario where a pizza company wants to determine if the number of different...

Chi-Square TestGoodness-of-Fit TestHypothesis TestingFrequency DistributionP-value
2025/6/1

The problem asks to test the significance of three chi-square tests given the sample size $N$, numbe...

Chi-square testStatistical SignificanceDegrees of FreedomEffect SizeCramer's VHypothesis Testing
2025/5/29

The problem asks us to compute the expected frequencies for the given contingency table. The conting...

Contingency TableExpected FrequenciesChi-squared Test
2025/5/29