We are given a table showing the distribution of salaries (in thousands of FCFA) of 210 employees. The table has two columns: salary range and the number of employees in that range. The problem asks us to perform several statistical analyses on this data, including finding the modal class, quartiles, mean, variance, standard deviation, coefficient of variation, and Yule's coefficient of skewness. It also asks how the mean, variance, and standard deviation change if the salaries increase by a fixed amount or by a percentage. Finally, it asks about concentration measures such as the mediale, concentration range and Gini index, and the effects of a salary increase on them.
Probability and StatisticsDescriptive StatisticsMeanVarianceStandard DeviationQuartilesModal ClassCoefficient of VariationSkewnessGini IndexMedialeConcentration RangeLorenz Curve
2025/4/24
1. Problem Description
We are given a table showing the distribution of salaries (in thousands of FCFA) of 210 employees. The table has two columns: salary range and the number of employees in that range. The problem asks us to perform several statistical analyses on this data, including finding the modal class, quartiles, mean, variance, standard deviation, coefficient of variation, and Yule's coefficient of skewness. It also asks how the mean, variance, and standard deviation change if the salaries increase by a fixed amount or by a percentage. Finally, it asks about concentration measures such as the mediale, concentration range and Gini index, and the effects of a salary increase on them.
2. Solution Steps
Let's address the questions one by one.
1. Determining the Modal Class
The modal class is the salary range with the highest frequency (number of employees). In the given data, the highest frequency is 60, which corresponds to the salary range .
Interpretation: The most common salary range for the employees is between 150,000 FCFA and 220,000 FCFA.
2. Calculating Quartiles
We will use linear interpolation to calculate the quartiles.
* Q1 (First Quartile): 25% of the data falls below Q
1. $0.25 * 210 = 52.5$. The cumulative frequency before the interval $[125, 150[$ is
1
4. The cumulative frequency of interval $[125, 150[$ is $14 + 38 = 52$. The cumulative frequency before the interval $[150,220[$ is
5
2. Therefore Q1 lies in the interval $[150,220[$.
Where:
is the lower limit of the quartile class = 150
is the total number of observations = 210
is the cumulative frequency of the class before the quartile class = 52
is the frequency of the quartile class = 60
is the class width =
* Q2 (Second Quartile, Median): 50% of the data falls below Q
2. $0.50 * 210 = 105$. The cumulative frequency before the interval $[150, 220[$ is $14 + 38 = 52$. The cumulative frequency of interval $[150,220[$ is $52 + 60 = 112$. Therefore Q2 lies in the interval $[150,220[$.
Where:
is the lower limit of the quartile class = 150
is the total number of observations = 210
is the cumulative frequency of the class before the quartile class = 52
is the frequency of the quartile class = 60
is the class width =
* Q3 (Third Quartile): 75% of the data falls below Q
3. $0.75 * 210 = 157.5$. The cumulative frequency before the interval $[220, 300[$ is $14 + 38 + 60 = 112$. The cumulative frequency of interval $[220,300[$ is $112+27 = 139$. The cumulative frequency before the interval $[300,350[$ is $112 + 27 = 139$. The cumulative frequency of interval $[300,350[$ is $139+33 = 172$. Therefore Q3 lies in the interval $[300,350[$.
Where:
is the lower limit of the quartile class = 300
is the total number of observations = 210
is the cumulative frequency of the class before the quartile class = 139
is the frequency of the quartile class = 33
is the class width =
3. Calculate the arithmetic mean, interquartile range, variance, standard deviation, and coefficient of variation.
First calculate the midpoints of the intervals, we have these midpoints: 80, 137.5, 185, 260, 325, 387.5, 512.
5. The frequencies are: 14, 38, 60, 27, 33, 20,
1
8.
Mean ():
Interquartile Range (IQR):
Variance ():
Standard Deviation ():
Coefficient of Variation (CV):
4. If the salaries increase monthly by 25,000 FCFA (25 in thousands) or by 10%.
If the salaries increase by a fixed amount (25):
* Mean: The mean will increase by
2
5. New mean = $248.405 + 25 = 273.405$.
* Variance: The variance will remain unchanged since the spread of the data is the same. .
* Standard Deviation: The standard deviation will also remain unchanged. .
If the salaries increase by 10%:
* Mean: The mean will increase by 10%. New mean = .
* Variance: The variance will increase by . New variance =
* Standard Deviation: The standard deviation will increase by 10%. New standard deviation =
5. Dissymmetry of the frequency polygon and Yule's coefficient of skewness.
Yule's coefficient of skewness (Q):
Interpretation: Since the coefficient is positive, the distribution is positively skewed (skewed to the right).
6. Concentration of Salaries
a. Calculate the mediale.
To find the mediale, first calculate the total income. Then find the income level where half the total income is earned.
Total income = (calculated previously)
Half of total income = .
We need to determine which interval contains the mediale. We calculate cumulative income.
* : . Cumulative income = 1120
* : . Cumulative income =
* : . Cumulative income =
* : . Cumulative income =
* : . Cumulative income =
The mediale lies in the interval .
We can estimate the mediale by linear interpolation.
where L = 300, cf = 24465, f = 33 and w = 50, the median income =
b. Calculate the concentration range.
The concentration range spans from the median income to the maximum income. The median is estimated at previously calculated. The range extends to
6
0
0.
c. Construct the Lorenz curve and calculate the Gini index using the triangle method.
To calculate the Gini index using the triangle method, we need to calculate the cumulative percentage of employees and the cumulative percentage of income.
| Salary Range | Employees | % Employees | Cumulative % Employees | Income Midpoint | Income | Cumulative Income | % Income | Cumulative % Income |
|--------------|-----------|---------------|--------------------------|-----------------|--------|-------------------|-----------|-----------------------|
| [35, 125[ | 14 | 6.67% | 6.67% | 80 | 1120 | 1120 | 2.15% | 2.15% |
| [125, 150[ | 38 | 18.10% | 24.76% | 137.5 | 5225 | 6345 | 10.02% | 12.16% |
| [150, 220[ | 60 | 28.57% | 53.33% | 185 | 11100 | 17445 | 21.28% | 33.44% |
| [220, 300[ | 27 | 12.86% | 66.19% | 260 | 7020 | 24465 | 13.46% | 46.90% |
| [300, 350[ | 33 | 15.71% | 81.90% | 325 | 10725 | 35190 | 20.56% | 67.46% |
| [350, 425[ | 20 | 9.52% | 91.43% | 387.5 | 7750 | 42940 | 14.86% | 82.32% |
| [425, 600[ | 18 | 8.57% | 100.00% | 512.5 | 9225 | 52165 | 17.68% | 100.00% |
The Gini index is approximately the area between the line of perfect equality and the Lorenz curve, divided by the total area under the line of perfect equality. We can approximate this area using the trapezoidal rule. Since our data represents cumulative percentage, the line of perfect equality is a straight line going from (0,0) to (100,100). Thus the area under the line of perfect equality is 0.
5. $Gini= 1 - \sum_{i=1}^n (X_i - X_{i-1})(Y_i + Y_{i-1})$, where the Xi are the cumulative percentage of employees and Yi are cumulative percentage income. The first term is 0, and we set the value of X_0 and Y_0 to
0. $Gini = 1 - [0.0667*(0.0215) +0.2476 - 0.0667)*(0.1216+0.0215)/2 + (0.5333-0.2476)*(0.3344+0.1216)/2 + (0.6619 - 0.5333)*(0.4690+0.3344)/2 +(0.8190-0.6619)*(0.6746+0.4690)/2+ (0.9143-0.8190)*(0.8232+0.6746)/2+ (1-0.9143)(1+0.8232)/2] = 0.2707$.
d. Influence of a 15% salary increase on the Gini index.
If all salaries increase by 15%, the Gini index will remain approximately the same. A proportional increase in all salaries does not change the relative distribution of income. The Gini index reflects the relative inequality.
3. Final Answer
1. Modal Class: $[150, 220[$ (thousands of FCFA)
2. Q1: 150.583, Q2: 211.833, Q3: 328.03 (thousands of FCFA)
3. Mean: 248.405, IQR: 177.447, Variance: 14019.35, Standard Deviation: 118.403, CV: 47.66% (thousands of FCFA)
4. If salaries increase by 25,000 FCFA: new mean: 273.405, variance and standard deviation are unchanged. If salaries increase by 10%: new mean: 273.2455, new variance: 16963.4135, new standard deviation: 130.2433 (thousands of FCFA)
5. Yule's Coefficient of Skewness: 0.3096
6. a. Mediale: 349.
0
1
5. b. Concentration range: from the median ($211.833$) to
6
0