2025/4/13
1. Problem Description
The problem provides a frequency distribution of companies in the automotive sector based on their revenue (in millions of euros). The revenue is divided into intervals. The goal is to answer a series of statistical questions about this data.
The table is as follows:
| Revenue (millions of euros) | Number of Companies |
|-----------------------------|-----------------------|
| [0; 0.25[ | 137 |
| [0.25; 0.5[ | 106 |
| [0.5; 1[ | 112 |
| [1; 2.5[ | 154 |
| [2.5; 5[ | 100 |
| [5; 10[ | 33 |
The tasks are:
1. Identify the observed characteristic and its nature.
2. Construct a histogram, frequency polygon, and cumulative frequency curve.
3. Determine the modal class, median, quartiles Q1 and Q3, and deciles D1 and D9.
4. Calculate the centered moments of order 2, 3, and 4, the Fisher skewness coefficient, and the Pearson kurtosis coefficient.
5. Calculate the mediale and the concentration range.
6. Calculate the concentration index and plot the concentration curve (interpret).
2. Solution Steps
1. Observed Characteristic:
The observed characteristic is the revenue (chiffre d'affaires) of the companies. Its nature is quantitative and continuous (since revenue can take any value within a certain range).
2. Histogram, Frequency Polygon, and Cumulative Frequency Curve:
Constructing these requires plotting the data.
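- Histogram: Rectangles are drawn with bases corresponding to the class intervals. Because the class widths are unequal, the heights must be proportional to the frequency densities (frequency divided by class width), so that the area of each rectangle is proportional to its frequency.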
- Frequency Polygon: Connect the midpoints of the tops of the histogram rectangles.
- Cumulative Frequency Curve: Plot the cumulative frequencies against the upper limits of the class intervals, starting from 0 at the lower limit of the first class, and connect the points with line segments (this matches the linear interpolation within classes used to estimate the median). The cumulative frequencies are:
- Below 0.25: 137
- Below 0.5: 137 + 106 = 243
- Below 1: 243 + 112 = 355
- Below 2.5: 355 + 154 = 509
- Below 5: 509 + 100 = 609
- Below 10: 609 + 33 = 642
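The running totals above, together with the frequency densities needed for a histogram with unequal class widths, can be checked with a few lines of Python. This is a minimal sketch; the names `bounds`, `counts`, `cumulative`, and `densities` are illustrative and do not come from the original problem.

```python
from itertools import accumulate

# Class limits (millions of euros) and frequencies, copied from the table.
bounds = [0, 0.25, 0.5, 1, 2.5, 5, 10]
counts = [137, 106, 112, 154, 100, 33]

# Cumulative frequency reached at each upper class limit.
cumulative = list(accumulate(counts))

# Frequency density = frequency / class width. With unequal class widths,
# this is the quantity the histogram bar heights should represent.
widths = [hi - lo for lo, hi in zip(bounds, bounds[1:])]
densities = [n / w for n, w in zip(counts, widths)]

for upper, cum, dens in zip(bounds[1:], cumulative, densities):
    print(f"below {upper}: {cum} companies, density {dens:.1f} per million euros")
```

The pairs `(upper limit, cumulative frequency)` are the points of the cumulative frequency curve, preceded by the starting point (0, 0).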
3. Modal Class, Median, Quartiles, and Deciles:
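Total number of companies, $N = 642$.
- Modal Class: The class with the highest frequency is $[1; 2.5[$, with a frequency of 154. Because the class widths are unequal, the comparison should strictly be made on frequency density (frequency divided by class width); on that basis the densest class is $[0; 0.25[$, with $137 / 0.25 = 548$ companies per million euros.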
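- Median: The median is the value that divides the distribution into two equal halves. It is the value corresponding to the $N/2$-th observation: $N/2 = 321$. The median class is the class containing the 321st observation, which is $[0.5; 1[$. To estimate the median, we use linear interpolation within the median class: $Me = L + \frac{N/2 - F}{f} \times w$, where $L$ is the lower limit of the class, $F$ the cumulative frequency before the class, $f$ the class frequency, and $w$ the class width. With $L = 0.5$, $F = 243$, $f = 112$, $w = 0.5$: $Me = 0.5 + \frac{321 - 243}{112} \times 0.5 \approx 0.8482$.
- Q1 (First Quartile): The first quartile is the value corresponding to the $N/4$-th observation: $N/4 = 160.5$. The Q1 class is the class containing position 160.5, which is $[0.25; 0.5[$. With $L = 0.25$, $F = 137$, $f = 106$, $w = 0.25$: $Q_1 = 0.25 + \frac{160.5 - 137}{106} \times 0.25 \approx 0.3054$.
- Q3 (Third Quartile): The third quartile is the value corresponding to the $3N/4$-th observation: $3N/4 = 481.5$. The Q3 class is the class containing position 481.5, which is $[1; 2.5[$. With $L = 1$, $F = 355$, $f = 154$, $w = 1.5$: $Q_3 = 1 + \frac{481.5 - 355}{154} \times 1.5 \approx 2.2321$.
- D1 (First Decile): The first decile is the value corresponding to the $N/10$-th observation: $N/10 = 64.2$. The D1 class is the class containing position 64.2, which is $[0; 0.25[$. With $L = 0$, $F = 0$, $f = 137$, $w = 0.25$: $D_1 = 0 + \frac{64.2 - 0}{137} \times 0.25 \approx 0.1172$.
- D9 (Ninth Decile): The ninth decile is the value corresponding to the $9N/10$-th observation: $9N/10 = 577.8$. The D9 class is the class containing position 577.8, which is $[2.5; 5[$. With $L = 2.5$, $F = 509$, $f = 100$, $w = 2.5$: $D_9 = 2.5 + \frac{577.8 - 509}{100} \times 2.5 = 4.22$.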
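The interpolation used for the median, quartiles, and deciles follows one pattern, so it can be sketched as a single function. The function name `grouped_quantile` and its signature are hypothetical, chosen only for this illustration; it assumes observations are spread uniformly within each class.

```python
# Class limits (millions of euros) and frequencies, copied from the table.
bounds = [0, 0.25, 0.5, 1, 2.5, 5, 10]
counts = [137, 106, 112, 154, 100, 33]

def grouped_quantile(p, bounds, counts):
    """Estimate the quantile of order p (0 < p < 1) from grouped data
    by linear interpolation inside the class that contains position p*N."""
    target = p * sum(counts)
    before = 0  # cumulative frequency before the current class (F)
    for lower, upper, n in zip(bounds, bounds[1:], counts):
        if before + n >= target:
            # L + (p*N - F) / f * w
            return lower + (target - before) / n * (upper - lower)
        before += n
    return bounds[-1]

for name, p in [("D1", 0.1), ("Q1", 0.25), ("Median", 0.5), ("Q3", 0.75), ("D9", 0.9)]:
    print(name, round(grouped_quantile(p, bounds, counts), 4))
```

Writing the rule once makes it harder to pick the wrong class frequency by hand, which is the easiest slip in this kind of exercise.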
4. Centered Moments, Skewness, and Kurtosis:
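Calculating these requires finding the mean first, using the class midpoints $c_i$ as representative values: $\bar{x} = \frac{1}{N} \sum n_i c_i$. The centered moment of order $k$ is then $\mu_k = \frac{1}{N} \sum n_i (c_i - \bar{x})^k$, computed from the deviations from the mean. The Fisher skewness coefficient is $\gamma_1 = \mu_3 / \mu_2^{3/2}$ and the Pearson kurtosis coefficient is $\beta_2 = \mu_4 / \mu_2^2$. The arithmetic is tedious by hand and is best done with a calculator or a short script.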
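As a rough sketch of that computation, the snippet below evaluates the mean, the centered moments, and the two shape coefficients from the class midpoints. It assumes each class is represented by its midpoint, which is a common convention for grouped data but is not stated in the problem; the helper `central_moment` is a hypothetical name.

```python
# Class midpoints (millions of euros) and frequencies from the table.
midpoints = [0.125, 0.375, 0.75, 1.75, 3.75, 7.5]
counts = [137, 106, 112, 154, 100, 33]

total = sum(counts)
mean = sum(n * c for n, c in zip(counts, midpoints)) / total

def central_moment(k):
    """Centered moment of order k, weighted by the class frequencies."""
    return sum(n * (c - mean) ** k for n, c in zip(counts, midpoints)) / total

mu2 = central_moment(2)  # variance
mu3 = central_moment(3)
mu4 = central_moment(4)

fisher_skewness = mu3 / mu2 ** 1.5   # positive: long tail to the right
pearson_kurtosis = mu4 / mu2 ** 2    # compare with 3 (normal distribution)

print(f"mean = {mean:.4f}, mu2 = {mu2:.4f}, mu3 = {mu3:.4f}, mu4 = {mu4:.4f}")
print(f"skewness = {fisher_skewness:.4f}, kurtosis = {pearson_kurtosis:.4f}")
```

Under the midpoint assumption the skewness comes out positive and the kurtosis above 3, which is consistent with many small companies and a few large ones.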
5. Mediale and Concentration Range:
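The mediale is the value such that half of the total revenue is obtained by companies with revenue below this value and the other half by companies with revenue above. The total revenue is estimated as $\sum n_i c_i$, where $c_i$ is the midpoint of class $i$. The mediale is then found by accumulating the class revenues $n_i c_i$ and interpolating linearly within the class where the cumulative revenue reaches half of the total. The concentration range is commonly taken as the difference between the mediale and the median, $\Delta M = Ml - Me$; the larger it is relative to the range of the data, the stronger the concentration.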
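The mediale follows the same interpolation idea as the median, with class revenues in place of class frequencies. The sketch below assumes class midpoints stand in for individual revenues and that revenue is spread uniformly within each class; `interpolated_threshold` is a hypothetical helper name, and the gap `mediale - median` reflects the usual definition of the concentration range rather than anything given in the problem.

```python
# Class limits (millions of euros) and frequencies, copied from the table.
bounds = [0, 0.25, 0.5, 1, 2.5, 5, 10]
counts = [137, 106, 112, 154, 100, 33]

# Estimated revenue of each class: frequency times class midpoint.
midpoints = [(lo + hi) / 2 for lo, hi in zip(bounds, bounds[1:])]
revenues = [n * c for n, c in zip(counts, midpoints)]

def interpolated_threshold(bounds, weights, share=0.5):
    """Value below which `share` of the total weight lies,
    using linear interpolation inside the class where it is reached."""
    target = share * sum(weights)
    before = 0.0
    for lower, upper, w in zip(bounds, bounds[1:], weights):
        if before + w >= target:
            return lower + (target - before) / w * (upper - lower)
        before += w
    return bounds[-1]

median = interpolated_threshold(bounds, counts)     # weights = frequencies
mediale = interpolated_threshold(bounds, revenues)  # weights = revenues
concentration_range = mediale - median

print(f"total revenue = {sum(revenues):.3f} million euros")
print(f"median = {median:.4f}, mediale = {mediale:.4f}, gap = {concentration_range:.4f}")
```

With these assumptions the mediale lies well above the median, meaning a minority of larger companies accounts for half of the estimated revenue.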
6. Concentration Index and Curve:
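The Lorenz curve plots the cumulative percentage of total revenue earned ($q_i$) against the cumulative percentage of the number of companies ($p_i$), with class revenues estimated from the class midpoints. The Gini index is the area between the Lorenz curve and the line of perfect equality (the 45-degree line) divided by the total area under the line of perfect equality, which is $1/2$. With the curve drawn as line segments between the points $(p_i, q_i)$, this gives $G = 1 - \sum_i (p_i - p_{i-1})(q_i + q_{i-1})$, with $p_0 = q_0 = 0$. A value near 0 indicates revenue spread evenly across companies; a value near 1 indicates strong concentration.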
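The Lorenz points and a trapezoid-rule estimate of the Gini index can be sketched as follows. This assumes class midpoints represent the revenues in each class and that the Lorenz curve is piecewise linear between the computed points; the variable names are illustrative only.

```python
from itertools import accumulate

# Class midpoints (millions of euros) and frequencies from the table.
midpoints = [0.125, 0.375, 0.75, 1.75, 3.75, 7.5]
counts = [137, 106, 112, 154, 100, 33]
revenues = [n * c for n, c in zip(counts, midpoints)]

# Lorenz curve points: cumulative share of companies (p) against
# cumulative share of revenue (q), both starting at 0.
p = [0.0] + [x / sum(counts) for x in accumulate(counts)]
q = [0.0] + [x / sum(revenues) for x in accumulate(revenues)]

# Trapezoid rule: the area under the Lorenz curve is
# sum((p_i - p_{i-1}) * (q_i + q_{i-1})) / 2, and G = 1 - 2 * area.
gini = 1 - sum((p[i] - p[i - 1]) * (q[i] + q[i - 1]) for i in range(1, len(p)))

for pi, qi in zip(p, q):
    print(f"{pi:.4f} of companies hold {qi:.4f} of revenue")
print(f"Gini index = {gini:.4f}")
```

The `(p, q)` pairs can be plotted directly against the diagonal from (0, 0) to (1, 1) to draw the concentration curve.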
3. Final Answer
1. The observed characteristic is the revenue (chiffre d'affaires) of the companies. Its nature is quantitative and continuous.
2. The histogram, frequency polygon, and cumulative frequency curve require graphing based on the data provided.
3. Modal Class: $[1; 2.5[$ by raw frequency ($[0; 0.25[$ by frequency density, since the class widths are unequal). Median: approximately 0.8482. Q1: approximately 0.3054. Q3: approximately 2.2321. D1: approximately 0.1172. D9: approximately 4.22.