Estimation
Data: 10, 12, 23, 16, 17, 20, 15
Range
- Ungrouped Data: \(R = X_H - X_L\)
- Grouped Data: \(R = L_n - L_1\)
Mean Deviation (MD)
Around mean, \(\displaystyle MD(\bar x)=\frac{1}{n}{\sum_{i=1}^{n} |x_i - \bar{x}|}\)
- For grouped data, \(\displaystyle MD(\bar x) = \frac{1}{N}{\sum_{i=1}^{n} f_i|x_i - \bar{x}|}\)
- Around Median, \(\rightarrow MD (Me) = ?\)
- Around Mode, \(\rightarrow MD (Mo) = ?\)
Think!
What if?
\(\displaystyle MD(\bar x)=\frac{1}{n}{\sum_{i=1}^{n} (x_i - \bar{x})}\)
Grouped Data
“Classified” is not a proper term
| 10–19 |
5 |
| 20–29 |
12 |
| 30–39 |
18 |
| 40–49 |
10 |
| 50–59 |
6 |
Variance and Standard Deviation
Variance, \(\displaystyle \sigma^2 = \frac{1}{n} \sum_{i=1}^n (x_i - \bar{x})^2\)
- Also, \(\displaystyle \sigma^2 = \frac{\sum x_i^2}{n} - \left(\frac{\sum x_i}{n}\right)^2\)
- Mean of square - square of mean
- \(\bar {x^2} - (\bar x)^2\)
- SD = \(\sqrt{Variance}\)
- Why squared in the first place?
Variance for Grouped Data
??
- \(\displaystyle \sigma^2 = \frac1n{\sum_{i=1}^{n} f_i (x_i - \bar x)^2}\) or
- \[\displaystyle \sigma^2 = \frac{\sum f_i x_i^2}{n} - \left(\frac{\sum f_i x_i}{n}\right)^2\]
- Create just two columns for calculation (\(f_ix_i\) and \(f_ix_i^2)\)
Find Variance
| 10–20 |
15 |
5 |
|
|
| 20–30 |
25 |
8 |
|
|
| 30–40 |
35 |
7 |
|
|
Coefficients
- Coefficient of Range, \(CR=\frac{X_H-X_L}{X_H+X_L}\times 100\)
- Coefficient of Mean Deviation, \(CMD(\bar x) = \frac{MD(\bar x)}{\bar x}\)
- Coefficient of Quartile Deviation, \(QD = \frac{Q_3-Q_1}{Q_3+Q_1} \times 100\)
- Coefficient of Variance (Best), \(CV = \frac{\sigma}{\bar x} \times 100\)
Quartiles
Recall Median Formula
- When \(n\) is odd, \(Me = \frac{n+1}{2}th\) term
- When \(n\) is even, \(Me = \frac{\left(\frac{n}{2}\right)^{\text{th}} \text{ term} + \left(\frac{n}{2} + 1\right)^{\text{th}} \text{ term}}{2}\)
- Find median: 10, 20, 13
- What now: 10, 20, 13, 11
- Median score of 50 students is 75; what does it mean?
- Quartiles divide the data into 4 equal parts
Estimate CV
Given \(\sigma = 4, \bar x = 10\)
- \(CV = \frac{\sigma}{\bar x}\times 100\)
- \(\frac{4}{10}\times 100\)
Who is better?
Marks in 5 subjects out of 30
| A |
20 |
3.2 |
|
| B |
20 |
8.5 |
|
| C |
25 |
3.2 |
|
\(CV = \frac{\sigma}{\bar x}\times 100\)