
Three Meanings
- Plural of statistic
- Table of data
- Methodology
Takes a sample from a population.

There are many sampling techniques.

- Identification of unwanted spam messages in e-mail
- Segmentation of customer behavior for targeted advertising
- Forecasts of weather behavior and long-term climate changes
- Prediction of popular election outcomes
- Development of algorithms for auto-piloting drones and self-driving cars
- Optimization of energy use in homes and office buildings
- Projection of areas where criminal activity is most likely
- Discovery of genetic sequences linked to diseases
THANK YOU!
- Definition
- Population and sample
- Variable and its types
- Scale of measurement
- Use of summation sign
- Summation theorems
- Mathematical Problems
Coxton and Crowden
Statistics may be defined as the science of collection, presentation, analysis and interpretation of numerical data.
Population: A set of similar items or events which is of interest Sample: Any subset of population

Examples
Group/collection of –
- students in a classroom
- stars in the sky
- possible outcomes when flipping a coin repeatedly
- books in a library
- water molecules in the ocean
- employees in a company
- \(\pi\)
- Name of students
- Result of a die throw
- Income of a regular employee
- Income of a freelancer
- Any unchanging number, e.g, \(\pi\)
- Result of a die throw
- Father’s name
- Mark of a subject
- GPA of a student
- Qualitative \(\rightarrow\) numeric
- Quantitative \(\rightarrow\) non-numeric
- Discrete: Limited and pre-specified
- Continuous: Can take on any values between any two given number
Describes nature of information within the values.
- Nominal: Name of Insignificant number, e.g., color, Street no.,
- Ordinal: Order matters, e.g., rating
- Interval: Zero may not be zero, like temperature, IQ
- Ratio: Zero is 0; most variables fall in this category
Match as per suitable scale
| Movie Rating | Scale |
|---|---|
| Poor, bad, good, excellent | ratio |
| In a scale of -10 to 10: -10, -2, 0, 5, 10 | interval |
| Awesome, Amazing, Mind-blowing, Stunning | nominal |
| In a scale of 0 to 10: 0, 5, 8, 10 | ordinal |
Multiplication/division cannot be carried out
- Temperature (Celsius scale)
- Dates (AD)
- Location in Cartesian coordinates
- Direction measured in degrees
For more see here and Interval Scale discussion on Stat Mania

Say we have values, \(x_1, x_2, \cdot \cdot \cdot , x_n\)
- Origin shift: Adding/Subtracting
- \(y_1 = x_1-a \space or \space x_1+a\)
- Scale shift: Multiplying/Division
- \(y_1 = b \cdot x_1 \space or \space x_1/b\)
- both: \(y_i = \frac{x_i-a}{b}\)
- Why might we need it?
- \[x_1 + x_2 + x_3 + x_4 = \sum_{i=1}^4 x_i\]
- \[x_1 + x_2 + ... x_n = \sum_{i=1}^n x_i\]
- \[x_1 + x_2 + ... x_{10} = ?\]
\[\sum_{i=1}^n bx_i=b \sum_{i=1}^n x_i\]
\[\sum_{i=1}^n (ax_i-b)=a \sum_{i=1}^n x_i-nb\]
- \(\sum_{i=1}^n a = na\)
- Can you prove it?
\[\sum_{i=1}^n (ax_i^2-bx_i+c)=a\sum_{i=1}^n x_i^2-b\sum_{i=1}^n x_i + nc\]
\[\sum_{i=1}^n (ax_i-by_i)=a\sum_{i=1}^n x_i - b \sum_{i=1}^n y_i\]
\[\sum_{i=1}^n (ax_i-b)^2=a^2 \sum_{i=1}^n x_i^2 - 2ab \sum_{i=1}^n x_i + nb^2\]
\[(\sum_{i=1}^n x_i)^2=\sum_{i=1}^n x_i^2 + \sum_{i \ne j}^n\sum x_ix_j\]
\[\prod_{i=1}^k x_i = x_1 \times x_2 \times \cdot \cdot \cdot \times x_n\]
\[\prod_{i=1}^k x_iy_i = (\prod_{i=1}^k x_i)(\prod_{i=1}^k y_i)\]
Expand \(\displaystyle \sum_{i=1}^3 \sum_{j=1}^4 x_iy_j\)
| X | 20 | 25 | 15 |
|---|---|---|---|
| Y | 15 | 30 | 20 |
- \(20 \times 15 + 20 \times 30 + 20 \times 20 \rightarrow x_1 \times y_j\)
- \(25 \times 15 + 25 \times 30 + 25 \times 20 \rightarrow x_2 \times y_j\)
- \(15 \times 15 + 15 \times 30 + 15 \times 20 \rightarrow x_3 \times y_j\)
- Row to row \(\rightarrow x\) varies but \(y_j\) is constant
A company sells 3 products (P1, P2, P3) in 4 cities (C1, C2, C3, C4). The monthly sales (in thousands of dollars) are:
| P1 | P2 | P3 | |
|---|---|---|---|
| C1 | 5 | 8 | 6 |
| C2 | 7 | 4 | 9 |
| C3 | 3 | 5 | 2 |
| C4 | 6 | 7 | 4 |
\(\displaystyle \sum_{i=1}^m \sum_{j=1}^n (x_i+y_j)=n\sum_{i=1}^m x_i + m \sum_{i=1}^n y_j\)
- \(\displaystyle \sum_{i=1}^m (x_i+y_1+x_i+y_2+\cdots+x_i+y_n)\)
- \(\displaystyle \sum_{i=1}^m \{(x_i+x_i+\cdots \text{up to n})+(y_1+y_2+\cdots+y_n)\)
- \(\displaystyle \sum_{i=1}^m(nx_i+\sum_{j=1}^ny_j)\)
- \(\displaystyle (nx_1+\sum_{j=1}^ny_j+nx_2+ \sum_{j=1}^ny_j)+\cdots+nx_m+\sum_{j=1}^ny_j))\)
- \(\displaystyle n\sum_{i=1}^m x_i+m\sum_{j=1}^ny_j\)
\(\displaystyle \sum_{i=1}^m \sum_{i=1}^n x_iy_j=(\sum_{i=1}^n x_i) (\sum_{i=1}^n y_j)\)
- \(\displaystyle \sum_{i=1}^m (x_iy_1+x_iy_2+\cdots+x_iy_n)\)
- \(\displaystyle \sum_{i=1}^m x_i(y_1+y_2+\cdots+y_n)\)
- \(\displaystyle \sum_{i=1}^m x_i \sum_{i=1}^n y_i\)
Given
\(f_1=2, f_2 = 4, f_3 = 6\)
\(x_1 = -3, x_2 =7, x_3 = 4\)
Find the values of
where \(\sum_{i=1}^{10} x_i = 20\)
- Discrete vs continuous variable
- Prove \[\sum_{i=1}^k abx_i = ab \sum_{i=1}^k x_i\]
Prove \[\prod_{i=1}^n c =c^n\]
\[\sum_{i=1}^{10} (x_i-4)\] where \[\sum_{i=1}^{10}=20\]
\(\displaystyle \sum_{i=1}^{10} x_i = 20, \sum_{i=1}^{10} x_i^2 = 400\)
Find the value of \(\displaystyle \sum_{i=1}^{10} (x_i^2+5x_i+10)\)
| X | 20 | 25 |
|---|---|---|
| Y | 15 | 30 |
| Income (x) | 120 | 130 | 88 | 150 | 175 | 144 | 180 | 200 | 160 | 155 |
|---|---|---|---|---|---|---|---|---|---|---|
| Expense (y) | 80 | 120 | 70 | 100 | 160 | 114 | 170 | 195 | 140 | 131 |
Prove
- \(\displaystyle \sum_{i=1}^{10} \sum_{j=1}^{10}x_iy_j=(\sum_{i=1}^{10}x_i)(\sum_{j=1}^{10}y_j)\)$
- \(\displaystyle \sum_{i=1}^{10} \sum_{j=1}^{10}(x_i-y_j)=10 \times \sum_{i=1}^{10}x_i- 10 \times \sum_{j=1}^{10}y_j\)
- \(\displaystyle \sum_{i=1}^{10}x_iy_i \ne (\sum_{i=1}^{10}x_i)(\sum_{i=1}^{10}y_j)\)

www.statmania.info | Abdullah Al Mahmud | Press space or arrow to change slides