Mesastres of Cental Tendency
Measures of central tendency are statistical techniques used to find the center of distributions.
They are also known as statistical averages.
The three main measures of central tendency are:
Mean: The arithmetic average of all numbers.
Median: The middle number in a sequence of numbers.
Mode: The number that appears most frequently in a data set.
The number denoting the central tendency is the representative figure for the entire data set.
It is the point about which items have a tendency to cluster.
======
Mean
Mean is a type of average that equally weighs all values in a dataset.
For ungrouped data, the individual values are directly summed and divided by the count of the data points.
For grouped data, the midpoint of each group (x-bar) is multiplied by the frequency of that group, and the results are summed and divided by the total count.
Mean can be calculated using the direct method, where the sum of the values is divided by the count.
Mean can also be calculated using the indirect method, where the deviations from the mean are calculated, summed, and divided by the count, then multiplied by -1 and added to the mean’s sum.
======
Computing Mean from Ungrouped Data
The text compares the Direct and Indirect methods of calculating the mean.
The Direct Method involves adding all the observations and dividing by the number of observations.
The Indirect Method requires selecting a constant (assumed mean), subtracting it from each observation, and then calculating the mean of the deviations.
The formula for the Indirect Method is $\overline{X} = A + \frac{\sum d}{N}$.
In the given example, the Direct Method results in a mean of 926.29 mm, which is the same as the mean obtained using the Indirect Method (with 800 as the assumed mean).
======
Computing Mean from Grouped Data
The text presents the calculation of the mean for grouped data using both the direct and indirect methods.
The direct method involves multiplying each midpoint (X) by its frequency (f) and then dividing the sum of these products by N, the total number of data points.
The indirect method uses an assumed mean group with a midpoint (A) to calculate the deviation (d) of each midpoint from A. The mean is then calculated using the formula: mean = A ± ∑fd/N.
The text includes a table with data for 99 workers, which is used to demonstrate the calculation of the mean using both methods, resulting in a mean of 102.6.
The indirect method can be used with both equal and unequal class intervals.
The formula for the sum of fx, ∑fd, and the total number of data points, N, are also provided in the text.
======
Median
Median is a positional average.
It is defined as the point in a distribution with an equal number of cases on each side of it.
The Median is represented by the symbol M.
It does not require the data to be in a particular order for calculation.
Median is a better measure than mean when data is skewed.
======
Computing Median for Ungrouped Data
The median for ungrouped data is computed by finding the central observation in a arranged series.
The central value can be located from either end of the arranged series.
The formula used to compute the median is Value of $\left(\frac{\mathrm{N}+1}{2}\right)$ th item.
The series is arranged in ascending or descending order and the central value is determined.
For example, the median height of mountain peaks in parts of the Himalayas is 8,172 m.
======
Computing Median for Grouped Data
The text provides a detailed process for calculating the median of a set of data.
The steps include setting up a frequency table, calculating cumulative frequencies, determining the median number, identifying the median class, and finally calculating the median using the formula: M = l + i/f(m - c).
The median class is the class interval that contains the median number.
The median is calculated to be 82.5 in this example.
The formula for calculating the median is: M = N/2, where N is the number of values in the data set.
======
Mode
Mode is a measure of central tendency, represented as Z or M0.
It is the most frequently occurring value in a distribution.
Mode is less commonly used than mean and median.
There can be more than one mode in a data set, known as multimodal distribution.
Mode is not affected by extreme values or outliers.
======