Quartiles and Standard Deviation

Quartiles and Standard Deviation

Quartiles

  • Quartiles are values that divide a dataset into four equal parts, or quarters.
  • The first quartile (Q1), also known as the lower quartile, is the value below which 25% of the data falls.
  • The second quartile (Q2) is the median, below which 50% of the data falls. It also divides the lower and upper quartiles.
  • The third quartile (Q3), also known as the upper quartile, is the value below which 75% of the data falls.
  • Quartiles can help give a clearer picture of the distribution of a dataset, especially when you’re dealing with large datasets.
  • The interquartile range (IQR) is the range within which the middle 50% of a dataset falls, and is calculated as Q3 - Q1.
  • The IQR is a useful tool for spotting outliers. Data points that fall more than 1.5 times the IQR below Q1 or above Q3 are considered outliers.

Standard Deviation

  • The standard deviation is a measure of the amount of variation or dispersion within a set of values.
  • A low standard deviation means that the values tend to be close to the mean of the set, while a high standard deviation indicates that the values are spread out over a wider range.
  • To calculate the standard deviation, first calculate the mean of the dataset.
  • Then, subtract the mean from each number in the dataset and square the result. This gives you a list of ‘squared differences’.
  • The mean of these squared differences is known as the variance.
  • The standard deviation is the square root of the variance.
  • Understanding standard deviation can give valuable insight into how consistent or reliable a set of data is. It is used in a range of fields, including finance, engineering, and physics.
  • The symbol for standard deviation is σ (sigma) when referring to population standard deviation, and s when referring to sample standard deviation. Be familiar with both.