Spearman's rank correlation coefficient
Spearman’s Rank Correlation Coefficient
Introduction
- Spearman’s Rank Correlation Coefficient, often symbolised as ρ, is a non-parametric measure of statistical gauging the strength and direction of monotonic relationships between paired data.
- This measure is named after the British psychologist Charles Spearman.
- Unlike Pearson’s correlation, Spearman’s is used when the relationship between the variables is non-linear and the variables measured are ordinal.
Definitions and Properties
- Spearman’s rank correlation coefficient ρ is calculated using this formula: ρ = 1- [(6∑d_i^2) / (n(n^2 - 1))].
- In this formula, d represents the differences between the ranks of two variables, and n is the number of observations.
- The result will always fall between -1 and 1, inclusive.
- A positive ρ indicates a positive relationship between the variables; as one increases, the other also increases.
- A negative ρ indicates a negative relationship between the variables; as one increases, the other decreases.
- The absolute value of ρ indicates the strength of the relationship, with values closer to 1 or -1 indicating a stronger correlation.
Assumptions
- Spearman’s Rank Correlation Coefficient can be applied to ordinal variables and does not require the assumption of normality in the dataset.
- It is used when we cannot assure the relationship between variables is linear and variables may not be normally distributed.
- An important assumption is that there are no tied ranks.
Applications
- This coefficient is commonly used in fields such as psychology, education, and social sciences to analyse human behaviour.
- It’s most often used when determining the correlation between variables that are non-linearly related or if data has outliers that could skew results of a Pearson’s correlation.
- It can be central to hypothesis testing or to establish the validity of experimental results.
Practical Example
- For example, one might use Spearman’s rank correlation coefficient to test the relationship between people’s subjective rankings of a public speaker (from 1st place to 100th place) and the number of pauses the speaker made during their speech. Regardless of whether the relationship is linear or not, a ρ close to 1 might suggest that the number of pause a speaker made has a strong positive influence on viewers’ perception of them.