Statistics

Understanding Statistics

  • Statistics is the branch of mathematics dealing with collection, analysis, interpretation, presentation, and organisation of data.
  • An important aspect is descriptive statistics, which summarise and organise data sets intuitively. Concepts include mean, median, mode, range, and standard deviation.
  • Inferential statistics make predictions or inferences about a population based on a sample of data. Key areas involve hypothesis testing, regression analysis, and correlation.

Data Types

  • Primary data is collected first-hand for a specific research purpose, while secondary data is already existing data used for a new research purpose.
  • Qualitative data provides insightful information but cannot be measured numerically, such as opinions or descriptions.
  • Quantitative data can be counted, measured, and expressed using numbers. Subtypes include discrete data (countable values) and continuous data (measurable quantities).

Measures of Central Tendency

  • The mean is the sum of the data divided by the number of data points. Typically, it’s the first go-to measure of central tendency.
  • The median is the middle value when arranging data in ascending order. It effectively splits the data set into two halves.
  • The mode is the value that appears most frequently in a data set.

Measures of Dispersion

  • The range is the difference between the highest and the lowest values in a dataset.
  • Variance shows how individual data points differ from the mean. The square root of variance is the standard deviation, a widely used measure of variability or diversity.

Correlation and Regression

  • Correlation measures the relationship between two variables, i.e., how they change together. It’s quantified by the correlation coefficient, which ranges from -1 to +1.
  • Regression analysis predicts the relationship between two or more variables.

Probability

  • Probability is a measure of how likely an event is to occur out of the number of possible outcomes.
  • Key concepts include independent events, dependent events, mutually exclusive events and exhaustive events.

Hypothesis Testing

  • This is a statistical method that uses sample data to evaluate two mutually exclusive statements about a population and makes a decision about whether to reject or fail to reject the null hypothesis.

Sampling and Survey Methods

  • Techniques include random sampling, systematic sampling, stratified sampling, cluster sampling, convenience sampling, etc.
  • Proper sampling methods ensure a more accurate representation of the population and help avoid bias.