Comparing Data Sets

Comparing Data Sets

Understanding Data Sets

  • Data sets are groups of data gathered to help in finding an answer or hypothesis.
  • Identifying the type of data is critical. Data can be quantitative (numerical) or qualitative (descriptive)
  • Quantitative data can be discrete (countable) or continuous (measurable).
  • Qualitative data may be nominal (categories) or ordinal (ordered categories).

Comparing and Describing Data Sets

  • Look for patterns, trends, or peculiarities in the data that could be of importance.
  • When comparing data sets, consider the mean, median and mode to describe central values.
  • Understand the range, which shows the spread of the values in a data set.
  • The interquartile range can be used to describe dispersion in the data. It measures the spread of the middle 50% of values in the ordered data set.
  • In comparative analysis, data sets can also be evaluated by their variance and standard deviation - measures of how spread out the numbers are from the mean.

Graphical Representation of Data

  • Create bar charts, histograms or pie charts, which can visually compare data sets in a meaningful way.
  • A box-and-whisker plot shows the median, quartiles, and possible outliers in the data set, which aids in comparison.
  • A scatter plot may reveal a correlation or trend between two data sets.

Interpreting the Comparison of Data Sets

  • Data sets are not always fully representative or entirely unbiased, which should be remembered when drawing conclusions.
  • Identify outliers, as they can heavily influence mean values and may not be typical of the data as a whole.
  • Remember, correlation between data sets does not necessarily imply causation. Many other factors could influence the results.
  • Data manipulation, such as standardising, can eliminate unit differences and make data sets of different scales comparable.