Measures of central tendency and dispersion

    Cards (11)

    • Why do we use measures of central tendency of dispersion?
      Raw data can be overwhelming -> lots of numbers without context
    • What is the solution to overwhelming raw data?
      Descriptive statistics -> summarising raw qualitative data
    • What is included in descriptive statistics?
      • Measures of central tendency
      • Measures of dispersion
      • Graphs
    • What are measures of central tendency?
      What we tend to see towards the centre of a data set ('typical' value)
      • Mean
      • Median
      • Mode
    • What are measures of dispersion?
      Single values summarising the spread of a data set + the variation between scores
      • Range
      • Standard deviation
    • What is the mode?
      The most frequent score in a data set
      • 2 different modes = bimodal (2+ = multimodal)
      • Used for nominal level data
      Pros:
      • Not distorted by outliers
      • Only way of giving average in data categories
      Cons:
      • Small data sets = multiple/no modes = no average value
      • Doesn't include all values in calculation
    • What is the median?
      The middle value in a data set ordered from lowest to highest
      • Used for interval level data without outliers + ordinal level data
      Pros:
      • Not distorted by outliers
      • Easy to calculate
      Cons:
      • Doesn't include all values in calculation
      • Even numbers of data sets = median value doesn't exist in raw data
    • What is the mean?
      The mathematical average of a data set
      • Used for interval level data (highly objective data)
      • NOT used with ordinal level data -> too subjective
      Pros:
      • Most sensitive method of summarising data
      Cons:
      • Too sensitive for data with outliers -> easily skewed
    • What is the range?
      The difference between a data set's highest + lowest values
      • Larger range = larger spread of data
      Pros:
      • Easy to calculate
      Cons:
      • Extreme scores distort value
      • Range doesn't show whether scores cluster around the mean or are evenly spread out
    • What is standard deviation?
      A complex calculation using all data points to produce a single value
      • Used when a more specific measure is needed
      • Shows the average distance between each score + the mean -> 1sd = 1 percentile away from the mean
      Pros:
      • Considers all values in its calculation = sensitive
      • Provides info about spread of scores
      Cons:
      • Extreme scores distort sd
      • Difficult to calculate
    • Why do we use descriptive statistics?
      They tell us about key aspects of data + allow us to see patterns that we can make inferences/conclusions about the data's meaning from
    See similar decks