lecture 3 depression/BD

Cards (24)

  • Level of Measurement (LOM) determines the type of statistics you can use:
    • Nominal: attributes are named
    • Ordinal: attributes can be ordered
    • Interval: distance is meaningful
    • Ratio: absolute zero
  • LOM and parametric vs. nonparametric:
    • Categorical and ordinal dependent variable -> non-parametric
    • Interval and ratio dependent variable -> more complicated
    • If the distribution is normal -> parametric
    • If the distribution is non-normal -> non-parametric
  • Parametric statistics:
    • Estimate parameters of a population based on the normal distribution
    • Univariate: mean, standard deviation, skewness, kurtosis
    • Bivariate: correlation, linear regression, t-tests
    • Multivariate: multiple linear regression, ANOVAs
    • More powerful, more assumptions, vulnerable to violations
  • Non-parametric statistics:
    • Do not assume sampling from a population that is normally distributed
    • Univariate: median, frequencies
    • Bivariate: Spearman’s correlation, Chi square test
    • Less powerful, fewer assumptions, less vulnerable to assumption violations
  • How many variables are you working on?
    • One -> Univariate (mean, median, mode, histogram, bar chart)
    • Two -> Bivariate (correlation, t-test, scatterplot, clustered bar chart)
    • More than two -> Multivariate (reliability analyses, factor analysis, multiple linear regression)
  • Central Tendency:
    • Mode: most frequent
    • Median: 50th percentile
    • Mean: average
    • Use depends on data type and distribution shape
  • Mode (Mo):
    • Most common score, not affected by outliers
    • Suitable for all data levels
  • Frequencies (f) and percentages (%):
    • Number of responses in each category
    • Percentage of responses in each category
    • Can be visualized with a bar or pie chart
  • Median (Mdn):
    • Mid-point of the distribution, not badly affected by outliers
    • Might not represent central tendency if data is skewed
  • Mean:
    • Average score, sensitive to outliers
    • Used for normally distributed ratio or interval data
  • Standard deviation (SD):
    • Square root of the variance
    • Calculated with Σ √ ( (x - x̄)2/ (N- 1) )
    • Used for normally distributed interval or ratio data
  • Options for nominal LOM:
    • Describe most frequent, least frequent, frequencies, percentages, cumulative percentages, ratios
  • Descriptives for ordinal data:
    • Similar to nominal, can use percentiles
  • Descriptives for interval data:
    • Central tendency, shape/spread, treated as continuous
  • Descriptives for ratio data:
    • Same as interval data, can use ratios
  • Summary of descriptive statistics:
    • Level of measurement and normality determine parametric treatment
    • Describe central tendency and distribution using various measures
  • Central tendency can be described using frequencies, percentages, mode, median, and mean
  • Distribution can be described using minimum and maximum values, range, quartiles, standard deviation, and variance
  • Skewness is a measure of the lean of the distribution, with a tail to the right indicating positive skew and to the left indicating negative skew
  • Kurtosis indicates how flat or peaked the data distribution is, with positive kurtosis for peaked data and negative kurtosis for flat data
  • In a normal distribution, mean = median = mode; positive skew means mode < median < mean, and negative skew means mean < median < mode
  • Non-normal distributions can be described using non-parametric descriptive statistics like quartiles, percentiles, and interquartile ratio
  • Transformations can be used to convert data into a normal distribution for more powerful tests, but this may complicate interpretation
  • Graphical techniques like bar charts, pie charts, histograms, stem and leaf plots, and box plots can be used to represent data visually