lecture 3 depression/BD

Created by

Aiman Ali

Cards (24)

Level of Measurement (LOM) determines the type of statistics you can use:
Nominal: attributes are named
Ordinal: attributes can be ordered
Interval: distance is meaningful
Ratio: absolute zero
View source
LOM and parametric vs. nonparametric:
Categorical and ordinal dependent variable -> non-parametric
Interval and ratio dependent variable -> more complicated
If the distribution is normal -> parametric
If the distribution is non-normal -> non-parametric
View source
Parametric statistics:
Estimate parameters of a population based on the normal distribution
Univariate: mean, standard deviation, skewness, kurtosis
Bivariate: correlation, linear regression, t-tests
Multivariate: multiple linear regression, ANOVAs
More powerful, more assumptions, vulnerable to violations
View source
Non-parametric statistics:
Do not assume sampling from a population that is normally distributed
Univariate: median, frequencies
Bivariate: Spearman’s correlation, Chi square test
Less powerful, fewer assumptions, less vulnerable to assumption violations
View source
How many variables are you working on?
One -> Univariate (mean, median, mode, histogram, bar chart)
Two -> Bivariate (correlation, t-test, scatterplot, clustered bar chart)
More than two -> Multivariate (reliability analyses, factor analysis, multiple linear regression)
View source
Central Tendency:
Mode: most frequent
Median: 50th percentile
Mean: average
Use depends on data type and distribution shape
View source
Mode (Mo):
Most common score, not affected by outliers
Suitable for all data levels
View source
Frequencies (f) and percentages (%):
Number of responses in each category
Percentage of responses in each category
Can be visualized with a bar or pie chart
View source
Median (Mdn):
Mid-point of the distribution, not badly affected by outliers
Might not represent central tendency if data is skewed
View source
Mean:
Average score, sensitive to outliers
Used for normally distributed ratio or interval data
View source
Standard deviation (SD):
Square root of the variance
Calculated with Σ √ ( (x - x̄)2/ (N- 1) )
Used for normally distributed interval or ratio data
View source
Options for nominal LOM:
Describe most frequent, least frequent, frequencies, percentages, cumulative percentages, ratios
View source
Descriptives for ordinal data:
Similar to nominal, can use percentiles
View source
Descriptives for interval data:
Central tendency, shape/spread, treated as continuous
View source
Descriptives for ratio data:
Same as interval data, can use ratios
View source
Summary of descriptive statistics:
Level of measurement and normality determine parametric treatment
Describe central tendency and distribution using various measures
View source
Central tendency can be described using frequencies, percentages, mode, median, and mean
View source
Distribution can be described using minimum and maximum values, range, quartiles, standard deviation, and variance
View source
Skewness is a measure of the lean of the distribution, with a tail to the right indicating positive skew and to the left indicating negative skew
View source
Kurtosis indicates how flat or peaked the data distribution is, with positive kurtosis for peaked data and negative kurtosis for flat data
View source
In a normal distribution, mean = median = mode; positive skew means mode < median < mean, and negative skew means mean < median < mode
View source
Non-normal distributions can be described using non-parametric descriptive statistics like quartiles, percentiles, and interquartile ratio
View source
Transformations can be used to convert data into a normal distribution for more powerful tests, but this may complicate interpretation
View source
Graphical techniques like bar charts, pie charts, histograms, stem and leaf plots, and box plots can be used to represent data visually
View source