week 4

Cards (67)

  • Covariation
    • Correlation underlies many different statistical analyses we use in this unit, e.g. internal consistency, regression, factor analysis
  • Total error = sum of deviance =  (xi - X̄)
  • Sum of squared errors (SS) =  (xi - X̄) (xi - X̄)
  • Variance (s2) = ss/(N-1) = ( (xi- X ̄)2)/(N-1)
  • Standard deviation (s) = √ ((xi- X ̄2)/(N-1)
  • Linear correlation
    • To what extent do two variables have a linear relationship?
    • A simple/straight-line/constant relationship
    • To interpret a linear correlation, look at the: Direction, Strength, Statistical significance
  • Types of correlation
    • No relationship (r is close to 0)
    • Linear relationship
    • Non-linear relationship
  • How do you choose which type of correlation to use?
    It depends on the levels of measurement of your variables
  • Nominal by nominal
    • Contingency (crosstabs) table of observed and expected frequencies, Row and/or column percentages, Marginal totals, Clustered bar chart, Chi-square, Phi (Φ) or Cramer’s V
  • Chi-square
    To calculate: χ2 = Σ((O-E) ^2/E), χ2 = the test statistic that approaches a χ2 distribution, O = frequencies observed, E = frequencies expected (by the null hypothesis), χ2 = sum of: (observed – expected) ^2 / expected, Expected counts are the cell frequencies that should occur if the variables are not correlated, Chi-square is based on the squared differences between the actual and expected cell counts
  • Effect size: χ2(1) = 292, p < .001, Φ = .18, 2x2 -> use the Phi effect size
  • PHI (Φ) & CRAMER’S V

    These are non-parametric measures of correlation, Phi is used for 2x2, 2x3, or 3x2 analyses, Cramer’s V is used for 3x3 and greater analyses
  • Write up example: 'A Chi-squared analysis revealed a significant, small association (χ2(1) = 292, p < .001, Φ = .18). Returning to the cross-tabulation results, women were more likely to say it is true that the father’s gene determines the sex of a baby, while men are more likely to say this is false'
  • Ordinal by ordinal

    • Spearman’s rho (rs), Kendall tau (τ)
  • Graphing: Ordinal by ordinal data is difficult to visualize – it is non-parametric, but it can have many points. You can use non-parametric approaches (e.g. clustered bar chart), Parametric approaches (e.g. scatterplot with line of best fit)
  • Spearman’s rho
    Spearman’s rho is also called Spearman’s rank order correlation. Use this when your data is ranked (ordinal) LOM. The formula is the same as the Pearson’s product-moment correlation. Be careful with your interpretation so you are considering the underlying ranked scales
  • Kendall’s tau
    Three versions: Tau a – does not take joint ranks into account (e.g. tied for 2nd place), Tau b – takes joint ranks into account, works for square tables (i.e. 2x2, 3x3, 4x4...), Tau c – takes joint ranks into account, works for rectangular tables (2x3, 4x2, etc.)
  • Response options
    • Very unsure, Fairly unsure, Fairly sure, Very sure
  • Clustered bar chart
    Have to first change the format of one variable to categorical in jamovi
  • Scatterplot
    For this option, both variables in jamovi should be at least ordinal
  • Interpretation: There is a significant effect where people who are very confident in their first answer also tend to be very confident in their second answer. τb = .32, p < .001
  • Dichotomous by interval/Ratio
    • Correlational approaches: You have one dichotomous and one interval/ratio variable
  • There is a significant effect where people who are very confident in their first answer also tend to be very confident in their second answer
  • Antibiotics kill viruses as well as bacteria
  • Interpretation of positive relationship
    • The higher one scores on the interval measure, the more likely they are to choose 'false'
  • Interpretation of negative relationship
    • The higher one scores on the interval measure, the more likely they are to choose 'true'
  • There was a small, significant relationship
    rpb = .12, p < .001
  • Those who believe science is more credible
    Are more likely to report that antibiotics do NOT kill viruses as well as bacteria
  • Those who are more sceptical about science
    Are more likely to say it is true that antibiotics kill viruses as well as bacteria
  • Correlation is a measure of the standardized covariance
  • The relationship between X and Y is the covariance of X and Y divided by the product of the standard deviations of X and Y
  • The covariance between two variables is the variance that they share
  • Covariance reflects the direction of the relationship (+/-)
  • Covariance is unstandardized – depends on two variables having the same scale of measurement
  • When you standardize the covariance (by dividing it by the cross-product of the SDs), you get the correlation
  • Correlation is an effect size – a standardized measure of the strength of the relationship
  • It’s rare to get correlations of 0 (no relationship at all)
  • How do we decide if we have a ‘real’ relationship between our variables?
  • What is the likelihood that an observed relationship is due to chance?
  • What is the likelihood that it reflects a true relationship?