2.5.4 Hypothesis testing for correlation

Cards (73)

  • What does Pearson's correlation coefficient measure?
    Linear relationship strength
  • If the absolute value of Pearson's correlation coefficient exceeds the critical value, we reject the null hypothesis.
  • If the p-value is less than the significance level, we reject the null hypothesis
  • What does the null hypothesis in correlation testing assume?
    No correlation exists
  • Match the type of alternative hypothesis with its mathematical expression:
    Two-sided ↔️ ρ0\rho \neq 0
    One-sided (Positive) ↔️ ρ>0\rho > 0
    One-sided (Negative) ↔️ ρ<0\rho < 0
  • What does a correlation coefficient of r=r =1 1 indicate?

    Perfect positive correlation
  • The formula for Pearson's correlation coefficient involves summing the product of (x_{i} - \bar{x})(y_{i} - \bar{y})</latex>.
  • The significance level is the probability of rejecting the null hypothesis when it is actually true
  • What are two common significance levels used in hypothesis testing?
    0.05 and 0.01
  • Steps to calculate Pearson's correlation coefficient
    1️⃣ Find the means xˉ\bar{x} and yˉ\bar{y}
    2️⃣ Calculate (xixˉ)(x_{i} - \bar{x}) and (yiyˉ)(y_{i} - \bar{y})
    3️⃣ Compute (xixˉ)(yiyˉ)(x_{i} - \bar{x})(y_{i} - \bar{y})
    4️⃣ Sum (xixˉ)(yiyˉ)(x_{i} - \bar{x})(y_{i} - \bar{y})
    5️⃣ Calculate (xixˉ)2(x_{i} - \bar{x})^{2} and (yiyˉ)2(y_{i} - \bar{y})^{2}
    6️⃣ Sum (xixˉ)2(x_{i} - \bar{x})^{2} and (yiyˉ)2(y_{i} - \bar{y})^{2}
    7️⃣ Calculate the square root of the product
    8️⃣ Divide the sum of (xixˉ)(yiyˉ)(x_{i} - \bar{x})(y_{i} - \bar{y}) by the square root of the product
  • What does α=\alpha =0.05 0.05 represent in hypothesis testing?

    5% significance level
  • If the p-value is less than α\alpha, we reject
  • If the p-value is greater than or equal to α\alpha, we fail to reject the null hypothesis.
  • What is summarized in the table regarding significance levels, p-value ranges, and decision criteria?
    Hypothesis testing for correlation
  • Match the significance level with the corresponding p-value range to reject the null hypothesis:
    0.05 (5%) ↔️ p<0.05p < 0.05
    0.01 (1%) ↔️ p<0.01p < 0.01
  • What does hypothesis testing for correlation assess?
    Linear relationship between variables
  • The null hypothesis in correlation testing states there is no correlation between variables.
  • To test correlation, we use the correlation coefficient r
  • What is the mathematical notation for the null hypothesis in correlation testing?
    ρ=\rho =0 0
  • The alternative hypothesis in correlation testing states that a correlation exists
  • If r=r =0.75 0.75, exceeds a critical value of 0.60.6, and the p-value is less than 0.050.05, we reject the null hypothesis.
  • What does the null hypothesis assume in correlation testing?
    No correlation exists
  • A one-sided alternative hypothesis can posit either a positive or negative correlation
  • Match the type of hypothesis with its corresponding statement:
    Two-sided ↔️ ρ0\rho \neq 0
    One-sided (Positive) ↔️ ρ>0\rho > 0
    One-sided (Negative) ↔️ ρ<0\rho < 0
  • What does Pearson's correlation coefficient (r) measure?
    Strength and direction of linear relationship
  • A value of r=r =1 1 indicates a perfect positive correlation
  • A value of r=r =0 0 means there is no linear correlation between variables.
  • Steps to calculate Pearson's correlation coefficient (r)
    1️⃣ Find the means xˉ\bar{x} and yˉ\bar{y}
    2️⃣ Calculate (xixˉ)(x_{i} - \bar{x}) and (yiyˉ)(y_{i} - \bar{y})
    3️⃣ Compute (xixˉ)(yiyˉ)(x_{i} - \bar{x})(y_{i} - \bar{y})
    4️⃣ Sum (xixˉ)(yiyˉ)(x_{i} - \bar{x})(y_{i} - \bar{y})
    5️⃣ Calculate (xixˉ)2(x_{i} - \bar{x})^{2} and (yiyˉ)2(y_{i} - \bar{y})^{2}
    6️⃣ Sum (xixˉ)2(x_{i} - \bar{x})^{2} and (yiyˉ)2(y_{i} - \bar{y})^{2}
    7️⃣ Calculate (xixˉ)2(yiyˉ)2\sqrt{\sum (x_{i} - \bar{x})^{2} \sum (y_{i} - \bar{y})^{2}}
    8️⃣ Compute r=r =(xixˉ)(yiyˉ)(xixˉ)2(yiyˉ)2 \frac{\sum (x_{i} - \bar{x})(y_{i} - \bar{y})}{\sqrt{\sum (x_{i} - \bar{x})^{2} \sum (y_{i} - \bar{y})^{2}}}
  • What is the significance level (α\alpha)?

    Probability of Type I error
  • If the p-value is less than α\alpha, we reject
  • The t-distribution is used for small sample sizes.
  • How are the degrees of freedom (dfdf) calculated in the t-distribution?

    df=df =n1 n - 1
  • Match the significance level with its corresponding critical values for df=df =20 20:

    0.05 ↔️ ±2.086\pm 2.086
    0.01 ↔️ ±2.845\pm 2.845
  • What does the test statistic (t-value) measure?
    Deviation from null hypothesis
  • What does the test statistic (t-value) measure?
    Deviation from null hypothesis
  • The formula for calculating the t-value is t = \frac{r \sqrt{n - 2}}{\sqrt{1 - r^{2}}}
  • What does rr represent in the t-value formula?

    Pearson's correlation coefficient
  • The variable nn in the t-value formula represents the sample size
  • The calculated t-value in the example is 5.07
  • What is the purpose of hypothesis testing for correlation?
    Assess linear relationship