QM Test 2

    Cards (55)

    • What is correlation?
      A statistical relationship between two variables which may or may not be causal
    • What is a positive correlation?
      Both variables increase
    • What is a negative correlation?
      Both variables decrease while the other increases
    • What is a co-vary?
      When one variable increases or decreases as another variable increases or decreases
    • What is covariance?
      Similar to the concept of variances, but is the product of deviation from the mean of two variables
    • What is the correlation coefficient?
      The standard measure of the relationship between two variables
    • What is the Pearson correlation coefficient? 

      Correlation coefficient (r) given by the description above, most common ranges for +1, perfect positive relationship to -1, perfect negative relationship points - falls on a line.
    • What other definition does the PCC have?
      This is a linear estimator of the direction and strength of the relationship.
    • What is the Spearman Rank Correlation (rho, p,rs)?
      Nonparmentic correlation based on ranks of values, used when nominates can be assumed or with ordinal data.
    • What is the assumption of Pearson Correlation Coefficient?
      Continuous data 
    • Pearson Correlation Coefficient is an estimator for what?
      Linear
    • Significance test for correlation is what?
      t-distribution 
    • A non-parametric correlation based on ranks of values, used when normality can’t be assumed, or with ordinal data, is know as
      Spearman Rank Correlation
    • Strength levels of correlations - >0.8
      very strong
    • Strength levels of correlations - 0.6-0.8
      Strong
    • Strength levels of correlations - 0.4-0.6
      Moderate
    • Strength levels of correlations - 0.2-0.4
      Weak
    • Strength levels of correlations - <0.2
      Very weak
    • Regression examines the relationship between a (Blank) variable, and (Blank) Variable(s)

      Dependent & Independent
    • Ordinary Least Squares (OLS) that makes it "BLUE" is broken down into what
      1. Best
      2. Linear
      3. Unbiased
      4. Estimator
    • In OLS "BLUE" The B stands for what
      B = it is the most efficient, regression line has the least sample to sample error variation of any estimator
    • In OLS "BLUE" The L stands for what
      L = OLS estimates a straight line
    • In OLS "BLUE" The U stands for what
      U = The mean of n sample estimates is equal to the true population parameter values
    • In OLS "BLUE" The E stands for what
      E = It estimates populations values of Y given sample data for Y and X
    • What is the formula for linear regression
      ŷ = α + βx
    • What do the different values in the linear regression formula stand for? ŷ = α + βx
      • ŷ = the predicted values of y
      • α = line intercept, the value of y when x is zero
      • β = the slope of the line (B = Change in y/Change in x)
      • x = is the observed value of x
    • True or false - In linear regression assumptions, the residuals are dependent which means the value of one error is affected by the value of another.
      False the residuals are INDEPENDENT
    • Residuals are the difference between the (Blank) values of y and the (Blank) values of y.

      Predicted & Observed
    • Regression lines are not designed to exactly (Blank) every observed value of the (Blank) variable.

      Predict & Dependent
    • True or false - Normality can be examined with a histogram and/or Shapiro-Wilk test.
      True
    • Root Mean Square Error is the (Blank Blank)of the error term, and is the square root of the Mean Square Residual (or Error)
      Standard deviation
    • What is linear regression?

      Fits a straight line through the data to describe the effect of x on y and how well it predicts x and y
    • Can the linear regression equation accommodate a bivariate regression
      yes - one dependent and one independent variable
    • What is an error in terms of linear regression
      prediction of y has error associated with every observed value of y - i.e difference between the predicted and observed value for every observation
    • What is the difference between residuals vs fitted values plot
      shows the residuals of each observation on the y-axis against the predicted values of the dependent variable for each observation on the x-axis
    • What are the degrees of freedom
      1. total = # of observation -1
      2. Model = # of predictions -1, the intercept counts
      3. Residual = total df - model df
    • What is the coefficient of determination (r2 or R squared) 

      The proportion of variation in the dependent variable explained by the regression model
    • What is collinearity?
      Correlation between independent variables such that they express a linear relationship in a regression model
    • What are added assumptions in multicollinearity?
      No multicollinearity, multiple regression assumption that the independent variable are not highly corrected with each other.
    • What is a dummy variable? 

      Binary variable (0/1) that are created to represent nominal categories in a regression model
    See similar decks