Qs D

Cards (23)

  • Correlation coefficient
    A superior metric than covariance to measure the linear relationship, as it is not affected by the magnitude of values in data
  • Probability distributions
    • Weibull
    • Log-normal
    • Logistic
    • Normal
  • Calculating probability of not having congestion at a gate
    1. Calculate λ (arrival rate)
    2. Use exponential distribution to calculate P(x>8)
  • Calculating probability of conducting 20 experiments before identifying no pollutants 18 times
    Use negative binomial distribution
  • Increase in advertising by a firm
    Demand curve shifts right, increasing equilibrium price and quantity
  • Marginal utility added up for each unit gives total utility
  • Correlation coefficient
    Informs about causation, unlike covariance
  • Correlation coefficient is not affected by outliers, unlike covariance
  • Covariance is difficult to calculate, while correlation coefficient is more practical
  • Correlation coefficient is a superior metric than covariance by definition
  • Exponential distribution is used to model the arrival of pedestrians
  • Negative binomial distribution is used to calculate the probability of conducting 20 experiments before identifying no pollutants 18 times
  • Calculating the upper limit of 95% confidence interval for the number of traffic congestions per day

    Use the formula: + Zα/2 * σ/√n
  • Calculating the hypothesis test score for the mean number of accidents per 4 months
    Use the Z-score formula: (X̄ - μ) / √μ
  • The null hypothesis is that the sample mean is equal to the population mean, and the alternative hypothesis is that the sample mean is different than the population mean
  • The test score calculated is 1.83
  • If the test score is 1.83
    The null hypothesis cannot be rejected at 90% CL
  • Calculating the test statistic (χ2) to test if Amsterdam and Enschede have the same crash patterns
    Use the formula: Σ (Oi - Ei)2/Ei
  • The calculated χ2 is 36.37, which is greater than χ0.10,3^2 = 6.251
  • Based on 90% CL, the number of crashes in Amsterdam and Enschede are different
  • Conditions where Poisson regression model is preferred
    • If the dependent variable takes count values
    • If there is an overdispersion problem
  • Speed limit variable has a positive coefficient and traffic volume variable has a negative coefficient in a negative binomial regression model for cycling accidents

    The higher the speed limits, the larger the number of accidents
  • Low speed limit roadways do not necessarily have large traffic volume