Inferential statistics

Cards (68)

  • Hypothesis is a statement or conjecture about any phenomena for which the truth has not been verified
  • Hypothesis will need investigation to ascertain its truthfulness
  • Examples of hypotheses:
    • Fear increases hypertension
    • Females are more prone to rheumatoid arthritis
    • Breast cancer is commoner in women who do not breastfeed
    • Sore throat is commoner in winter
  • Procedure for test of hypothesis:
    • State Null Hypothesis
    • State alternative hypothesis
    • State level of statistical error
    • Choose appropriate test statistic
    • Apply data and evaluate test statistic
    • Take decision: reject or not reject null hypothesis
  • Null Hypothesis:
    • Denoted by H_o
    • It is the hypothesis under test
    • It is the hypothesis to be nullified if data does not support it
    • Example: No difference in age of onset of rheumatoid arthritis between males and females
  • Alternative Hypothesis:
    • Denoted by H_a
    • It is the hypothesis to consider if H_o is rejected
    • Statement in the affirmative language
    • Example: There is a difference in the age of onset of Rheumatoid Arthritis between males and females
  • Level of Statistical Error:
    • Alpha or type 1 error (α): This is the error committed for rejecting true null hypothesis
    • Always very small (0.05 or 5%)
  • Type of Errors in Decision Making:
    • Type 1 Error (α): Reject a true Null Hypothesis. This is rejecting a Null Hypothesis when it is indeed True
    • Type II Error (β): Failure to reject a false Null Hypothesis. This is not rejecting a Null Hypothesis when it is indeed false (β-ERROR)
  • The P-Value:
    • Type 1 error is the level of significance – (α) (alpha)
    • Type II error is denoted by β (beta)
    • 1 - β = Power of the test
  • Usual Objectives of Studies:
    • Compare characteristics of groups particularly average values
    • Investigation of association or relationship between variables
  • Choice of Test Statistic:
    • Depend on study objectives (research questions)
    • Kind of data (quantitative or qualitative)
    • Sample size (small or large)
  • Type of Test Statistics:
    • Parametric tests
    • Non-parametric tests
  • Parametric Tests:
    • Assume distributional forms for the measurements and parameters in the population
    • Commonest assumption is normal distribution
    • Examples:
    • Z-test for comparing 2 proportions
    • T-test to compare mean values between only 2 groups
    • F-test or analysis of variance to compare mean values between several groups (more than 2 groups)
  • Non-Parametric Tests:
    • Do not assume any particular functional form for a population distribution
    • Called distribution-free methods
    • Examples:
    • CHI-SQUARE TEST
    • MANN-WHITNEY-U TEST
    • WILCOXON SIGNED RANK SUM TEST
    • KRUSKAL-WALLIS TEST
    • MEDIAN TEST
  • Statistics is a field of study concerned with:
    • The collection, organization, summarization, and analysis of data
    • The drawing of inferences about a body of data when only a part of the data is observed
  • Statistics can be broadly classified into 2 categories:
    • Descriptive statistics
    • Inferential statistics
  • Descriptive statistics include:
    • Measures of central tendency
    • Measures of dispersion
    • Diagrammatic (Graphic or pictorial) presentation of data
  • Inferential statistics deals with extrapolation, drawing conclusions to the population based on observations from a sample
  • Population:
    • The largest collection of entities for which we have an interest, which may be finite or infinite
  • Sample:
    • A part of a population that is analyzed to draw conclusions about the population
  • Parameter vs. Statistic:
    • When information is based on the entire population, it is a parameter
    • When information is based on a sample, it is a statistic or an estimate
  • Study statistics are estimates of corresponding population parameters because the true values of the population are usually unknown
  • The purpose of inferential statistics is to determine whether relationships observed in a sample are "real" or due to chance variation
  • Hypothesis:
    • A claim or assumption about the population parameter, such as the population mean or proportion
  • Null Hypothesis (H0):
    • The hypothesis under test that is nullified if data does not support it
    • It is a statement of no difference, association, effect, or equality
  • Alternative Hypothesis (Ha):
    • The research question considered if the null hypothesis is rejected
    • It is a statement in affirmative language
  • Errors in Hypothesis Tests:
    • Type I error: Rejecting the null hypothesis when it is true (α)
    • Type II error: Failing to reject the null hypothesis when it is false
  • Type I error is the error committed when you reject the null hypothesis, but in reality, the null hypothesis is correct
  • Type I error is also known as alpha (α) error
  • Type I error is made if you find an association where there is none, or if you claim there is a difference when there isn't, or if you state a treatment has an effect when it does not
  • Type II error is the error committed when you fail to reject the null hypothesis, but in reality, the null hypothesis is false
  • Type II error is also known as beta (β) error
  • Type II error is made if you find no association where there is one, or if you claim there is no difference when there is, or if you state a treatment has no effect when it does
  • Level of statistical error, also known as the level of significance or alpha (α) error, is the maximum probability of committing a Type I error
  • A small p-value (less than 0.05) leads to the rejection of the null hypothesis
  • Power (1-β) of a test is its ability to reject a null hypothesis when it is false
    1. Value is a measure of how much evidence we have against the null hypothesis
  • Smaller p-value implies greater inconsistency
  • Conventionally, we reject the null hypothesis if the p-value is less than 0.05
  • Steps in testing of statistical hypothesis: