Inferential Statistics

Created by

seerat

Cards (17)

Choosing a Statistical Test:
Carrots Should Come, Mashed With Swede, Under Roast Potatoes.
NOI, URR
Type 1 Errors:
Optimistic error.
This is the incorrect rejection of a null hypothesis which is true.
A false positive.
Type 2 Errors:
The failure to reject the null hypothesis that is false.
A false negative.
Measures of Central Tendency: Mean:
The average of all data points.
Strengths:
Includes all data points, provides a precise average.
Weaknesses:
Can be distorted by extreme values (outliers).
Not ideal for skewed distributions.
Measures of Central Tendency: Median:
Strengths:
Reflects the central point of the data.
Not affected by outliers, better for skewed data.
Weaknesses:
Does not account for all data points.
May not be as accurate in symmetrical data.
Measures of Central Tendency: Mode:
Strengths:
Useful for nominal/categorical data.
Simple to calculate.
Weaknesses:
May not represent the data accurately if there is no clear mode or multiple modes.
Measures of Dispersion: Range:
Strengths:
Easy to calculate, gives a quick sense of the spread of data.
Weaknesses:
Sensitive to outliers.
Doesn’t reflect the distribution of the majority of data points.
Measures of Dispersion: Standard Deviation:
Strengths:
More precise than the range.
Gives a better measure of spread.
Considers all data points.
Weaknesses:
More complex to calculate.
Can be affected by outliers.
May not be meaningful in highly skewed data.
Graphical Representation: Bar Charts:
Strengths:
Simple to interpret.
Useful for comparing categorical data.
Weaknesses:
Can become cluttered with too many categories.
May lose accuracy in large datasets.
Graphical Representation: Histograms:
Strengths:
Ideal for continuous data.
Shows frequency distribution clearly.
Weaknesses:
Less precise with small datasets.
Not suitable for categorical data.
Graphical Representation: Frequency Polygons:
Strengths:
Clearly shows trends and comparisons between distributions.
Weaknesses:
Less detailed than histograms.
Can be hard to interpret with complex data.
Graphical Representation: Pie Charts:
Strengths:
Good for showing proportions.
Easy to understand for simple datasets.
Weaknesses:
Not suitable for large datasets.
Can be misleading if too many categories are used.
Graphical Representations: Scattergrams:
Strengths:
Clearly shows correlations between two variables.
Visually simple.
Weaknesses:
May not reveal causation.
Can be hard to interpret without a clear relationship.
Nominal Data:
Data that consists of categories with no particular order or ranking.
Gender, nationality, hair colour, eye colour.
Strengths:
Easy to collect and categorize.
Useful for classifying and counting frequencies.
Weakness:
Does not allow for any meaningful numerical operations (like averages).
Limited in terms of statistical analysis.
Ordinal Data:
Data that consists of categories with a meaningful order, but the intervals between the categories are not equal or defined.
Likert scales, class rankings.
Strengths:
Allows for a ranking of items, giving more information than nominal data.
Easier to analyse than nominal data, as it shows relative positions.
Weaknesses:
The distances between categories are not uniform, making it harder to make precise comparisons.
Lacks the ability to perform complex mathematical calculations.
Interval Data:
Data Interval data consists of ordered data with equal intervals between values, but there is no true zero point (the zero is arbitrary).
Temperature in Celsius or Fahrenheit, IQ scores, calendar dates.
Strengths:
Equal intervals allow for meaningful comparisons between values (e.g., the difference between 10°C and 20°C is the same as between 30°C and 40°C).
Allows for a wide range of statistical operations (e.g., mean, standard deviation, correlation).
More precise than ordinal and nominal data.
Weaknesses:
Lack of a true zero means you can't make statements about "absolute" values (e.g., 0°C doesn't mean "no temperature").
Mathematical operations involving ratios (e.g., "twice as much") are not meaningful because of the lack of an absolute zero.
Can be difficult to interpret in some contexts (e.g., comparing temperature scales like Celsius and Fahrenheit).
What is Nominal Data?
Data in categories.