provide simple summaries about the sample and the measures
used to present quantitative descriptions in a manageable form
used to summarize data in an organized manner by describing the relationship between variables in a sample or a population
Coverage of Descriptive Statistics
Measures of Frequency (STDEV)
Tells us how many times the value or investigated variable appears in the sample group
Measures of Central Tendency (Variation)
Tells us the position of the values on a normal distribution curve
Measures of Dispersion or Variation
Tells us how close or near a certain group of data are with one another
Measures of Position
Percentile, Decile, Quartile
Categorical Variables
qualitative data or discrete data
Qualitative data in which the values are assigned to set of distinct groups or categories
can either be nominal, ordinal, or dichotomous
Example:
gender
civil status (qualitative data but not dichotomous instead they are multinomous)
Continuous Data
quantitative or numerical
measurable amounts
can either be interval or ratio
Measures of Frequency
number of times a particular value occurs in the data
Relative Frequencies - can be expressed in ration, rates, proportions, and percentages
Measures of Central Tendency
are single values that attempt to describe a set of data by identifying their central position within that set of data
Mean - arithmetic average or the sum of values in a data set divided by the total number of observations
Median - middle value in distribution when the data are ranked in order from the lowest to highest
Mode - most common value in a data set.
Mean
Measures of Dispersion
describes the degree to which the values are similar or diverse
Range - difference between the lowest and the highest value in a set of values
Variance and Standard Deviation - measures of spread that reveal how close each observed value is to the mean of the entire data set
Although the measures of central tendency provide important information in describing ones’ data, they fail to capture the variability within the database
DESCRIPTIVE STATISTICS FOR CATEGORICAL VALUES
To be able to analyze categorical data we must need to convert qualitative data to numeric data by coding
standard deviation
width of the curve
To draw a normal distribution, you need to
know:
The average measurement as this tells you where the center of the curve goes
The standard deviation of the measurements as this tell you how wide the curve should be and the width of the curve determines how tall it is
The wider the curve, the shorter
The narrower the curve, the taller
If the tail is towards the right side of
the value, it is positively skewed
If the tail is towards the left side of the
value, it is negatively skewed
Hypothesis
an assumption or educated guess concerning one or more population parameters of a distribution.
could give a determination whether the observed difference is convincingly different from what was expected from the model
Predictions about what the examination of appropriately collected data will show