Save
1st - sem 1
Research Methods Semester 1:
Descriptive Statistics:
Save
Share
Learn
Content
Leaderboard
Share
Learn
Created by
Natasha Hess
Visit profile
Cards (40)
What is the focus of descriptive statistics?
Descriptive statistics focuses on
summarizing
and describing data.
View source
What is a population in statistics?
A population is the
entire
group
of
people
we’re
interested
in.
View source
What is a sample in statistics?
A sample is a
subset
of our
population
.
View source
How is a sample usually represented in statistics?
A
sample
is
usually represented
with
the
letter
n.
View source
What does n represent in statistics?
n
represents
our
sample size.
View source
What types of data are there in statistics?
There are
categorical
,
discrete
, and continuous data.
View source
What is categorical data?
Categorical data has 2 or more
categories
with no ordering to them.
View source
Give an example of categorical data.
Hair colour
is an example of categorical data.
View source
What is discrete data?
Discrete data has a
fixed value
with a
logical order
.
View source
Give an example of discrete data.
Shoe size
is an example of discrete data.
View source
What is continuous data?
Continuous data can take any
fractional
value.
View source
Give an example of continuous data.
Reaction times
are an example of continuous data.
View source
How can categorical data be presented?
Categorical data can be presented as its
raw frequency
or as a
percentage frequency
.
View source
How can discrete data be presented?
Discrete data can be presented as a
raw frequency
,
percentage
, or cumulative frequency.
View source
What should you do if you have many values in your data?
You should use
frequency
ranges to present the data instead.
View source
What are the measures of central tendency?
Mode
– the score that happens the most often in a dataset
Median
– the middle score in a dataset
Mean
– the sum of data points divided by the number of data points
View source
What is the mode in a dataset?
The
mode
is the
score
that happens the most often in a
dataset.
View source
For what type of data can the mode be used?
The mode can be used for
nominal data
.
View source
What are bimodal and multimodal distributions?
Bimodal distributions have two
modes
, while multimodal distributions have more than two modes.
View source
What is the median in a dataset?
The
median
is the
middle
score in a
dataset.
View source
How is the median calculated for odd value datasets?
The median is calculated as
(
n
+
1
)
/
2
(n+1)/2
(
n
+
1
)
/2
for odd value datasets.
View source
How is the median calculated for even value datasets?
The median is calculated as the
average
of the
middle
two values.
View source
What is an advantage of using the median?
The median is insensitive to
outliers
.
View source
Why is the median often meaningful?
The median often gives a
real
,
meaningful
data
value.
View source
For what types of data is the median useful?
The median is useful for
ordinal
data and skewed
interval/ratio
data.
View source
What is a disadvantage of using the median?
The median ignores a lot of the
data
.
View source
What is a challenge when calculating the median?
It can be hard to calculate the median without a
computer
.
View source
For what type of data can't the median be used?
The median cannot be used with
nominal data
.
View source
How is the mean calculated?
The mean is calculated as the
sum
of data points divided by the
number
of data points.
View source
What is an advantage of using the mean?
The mean uses all the
data
.
View source
For what type of datasets is the mean most effective?
The mean is most effective for
normally distributed
datasets.
View source
What is a disadvantage of using the mean?
The mean is sensitive to
outliers
.
View source
Why might values of the mean not always be meaningful?
Values of the mean aren’t always meaningful, such as when we cannot get a score of
6.74
/
10
6.74/10
6.74/10
.
View source
For what types of data is the mean meaningful?
The mean is only meaningful for
ratio
and
interval
data.
View source
What are the measures of spread related to central tendency?
Mode
: no measures of spread
Median
: distance-based measures (range + interquartile range)
Mean
: center-based measures of spread (
variance
+ standard deviation)
View source
How does the interquartile range compare to the range?
The interquartile range is
similar
to the range but ignores the most
extreme
values.
View source
What does the interquartile range represent?
The interquartile range represents the range of scores within the
middle 50%
of the scores.
View source
How is the lower quartile defined?
The lower quartile is the
median
of the lower
half
of the data.
View source
How is the interquartile range calculated?
The interquartile range is calculated as the
upper quartile
minus the
lower quartile
.
View source
What are deviance and variance in statistics?
Deviance: Each score is subtracted from the
mean
, which could result in a deviance of ‘0’.
Variance: An average of the
sum of squared errors
(
SS
), which is the sum of squared deviances.
View source