Save
week 4
Data and stats: week 4
Save
Share
Learn
Content
Leaderboard
Learn
Created by
Oumedine Djibrilla
Visit profile
Cards (20)
Scatter
plots
are data points that are connected by dot to dot lines to help make the overall trend clear
bar
charts show data that have
discrete
or categorical values instead of a continuous range of values
Histograms
illustrate frequency data and can be plotted as numbers of percentage
box
and
whisker
allow you to easily see where most of the data in a test set full
standard error
of the mean is a quantity that indicates the uncertainty in a calculated mean
confidence interval
gives an estimated range of values that is likely to include the population parameter being studied standard deviation
chi-square compare observed data with data you would except to obtain according to a hypothesis
T-test
determine if there is a big difference between the mean values of two groups
analysis
of
variance
compares the mean of 2 or more sets of data by calculating how widely individual values in each set vary
correlation
analysis whether there is a relationship/ correlation between two variables
regression analysis
evaluate the scatter of data points around a line that best fit
p-value
means the
probability
causation
means that one event is responsible for the occurrence of the other, while
correlation
means that 2 events appear occur together
both
and
rule
applies when you want to know the probability of two or more independent events occurring together
either and rule
applies when you want to know the probability of an event happening when there are two more alternative ways for that event to occur
When species respond to climate change they
move,
acclimate
,
adapt
,
die
A
biased sample
is a sample that does not represent the whole population.
Observation
study
is when individuals are observed or certain outcomes are measured
An
experimental study
is a study in which the researcher manipulates one of the variables, to determine how it influences other variables
R2
is a measure of the percentage of total variation in the dependent variable that is accounted for by the independent variable.