Save
...
Part 4
4.4 Big data
4.4.1 Sampling
Save
Share
Learn
Content
Leaderboard
Learn
Created by
King Mole
Visit profile
Cards (13)
Sampling
Examining a subset of the available data, known as the
sample
, instead of the
entire
data set
View source
Sampling
Used for a wide range of scientific, political and economic purposes for more than
two
centuries
Allows researchers to determine
patterns
and
trends
without examining the entire data set
View source
Sampling
Political
opinion polls
Monitoring
car journeys to study
traffic
congestion
View source
Representative sample
A sample that
accurately
reflects the wider population in terms of
relevant
characteristics
View source
Creating a
representative
sample is incredibly hard, as the sample can contain hidden biases or omissions that will lead to mistaken conclusions
View source
Failure
of opinion polls to predict 2015 UK general election
Polls systematically
overrepresented Labour
voters at the expense of
Conservative
voters
View source
Sampling has the drawback that both the
sample
and the data obtained from the sample are defined at the
start
View source
Limitations
of sampling in opinion polls
Landline-based
polls exclude
younger
voters without landlines
Internet-only polls favour
younger
voters
View source
Pollsters cannot ask questions that are not in the
poll
, even if they realise a key question is
missing
View source
Sampling
in scientific investigation
Allows researchers to examine a tiny subset of the possible
data
due to time and
expense constraints
View source
Genetic
testing by
23andMe
Samples a relatively small number of
genes
known to be associated with certain traits and conditions, to keep
costs down
View source
Sampling a small number of
genes
Means
diseases
associated with
genes
outside the sample, or those whose genetic origin is uncertain, cannot be detected
View source
Analysing
complete DNA sequences
Avoids the problem of limited sampling, but is
computationally
intensive and more
expensive
View source