Part 4
4.4 Big data
4.4.5 Data volume versus data quality
Created by
King Mole
Cards (9)
Data volume
The amount of data being processed
Data quality
The accuracy and trustworthiness of the data
The results from any data analysis are only as good as the original data that is being processed
Sampling
Improves the quality of the data at the expense of the volume
Researchers often select the best quality data from reliable sources
Researchers often exclude extreme values as they are unlikely to be representative
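The two sampling steps above (keep data from reliable sources, then drop extreme values) can be sketched in Python. This is only an illustration: the record layout, the function name `sample_quality`, and the use of the 1.5 x IQR rule to decide what counts as "extreme" are all assumptions, not part of the original card.

```python
from statistics import quantiles


def sample_quality(records, reliable_sources, k=1.5):
    """Return a smaller, higher-quality sample of `records`.

    Hypothetical layout: each record is a dict with "source" and "value" keys.
    """
    # Step 1: keep only records from sources judged to be reliable.
    trusted = [r for r in records if r["source"] in reliable_sources]

    values = [r["value"] for r in trusted]
    if len(values) < 4:
        # Too few values to estimate quartiles sensibly; skip outlier removal.
        return trusted

    # Step 2: exclude extreme values (assumed rule: outside k * IQR of the quartiles).
    q1, _, q3 = quantiles(values, n=4)
    iqr = q3 - q1
    low, high = q1 - k * iqr, q3 + k * iqr
    return [r for r in trusted if low <= r["value"] <= high]
```

The returned list is never longer than the input, which reflects the trade-off on the card: quality goes up while volume goes down.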
The sheer volume of data involved in big data processing means there is a high probability that at least some of the data is of poor quality
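A rough way to see why volume makes poor-quality data almost inevitable: assume, purely for illustration, that each record independently has the same small chance p of being poor quality. The chance that at least one of n records is poor is then 1 - (1 - p)^n. The function name and the example figures below are hypothetical.

```python
def prob_at_least_one_poor(n, p):
    """Chance that at least one of n records is poor quality.

    Assumes each record is independently poor with the same probability p.
    """
    return 1 - (1 - p) ** n


# Hypothetical figures: a 0.1% error rate per record.
for n in (10, 1_000, 10_000):
    print(n, prob_at_least_one_poor(n, 0.001))
```

Even with a very low error rate per record, the probability approaches 1 as the number of records grows.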
Veracity
Ensuring the correctness and trustworthiness of the data
Other 'Vs' describing big data
Value
Variability
Visualisation
The 'toos'
Too much data for traditional databases
Too complex for conventional categorisation
Too many updates to the data
Use of big data in the 2016 US presidential campaign
Large volumes of data on voters
Constant updates on voter opinions
Variety of data sources including social media and credit card purchases