The term Big Data has been used to refer to a range of problems and technologies that relate to the management of very large data sets. The amount of data that we currently generate is enormous, and is increasing each year.
“IBM say that ‘every day, we create 2.5 quintillion bytes of data – so much that 90 per cent of the data in the world today has been created in the last two years alone.’”
Social media platforms produce huge quantities of data, both from individual network profiles and the content that influencers and the less influential alike produce (e.g. you-tubers, everyday technology users). Short form blogging, link-sharing, expert blog comments, user forums, ‘likes’ and more all contain potentially useful information that can be mined.
There is also data produced through sheer activity, for example machine-generated