unit 5: data

Cards (10)

  • Information: the collection of facts and patterns extracted from data
  • Metadata: data about data
  • Cleaning Data - a process that makes the data uniform without changing its meaning (EX: replacing all equivalent abbreviations, spellings, and capitalizations with the same word)
  • Data filtering - choosing a smaller subset of a data set to use for analysis, for example by eliminating/keeping only certain rows in a table
  • Correlation: a relationship between two pieces of data, typically referring to the amount that one varies in relation to the other
  • Data Bias - data that does not accurately reflect the full population or phenomenon being studied
  • Information- the collection of facts and patterns extracted from data
  • Big data- collect huge amounts of data so we can learn even more from it
  • Citizen Science- scientific research conducted in whole or part by distributive individuals, many of whom may not be scientist, who contribute relevant data to research using their own computing device
  • Crowd sourcing- the practice of obtaining input or information from a large number of people via the internet.