Data Mining

Subdecks (3)

Cards (33)

  • Data Mining Methodology
    1) Problem Understanding
    2) Data Understanding
    3) Data Pre-processing
    4) Data Modelling
    4) Data Evaluation
  • 4 questions to consider before data mining
    1. Can we clearly define the problem?
    2. Do potentially meaningful data exist?
    3. Do the data contain hidden knowledge or report only
    4. Cost of processing < profit increase from data mining?
  • Why Data Mining?
    (1) more data is being generated
    (2) computing power is affordable
  • data mining
    A process that uses a variety of data analysis tools to discover patterns and relationships in data that may be used to make valid predictions
  • data mart
    contains a subset of data warehouse information
  • data warehouse
    a place where databases are stored so that they are available when needed
  • big data

    large volume of data