Statistics

Cards (25)

  • What is the definition of data science?
    No exact definition exists
  • At what intersection does data science lie?
    Statistical and computational sciences
  • What is the concept shared by data mining and big data?
    Use of powerful hardware and algorithms
  • What does data science synthesize according to Cao (2017)?
    Statistics, informatics, computing, communication, management, sociology
  • What is the primary use of data science?
    To make decisions and predictions
  • What is the center of data science?
    Data, especially Big Data
  • What is the purpose of data science?
    To obtain knowledge for better decisions
  • How is data science described as a field?
    Multidisciplinary with applied theories
  • How do some view the distinction between data science and statistics?
    Many see no distinction
  • Who believes data science is statistics?
    Karl Broman
  • What does Nate Silver think about the term "data scientist"?
    It is an attractive term for a statistician
  • What does Andrew Gelman emphasize about data science?
    Statistics is not the most important part
  • How does Vasant Dhar view data science?
    It seeks actionable patterns for predictions
  • What are the components of a data scientist's role?
    Mathematician, computer scientist, trend-spotter
  • Why are data scientists sought after by businesses?
    They can manipulate raw data into useful information
  • What tasks must a data scientist be able to perform?
    • Collect and transform messy data
    • Solve business-related problems
    • Work with programming languages (SAS, R, Python)
    • Grasp statistics, tests, and distributions
    • Learn analytical techniques (machine learning, deep learning)
    • Spot trends and patterns in data
    • Communicate with IT and business
  • What is R used for?
    Statistical computing and graphics
  • What are the features of R?
    Free, extensible, runs on various operating systems
  • Who developed Python?
    Guido van Rossum
  • What are the characteristics of Python?
    Object-oriented, interpreted, interactive
  • What is SAS used for?
    Statistical analysis
  • What is a key feature of SAS?
    Leading tool in commercial analytics
  • What is a drawback of SAS?
    It is the most expensive of the three languages covered here (R, Python, SAS)
  • What are the differences between R, Python, and SAS?
    • R: Statistical computing and graphics, free, extensible
    • Python: Object-oriented, interpreted, interactive, clear syntax
    • SAS: Statistical analysis tool, leading in commercial analytics, expensive
  • What is statistics?
    The collecting, organizing, presenting, analyzing, and interpreting of data
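
The definition of statistics above (collecting, organizing, presenting, analyzing, and interpreting data) can be illustrated with a minimal Python sketch. The values in raw_scores are made up for illustration and are not from the cards; only Python's standard statistics module is used.

```python
import statistics

# Collect: hypothetical raw observations, including one missing entry.
raw_scores = ["72", "85", None, "91", "85", "64"]

# Organize: drop missing entries, convert text to numbers, and sort.
scores = sorted(int(s) for s in raw_scores if s is not None)

# Analyze: compute a few simple descriptive statistics.
summary = {
    "n": len(scores),
    "mean": statistics.mean(scores),
    "median": statistics.median(scores),
    "mode": statistics.mode(scores),
}

# Present: print each statistic on its own line.
for name, value in summary.items():
    print(f"{name}: {value}")
```

Interpreting is the step left to the reader: here the median and mode coincide, which suggests the typical score sits near the repeated value rather than near the lowest one.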