MATH

Cards (19)

  • Data science is the study of data to extract
    meaningful insights for business. It is a
    multidisciplinary approach that combines
    principles and practices from the fields of
    mathematics, statistics, artificial intelligence, and
    computer engineering to analyze large amounts of
    data.
  • Data Collection- is the process of gathering and
    measuring information on variables of interest in an
    established system , which then enables one to answer
    stated research questions and evaluate outcomes.
  • Two formats for presenting data sets: Ungrouped data and Grouped data
  • Ungrouped data which is an explicit listing of all the
    individual data points.
  • Grouped data which data are clustered into classes and
    are represented in frequencies.
  • There are five Common ways of presenting data:
    Frequency Table
    Cumulative Frequency Table
    Relative Frequency Table
    Stem and Leaf Diagram
    Histogram
  • Frequency Table - common way to present data in tabular form in which each distinct value x is listed id the first row or column and its frequency f in the second row or column. frequency is the number of times the value x appears in the data set.
  • Cumulative Frequency is the number of times a
    data value occurs that is above or below a certain
    data value within the data set.
  • Relative Frequency refers to the percentage of a data set
    that each specific data value appears within the data. In other
    words, determines the portion of the data set consisting of
    each of the individual data values. It can be written as
    fractions, decimals, or percent.
  • Stem and Leaf Diagram - It is a special table where each data value is split into a
    "stem" (the first digit or digits) and a "leaf" (usually the last
    digit).
    It is a way to arrange and represent data so that it is simple
    to see how frequently various data values occur.
  • Histogram
    It is a graphical display of data using bars of different
    height.
    It is often described as a bar graph of the frequency.
  • A measure of central tendency is a descriptive statistic
    that describes the average, or typical value of a set of
    scores.
    There are three common measures of central tendency:
    the mean
    the median
    the mode
  • Ungrouped Data Mean- The mean is also known as the average, and it is calculated by
    adding up all the values in a data set and dividing by the total number of
    values.
  • Ungrouped Data Median- The median is the middle value of a data set, which separates
    the highest and lowest values equally. It is calculated by arranging the
    data set in order from lowest to highest and finding the value in the
    exact middle.
  • Ungrouped Data Mode- The mode is the value that appears most frequently in a data set.
  • Grouped Data Mean - the mean or average of observations is the sum of the values of all the observations divided by the total number of observations.
  • grouped Median -
  • grouped mode - it is that value of a variety that occurs most often. More precisely, the mode is the value of the valuable at which the concentration of the data is maximum.
  • Modal Class - In a frequency distribution, the class having the maximum frequency is called the modal class.