lesson 8

Cards (9)

  • Data wrangling
    The process of CONVERTING and MAPPING raw data and getting it ready for analysis
  • tidyverse
    • A COLLECTION OF PACKAGE designed to work together as a full pipeline for doing every stage of data analysis on tidy data as an alternative to the inbuilt base R functions
  • dplyr library
    Part of the tidyverse
  • hflights Dataset
    A dataset used in the examples
  • Analysing the hflights Dataset
    1. Filter for flights originating from IAH airport
    2. Count total flights and delayed flights by each carrier
    3. Convert it to a Delayed per thousand (DPH) metric
    4. Sort the result by DPH in descending order
  • The Pipe (%>%)
    Part of the tidyverse for data manipulation
  • tidyr
    Part of the tidyverse for RESHAPING DATA
  • Tidy data
    • Every column is a variable, every row is an observation, and every cell is a single value
  • The goal of tidyr is to help you create tidy data