The process of CONVERTING and MAPPING raw data into a form that is ready for analysis
tidyverse
A COLLECTION OF PACKAGES designed to work together as a full pipeline for every stage of data analysis on tidy data, offered as an alternative to the built-in base R functions
dplyr library
Part of the tidyverse, used for DATA MANIPULATION (filtering, grouping, summarising, and sorting data frames)
hflights Dataset
A dataset of flights departing from Houston's airports (IAH and HOU) in 2011, used in the examples
Analysing the hflights Dataset
1. Filter for flights originating from IAH airport
2. Count total flights and delayed flights by each carrier
3. Convert the counts to a delayed flights per hundred (DPH) metric
4. Sort the result by DPH in descending order
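The four steps above can be sketched with dplyr as follows. This is a minimal sketch, not a definitive solution: the toy data frame is hypothetical and only mimics three hflights columns (Origin, UniqueCarrier, ArrDelay), a flight is assumed to count as "delayed" when ArrDelay is greater than zero, and the scale factor is assumed to be 100, reading the "H" in DPH as "hundred".

```r
library(dplyr)

# Hypothetical toy data with the same column names as hflights;
# replace with hflights::hflights to run on the real dataset.
flights <- data.frame(
  Origin        = c("IAH", "IAH", "IAH", "HOU", "IAH", "IAH"),
  UniqueCarrier = c("AA",  "AA",  "CO",  "WN",  "CO",  "CO"),
  ArrDelay      = c(12,    -3,    0,     25,    40,    7)
)

# Assumption: DPH is read as "per hundred"; change to 1000 for per thousand.
per <- 100

dph_by_carrier <- flights %>%
  filter(Origin == "IAH") %>%                 # 1. keep flights leaving IAH
  group_by(UniqueCarrier) %>%                 # 2. one group per carrier
  summarise(
    total_flights   = n(),
    # Assumption: "delayed" means a positive arrival delay
    delayed_flights = sum(ArrDelay > 0, na.rm = TRUE)
  ) %>%
  mutate(dph = delayed_flights / total_flights * per) %>%  # 3. rate metric
  arrange(desc(dph))                          # 4. highest DPH first

print(dph_by_carrier)
```

Each verb handles one step of the list, so the pipeline reads in the same order as the analysis plan.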
The Pipe (%>%)
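An operator from the magrittr package (part of the tidyverse and re-exported by dplyr) that passes the result on its left as the FIRST ARGUMENT of the function on its right, so data manipulation steps can be chained in reading order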
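A small illustration of what the pipe does, assuming dplyr is loaded (it re-exports %>% from magrittr). The delays vector is made-up data for demonstration only.

```r
library(dplyr)  # makes %>% available

# Hypothetical arrival delays in minutes
delays <- c(12, -3, 0, 40, 7)

# Nested calls must be read from the inside out
nested <- round(mean(abs(delays)), 1)

# With the pipe, the value on the left becomes the first argument
# of the function on the right, so the steps read left to right
piped <- delays %>% abs() %>% mean() %>% round(1)

print(identical(nested, piped))
```

Both forms compute the same value; the piped form simply lists the steps in the order they are applied.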
tidyr
Part of the tidyverse for RESHAPING DATA
Tidy data
Every column is a variable, every row is an observation, and every cell is a single value
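As a rough sketch of what this definition means in practice, the example below uses tidyr's pivot_longer() to reshape a table that is not tidy. The data frame and its column names (carrier, flights_2010, flights_2011) are hypothetical, chosen only for illustration.

```r
library(tidyr)

# Hypothetical untidy table: the "year" variable is hidden in the column names
untidy <- data.frame(
  carrier      = c("AA", "CO"),
  flights_2010 = c(100, 150),
  flights_2011 = c(120, 160)
)

# Reshape so that each column is one variable (carrier, year, flights)
# and each row is one observation (one carrier in one year)
tidy <- pivot_longer(
  untidy,
  cols         = c(flights_2010, flights_2011),
  names_to     = "year",
  names_prefix = "flights_",
  values_to    = "flights"
)

print(tidy)
```

After reshaping, the two rows and two year columns become four rows, one per carrier and year, with a single value in every cell.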