Detecting data quality issues
1. Investigate unique domain values (unique)
2. Investigate value ranges for the column
3. Type casting (ex. Parse date string to datetime object to catch exceptions (pandas.to_datetime)
4. Highly dependent on the domain and problems
5. Identifying keys to check duplicates (try different keys)
6. Investigate unique domain values
7. Investigate value range, cautious about extremely small and large values
9. Range of values using df.describe()
10. Graphical tools (ex. boxplot)
12. Good to do some comparison between results found by different identifiers