-Gathering data from different sources and in different file formats.
-Assessing data for quality and tidiness issues.
-Cleaning data to resolve both quality and tidiness issues.
-Getting insights from the data and making representative visualizations.
-Building learning-based models for predicting out-of-sample behavior.
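
The gather, assess, and clean steps listed above can be sketched roughly as follows. This is only a minimal illustration using the Python standard library: the sample data, the column names (`id`, `name`, `rating`, `likes`), and the helper functions `gather`, `assess`, and `clean` are all hypothetical and not taken from the project itself.

```python
import csv
import io
import json

# Hypothetical sample data standing in for two sources in different formats.
CSV_TEXT = "id,name,rating\n1,Rex,12\n2,,11\n2,,11\n3,Bo,not_a_number\n"
JSON_TEXT = '[{"id": 1, "likes": 40}, {"id": 3, "likes": 7}]'


def gather(csv_text, json_text):
    """Read rows from a CSV source and records from a JSON source."""
    rows = list(csv.DictReader(io.StringIO(csv_text)))
    extra = {rec["id"]: rec for rec in json.loads(json_text)}
    return rows, extra


def assess(rows):
    """Count simple quality issues: missing names, duplicate rows, bad ratings."""
    seen, duplicates = set(), 0
    for row in rows:
        key = tuple(row.items())
        duplicates += key in seen
        seen.add(key)
    return {
        "missing_name": sum(1 for r in rows if not r["name"]),
        "duplicates": duplicates,
        "bad_rating": sum(1 for r in rows if not r["rating"].isdigit()),
    }


def clean(rows, extra):
    """Drop duplicates and unparseable ratings, fix types, merge into one table."""
    cleaned, seen = [], set()
    for row in rows:
        key = tuple(row.items())
        if key in seen or not row["rating"].isdigit():
            continue
        seen.add(key)
        row_id = int(row["id"])
        cleaned.append({
            "id": row_id,
            "name": row["name"] or None,          # empty string becomes None
            "rating": int(row["rating"]),         # string becomes integer
            "likes": extra.get(row_id, {}).get("likes"),
        })
    return cleaned


rows, extra = gather(CSV_TEXT, JSON_TEXT)
report = assess(rows)   # summary of issues found before cleaning
tidy = clean(rows, extra)  # one merged table, one row per observation
```

Running the assessment before cleaning keeps a record of which issues were found, so each cleaning rule can be traced back to a documented problem. In practice a library such as pandas would likely replace these hand-written helpers.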