Design a data warehouse on a snowflake schema by building data mappings; engineer and maintain data pipelines that import data daily from various sources in the OLTP system (SQL Server); extract, transform, and load (ETL) 20+ million rows into GCP BigQuery; and optimize SQL query execution speed.
Develop key metrics for business reporting; create storytelling dashboards in Tableau with complex calculated fields, parameters, and user filters to engage clients and raise the subscription rate. Write technical documentation that explains the development of new intelligence, and support training for 1,000+ clients.
Lead predictive analysis of motor repair prices using machine learning models (Random Forest, XGBoost, neural networks); improve model performance through data cleaning, missing-value imputation, dimensionality reduction, and feature engineering informed by domain knowledge from industry experts, using scikit-learn and Keras in Python.