Data Engineer
Data Scientist
Setting up the environment for the storage, processing and analysis of data.
Analyzing different platforms for the Data Lake: Jupyter, Zeppelin and RStudio.
EMR/EC2 instances were created for the Data Lake platforms.
Data Extraction / ETL
Technology Used:
Hadoop: Sqoop, Hive.
AWS: S3 bucket, Redshift, Elasticsearch, DynamoDB, Data Pipeline.
Spark: RDDs, DataFrames.
PuTTY, WinZip & VPN
Work Terms
As Needed - Open to Offers