Passionate about Problem-solving and Big Data Technologies. 4+ years of real-world working experience with technologies such as Hadoop, Spark, SQL, Python, Cloud Computing & Data warehousing.
Projects Involved:
1. KPI Dashboard Development: Developed 100+ metrics of different franchises for pharmaceutical clients by writing optimized SQL queries. Optimized the Spark configuration which led to increased efficiency of data pipelines that processed over 50 TB of data daily.
2. Migration of Workstreams: Rewriting the base code to meet the Target Layer standards. Handling multiple SQL and python files in Hadoop deployment, migrating to S3 and Amazon EMR. Conversion of all underlying jobs from Client to Cluster mode. Optimization of the process to make it more time and cost-efficient.
In love with Data World. Informationally One with the Universe!