Overall 4+ years of experience in Big Data/Hadoop and Spark development has a strong background with JAVA, Scala and Groovy. Understands the complex processing needs of big data and has experience developing codes and modules to address those needs. Worked on Big Data projects for Citi Bank and Deutsche Bank. Understanding the requirement from functional specification documents provided by the DQP team. Create Enterprise Java Spark application following industry standards with design pattern and proper logging. Created Hive tables to store the processed results in a tabular format. These tables have been stored on RDBMS. Responsible for developing scalable distributed data solutions using Spark, Java, Scala. Used Spark to create Dataset from JSON, CSV, Oracle, Netteza and create Schema from file. performed validation on file content using Spark RDD Api. perform various transformations on the the dataset using inbuilt dataset functions as well as Spark-SQL. used JDBC for querying database and get the resultSet as dataset in case of multiple and large table involvement instead of loading all/whole table in SparkSession. Done Project on RestFul Web Services using SparkJava Micro services and Grails framework.