Build the platform for extraction, transformation, and loading of data from several data sources using SQL and Hadoop technologies for structured and unstructured data. Using cloud services AWS EMR/Glue, S3, Redshift, Azure Data Factory, HDInsight/Databricks, Blob, SQL Server. Provide data sources support for software developers and data scientists in implementing application Create and optimize data pipeline architecture