Domains: Distributed Computing, Streaming Data Analysis, Legacy Batch System Analysis, Supervised Learning, Unsupervised Learning, Deep Learning Tech Stack: Apache Hadoop Eco-System, [Map-Reduce version 2, Yarn, Hive, Pig, HBase, Spark] Distribution: Open-Source, Databricks, Cloudera, Hortonworks, MapR. Language: Scala, Java (1.7, 1.8), Python, R, Go, MatLab