BigData, Hadoop, Spark and Machine Learning


Spark

  1. How to Configure Spark Application ( Scala and Java 8 Version with Maven ) in Eclipse.
  2. What is RDD in Spark ? and Why do we need it ?
  3. Dataframe Operations in Spark using Scala – Part I

Hadoop/MapReduce

  1. How-To : Setup Development Environment for Hadoop MapReduce
  2. Top 20 Hadoop MapReduce Interview Questions
  3. How to Use MultiThreadedMapper in MapReduce
  4. Top 20 Hadoop and Big Data Books
  5. Recommended Readings for Hadoop
  6. Top 15 HDFS Interview Questions
  7. How-To :Become a Hadoop Certified Developer ?

Kafka

  1. How-To : Integrate Kafka with HDFS using Camus (Twitter Stream Example)
  2. How-To : Write a Kafka Producer using Twitter Stream ( Twitter HBC Client)

Hive

  1. Hive : SORT BY vs ORDER BY vs DISTRIBUTE BY vs CLUSTER BY
  2. How-To : Connect HiveServer2 service with JDBC Client ?
  3. How-To : Configure MySQL Metastore for Hive ?
  4. Hive Strict Mode

HBase

  1. How to Write a CoProcessor in HBase

Pig

  1. Hadoop : Getting Started with Pig
  2. What is Apache HCatalog ?
  3. How-To : Use HCatalog with Pig

Java

  1. Hashmap Performance Improvements in Java 8
  2. Java : What does finalize do and How?
  3. String Interning-What ,Why and When ?
  4. Swagger Documentation for RESTful Webservices