Spark
- How to Configure a Spark Application (Scala and Java 8 Version with Maven) in Eclipse
- What Is an RDD in Spark, and Why Do We Need It?
- DataFrame Operations in Spark using Scala
- CatBoost – Scoring and Training in Spark
Hadoop/MapReduce
- How-To: Set Up a Development Environment for Hadoop MapReduce
- Top 20 Hadoop MapReduce Interview Questions
- How to Use MultiThreadedMapper in MapReduce
- Top 20 Hadoop and Big Data Books
- Recommended Readings for Hadoop
- Top 15 HDFS Interview Questions
- How-To: Become a Hadoop Certified Developer
Kafka
- How-To: Integrate Kafka with HDFS using Camus (Twitter Stream Example)
- How-To: Write a Kafka Producer using Twitter Stream (Twitter HBC Client)
Hive
- Hive: SORT BY vs ORDER BY vs DISTRIBUTE BY vs CLUSTER BY
- How-To: Connect to the HiveServer2 Service with a JDBC Client
- How-To: Configure a MySQL Metastore for Hive
- Hive Strict Mode
HBase
Pig
Java
- HashMap Performance Improvements in Java 8
- Java: What Does finalize Do, and How?
- String Interning: What, Why, and When?
- Swagger Documentation for RESTful Web Services