Skip to content

SaurzCode

  • Home
  • Java
  • Spark
  • Big Data
    • Hive
    • Kafka
    • HBase
  • Security
  • About Me
  • Useful Resources

SaurzCode

BigData, Java, Scala, Hadoop, Hive, Spark and Machine Learning Tutorial and How To Do

  • Home
  • Java
  • Spark
  • Big Data
    • Hive
    • Kafka
    • HBase
  • Security
  • About Me
  • Useful Resources

BigData, Hadoop, Spark and Machine Learning

Learn Big Data, Spark and Applied ML

Learn more

Spark

  1. How to Configure Spark Application ( Scala and Java 8 Version with Maven ) in Eclipse.
  2. What is RDD in Spark? and Why do we need it?
  3. Dataframe Operations in Spark using Scala
  4. CatBoost – Scoring and Training in Spark.

Hadoop/MapReduce

  1. How-To: Setup Development Environment for Hadoop MapReduce
  2. Top 20 Hadoop MapReduce Interview Questions
  3. How to Use MultiThreadedMapper in MapReduce
  4. Top 20 Hadoop and Big Data Books
  5. Recommended Readings for Hadoop
  6. Top 15 HDFS Interview Questions
  7. How-To: Become a Hadoop Certified Developer?

Kafka

  1. How-To: Integrate Kafka with HDFS using Camus (Twitter Stream Example)
  2. How-To: Write a Kafka Producer using Twitter Stream ( Twitter HBC Client)

Hive

  1. Hive: SORT BY vs ORDER BY vs DISTRIBUTE BY vs CLUSTER BY
  2. How-To: Connect HiveServer2 service with JDBC Client?
  3. How-To: Configure MySQL Metastore for Hive?
  4. Hive Strict Mode

HBase

  1. How to Write a CoProcessor in HBase

Pig

  1. Hadoop: Getting Started with Pig
  2. What is Apache HCatalog?
  3. How-To: Use HCatalog with Pig

Java

  1. Hashmap Performance Improvements in Java 8
  2. Java: What does finalize do and How?
  3. String Interning-What, Why, and When?
  4. Swagger Documentation for RESTful Webservices

Git

  • How to Split Git SubDirectory into Separate Repository?

Tags

apache apache-spark bg bigdata big data bigdata interview bigdata interview questions camus cloud collection ELK ELK Stack fast fg hadoop hadoop interview hashmap hbase hcatalog hdfs hdfs interview questions high availability hive hive interview interview java java8 Java Interview jobs kafka kafka-hdfs Learning Elk Stack linkedin mapreduce metastore PacktPub Pig process management rdd Saurabh Chhajed scala spark storm twitter unix

Pages

  • About Me
  • BigData, Hadoop, Spark and Machine Learning
  • Useful Resources

Subscribe to Blog via Email

Enter your email address to subscribe to this blog and receive notifications of new posts by email.

Join 324 other subscribers