Skip to content

SaurzCode

  • Home
  • Java
  • Spark
  • Big Data
    • Hive
    • Kafka
    • HBase
  • Security
  • About Me
  • Useful Resources

SaurzCode

BigData, Java, Scala, Hadoop, Hive, Spark and Machine Learning Tutorial and How To Do

  • Home
  • Java
  • Spark
  • Big Data
    • Hive
    • Kafka
    • HBase
  • Security
  • About Me
  • Useful Resources

BigData, Hadoop, Spark and Machine Learning

Learn Big Data, Spark and Applied ML

Learn more

Spark

  1. How to Configure Spark Application ( Scala and Java 8 Version with Maven ) in Eclipse.
  2. What is RDD in Spark? and Why do we need it?
  3. Dataframe Operations in Spark using Scala
  4. CatBoost – Scoring and Training in Spark.

Hadoop/MapReduce

  1. How-To: Setup Development Environment for Hadoop MapReduce
  2. Top 20 Hadoop MapReduce Interview Questions
  3. How to Use MultiThreadedMapper in MapReduce
  4. Top 20 Hadoop and Big Data Books
  5. Recommended Readings for Hadoop
  6. Top 15 HDFS Interview Questions
  7. How-To: Become a Hadoop Certified Developer?

Kafka

  1. How-To: Integrate Kafka with HDFS using Camus (Twitter Stream Example)
  2. How-To: Write a Kafka Producer using Twitter Stream ( Twitter HBC Client)

Hive

  1. Hive: SORT BY vs ORDER BY vs DISTRIBUTE BY vs CLUSTER BY
  2. How-To: Connect HiveServer2 service with JDBC Client?
  3. How-To: Configure MySQL Metastore for Hive?
  4. Hive Strict Mode

HBase

  1. How to Write a CoProcessor in HBase

Pig

  1. Hadoop: Getting Started with Pig
  2. What is Apache HCatalog?
  3. How-To: Use HCatalog with Pig

Java

  1. Hashmap Performance Improvements in Java 8
  2. Java: What does finalize do and How?
  3. String Interning-What, Why, and When?
  4. Swagger Documentation for RESTful Webservices

Git

  • How to Split Git SubDirectory into Separate Repository?

Tags

apache apache-spark bg bigdata big data camus cloud cloudera collection effective java ELK fg hadoop hadoop interview hashmap hbase hcatalog hdfs high availability hive hortonworks interview java java8 Java Interview jobs joshua bloch kafka kafka-hdfs linkedin mapereduce mapr mapreduce metastore PacktPub Pig process management Saurabh Chhajed scala spark spark interview storm tls twitter unix

Pages

  • About Me
  • BigData, Hadoop, Spark and Machine Learning
  • Useful Resources

Subscribe to Blog via Email

Enter your email address to subscribe to this blog and receive notifications of new posts by email.

Join 324 other subscribers