Category: Java

May 27, 2018

How to Use MultiThreadedMapper in MapReduce

In simple MapReduce Job each instance of Mapper.map() method is invoked by a single thread and key value pair are passed serially. MultithreadedMapper class is used instead of default Mapper when tasks are CPU bound...

Big Data / Java / Scala / Spark / Technology

October 15, 2017

How to Configure Spark Application ( Scala and Java 8 Version with Maven ) in Eclipse.

Apache Spark is becoming very popular among organizations looking to leverage its fast, in-memory computing capability for big-data processing. This article is for beginners to get started with Spark Setup on Eclipse/Scala IDE and getting...

Big Data / Java / Technology

December 13, 2015

My Book on ELK Stack : Learning ELK Stack

Learning ELK Stack I am writing this post to announce the general availability of my book on ELK stack titled ” Learning ELK Stack ” with PacktPub publications. Book aims to provide individuals/technologists who seek...

Big Data / Hive / Java

October 18, 2015

What is Apache HCatalog ?

What is HCatalog ? Apache HCatalog is a Storage Management Layer for Hadoop that helps to users of different data processing tools in Hadoop ecosystem like Hive, Pig and MapReduce easily read and write data...

Java

September 23, 2015

Hashmap Performance Improvements in Java 8

Problem Statement : Until Java 7, java.util.Hashmap implementations always suffered with the problem of Hash Collision, i.e. when multiple hashCode() values end up in the same bucket, values are placed in a Linked List implementation, which reduces...

Big Data / Java / Kafka / Technology

February 12, 2015

How-To : Integrate Kafka with HDFS using Camus (Twitter Stream Example)

Simple String Example for Setting up Camus for Kafka-HDFS Data Pipeline I came across Camus while building a Lambda Architecture framework recently. I couldn’t find a good Illustration of getting started with Kafk-HDFS pipeline ,...

Category: Java

How to Use MultiThreadedMapper in MapReduce

Like this:

How to Configure Spark Application ( Scala and Java 8 Version with Maven ) in Eclipse.

Like this:

My Book on ELK Stack : Learning ELK Stack

Like this:

What is Apache HCatalog ?

Like this:

Hashmap Performance Improvements in Java 8

Like this:

How-To : Integrate Kafka with HDFS using Camus (Twitter Stream Example)

Like this:

Category: Java

How to Use MultiThreadedMapper in MapReduce

Share this:

Like this:

How to Configure Spark Application ( Scala and Java 8 Version with Maven ) in Eclipse.

Share this:

Like this:

My Book on ELK Stack : Learning ELK Stack

Share this:

Like this:

What is Apache HCatalog ?

Share this:

Like this:

Hashmap Performance Improvements in Java 8

Share this:

Like this:

How-To : Integrate Kafka with HDFS using Camus (Twitter Stream Example)

Share this:

Like this: