Category: Technology

October 18, 2015

What is Apache HCatalog ?

What is HCatalog ? Apache HCatalog is a Storage Management Layer for Hadoop that helps to users of different data processing tools in Hadoop ecosystem like Hive, Pig and MapReduce easily read and write data...

Java

September 23, 2015

Hashmap Performance Improvements in Java 8

Problem Statement : Until Java 7, java.util.Hashmap implementations always suffered with the problem of Hash Collision, i.e. when multiple hashCode() values end up in the same bucket, values are placed in a Linked List implementation, which reduces...

Big Data / Technology

June 21, 2015

Unix Job Control Commands – bg, fg, Ctrl+Z,jobs

Since Hadoop jobs are often long running, its difficult for newbies to manage the processes in Unix unless they know some useful Unix commands to do so, so that they can increase their efficiency. In...

Big Data / Java / Kafka / Technology

February 12, 2015

How-To : Integrate Kafka with HDFS using Camus (Twitter Stream Example)

Simple String Example for Setting up Camus for Kafka-HDFS Data Pipeline I came across Camus while building a Lambda Architecture framework recently. I couldn’t find a good Illustration of getting started with Kafk-HDFS pipeline ,...

Big Data / Java / Kafka / Technology

February 12, 2015

How-To : Write a Kafka Producer using Twitter Stream ( Twitter HBC Client)

Twitter open-sourced its Hosebird client (hbc), a robust Java HTTP library for consuming Twitter’s Streaming API. In this post, I am going to present a demo of how we can use hbc to create a Kafka twitter stream...

Big Data / HBase / Technology

January 27, 2015

How-to : Write a CoProcessor in HBase

What is Coprocessor in HBase ? Coprocessor is a mechanism which helps to move computations closer to the data in HBase. It is like a Mapreduce framework to distribute tasks across the cluster. You can...

Category: Technology

What is Apache HCatalog ?

Like this:

Hashmap Performance Improvements in Java 8

Like this:

Unix Job Control Commands – bg, fg, Ctrl+Z,jobs

Like this:

How-To : Integrate Kafka with HDFS using Camus (Twitter Stream Example)

Like this:

How-To : Write a Kafka Producer using Twitter Stream ( Twitter HBC Client)

Like this:

How-to : Write a CoProcessor in HBase

Like this:

Category: Technology

What is Apache HCatalog ?

Share this:

Like this:

Hashmap Performance Improvements in Java 8

Share this:

Like this:

Unix Job Control Commands – bg, fg, Ctrl+Z,jobs

Share this:

Like this:

How-To : Integrate Kafka with HDFS using Camus (Twitter Stream Example)

Share this:

Like this:

How-To : Write a Kafka Producer using Twitter Stream ( Twitter HBC Client)

Share this:

Like this:

How-to : Write a CoProcessor in HBase

Share this:

Like this: