hadoop Archives - Page 2 of 3

January 27, 2015

Hive : SORT BY vs ORDER BY vs DISTRIBUTE BY vs CLUSTER BY

In Apache Hive HQL, you can order or sort your data in different ways depending on your ordering and distribution requirements. In this post we will look at how SORT BY, ORDER BY, DISTRIBUTE BY and...
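
As a rough illustration of the four clauses named above, here is a minimal HiveQL sketch. The table `sales` and its columns `region` and `amount` are hypothetical names chosen for this example and do not come from the original post.

```sql
-- Hypothetical table for illustration: sales(region STRING, amount DOUBLE)

-- ORDER BY: one total ordering of the whole result (all rows pass through a single reducer).
SELECT region, amount FROM sales ORDER BY amount;

-- SORT BY: rows are sorted within each reducer only; the overall output is not totally ordered.
SELECT region, amount FROM sales SORT BY amount;

-- DISTRIBUTE BY: rows with the same region go to the same reducer, with no ordering guarantee.
SELECT region, amount FROM sales DISTRIBUTE BY region;

-- DISTRIBUTE BY + SORT BY: group rows by region per reducer, then sort within each reducer.
SELECT region, amount FROM sales DISTRIBUTE BY region SORT BY region, amount;

-- CLUSTER BY: shorthand for DISTRIBUTE BY and SORT BY on the same column(s).
SELECT region, amount FROM sales CLUSTER BY region;
```

With more than one reducer, only the ORDER BY query is expected to return a globally sorted result; the others trade that guarantee for parallelism.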

Big Data / Java / Technology

January 23, 2015

How-To : Setup Development Environment for Hadoop MapReduce

This post is intended for readers looking for a quick start on developing a basic Hadoop MapReduce application. We will see how to set up a basic MR application for WordCount using...

Big Data / Hive / Java / Technology

January 12, 2015

How-To : Use HCatalog with Pig

Using HCatalog with Pig: This post is a step-by-step guide to running HCatalog and using it with Apache Pig. Assumptions: Pig and Hive are installed and tested with basic modes....

Big Data / Hive / Java

January 8, 2015

Hive Strict Mode

What is Hive Strict Mode? Hive Strict Mode (hive.mapred.mode=strict) lets Hive restrict certain performance-intensive operations. For example, it rejects queries on partitioned tables that have no WHERE clause filtering on a partition column. hive> set hive.mapred.mode=strict;...
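
A minimal sketch of the partition restriction described above. The table `page_views` and its partition column `view_date` are hypothetical names used only for illustration.

```sql
-- Hypothetical table for illustration, partitioned by view_date.
CREATE TABLE page_views (user_id STRING, url STRING)
PARTITIONED BY (view_date STRING);

SET hive.mapred.mode=strict;

-- Expected to be rejected in strict mode: no filter on the partition column,
-- so the query would have to scan every partition.
SELECT * FROM page_views;

-- Expected to be accepted: the WHERE clause restricts the scan to one partition.
SELECT * FROM page_views WHERE view_date = '2015-01-08';
```

Setting `hive.mapred.mode=nonstrict` in the same session lifts the restriction again.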

Big Data / Hive / Java / Technology

January 6, 2015

How-To : Configure MySQL Metastore for Hive ?

By default, Hive uses Derby for its metastore storage, which is suited only for testing; in most production scenarios it is recommended to use MySQL as the metastore. This is...

Big Data / Java / Technology

June 28, 2014

Hadoop : Getting Started with Pig

What is Apache Pig? Apache Pig is a high-level scripting language that is used with Apache Hadoop. It enables data analysts to write complex data transformations without knowing Java. Its simple SQL-like scripting language is called Pig...
