Category: Spark

May 2, 2021

How to Train and Score Catboost Model on Spark

About CatBoost Catboost (developed by Yandex) is one of the great open-source gradient boosting libraries with great performance without a lot of additional tuning. It provides support for categorical features without any need for encoding...

Big Data / Scala / Spark / Technology

September 14, 2019

Spark – How to Run Spark Applications on Windows

Whether you want to unit test your Spark Scala application using Scala Tests or want to run some Spark application on Windows, you need to perform a few basics settings and configurations before you do...

Big Data / Scala / Spark

September 14, 2019

What does Skipped Stage means in Spark WebUI ?

Skipped Stages in Spark UI You must have come across various scenarios where you see a DAG like below, where you see a few stages shows greyed out with a text (skipped) after the stage...

Big Data / Scala / Spark / Technology

June 7, 2018

Dataframe Operations in Spark using Scala

Dataframe in Apache Spark is a distributed collection of data, organized in the form of columns. Dataframes can be transformed into various forms using DSL operations defined in Dataframes API, and its various functions. In...

Big Data / Java / Scala / Spark / Technology

October 15, 2017

How to Configure Spark Application ( Scala and Java 8 Version with Maven ) in Eclipse.

Apache Spark is becoming very popular among organizations looking to leverage its fast, in-memory computing capability for big-data processing. This article is for beginners to get started with Spark Setup on Eclipse/Scala IDE and getting...

Big Data / Spark

October 24, 2015

What is RDD in Spark ? and Why do we need it ?

Resilient Distributed Datasets -RDDs in Spark Apcahe Spark has already taken over Hadoop (MapReduce) because of plenty of benefits it provides in terms of faster execution in iterative processing algorithms such as Machine learning. In...

Category: Spark

How to Train and Score Catboost Model on Spark

Like this:

Spark – How to Run Spark Applications on Windows

Like this:

What does Skipped Stage means in Spark WebUI ?

Like this:

Dataframe Operations in Spark using Scala

Like this:

How to Configure Spark Application ( Scala and Java 8 Version with Maven ) in Eclipse.

Like this:

What is RDD in Spark ? and Why do we need it ?

Like this:

Category: Spark

How to Train and Score Catboost Model on Spark

Share this:

Like this:

Spark – How to Run Spark Applications on Windows

Share this:

Like this:

What does Skipped Stage means in Spark WebUI ?

Share this:

Like this:

Dataframe Operations in Spark using Scala

Share this:

Like this:

How to Configure Spark Application ( Scala and Java 8 Version with Maven ) in Eclipse.

Share this:

Like this:

What is RDD in Spark ? and Why do we need it ?

Share this:

Like this: