Big Data And Spark

Big Data And Spark Training Provided by Revanth Technologies Training Institute in Hyderabad

Beginner 0(0 Ratings) 0 Students enrolled
Created by Revanth Technologies Training Institute staff Last updated Thu, 07-Apr-2022 English


Big Data And Spark free videos and free material uploaded by Revanth Technologies Training Institute staff .

Syllabus / What will i learn?

Introduction To Big Data And Spark

Learn how to apply data science techniques using parallel programming during Spark training, to explore big (and small) data

Introduction to Big Data

Challenges with Big Data

Batch Vs. Real Time Big Data Analytics

Batch Analytics – Hadoop Ecosystem Overview

Real Time Analytics Options

Streaming Data – Storm

In Memory Data – Spark

What is Spark?

Modes of Spark

Spark Installation Demo

Overview of Spark on a cluster

Spark Standalone Cluster

Spark Baby Steps

Learn how to invoke spark shell, build spark project with sbt, distributed persistence and much more…in this module

Invoking Spark Shell

Creating the Spark Context

Loading a File in Shell

Performing Some Basic Operations on Files in Spark Shell

Building a Spark Project with sbt

Running Spark Project with sbt

Caching Overview

Distributed Persistence

Spark Streaming Overview

Example: Streaming Word Count

Playing With RDDs In Spark

The main abstraction Spark provides is a resilient distributed dataset (RDD), which is a collection of elements partitioned across the nodes of the cluster that can be operated on in parallel

RDDs

Spark Transformations in RDD

Actions in RDD

Loading Data in RDD

Saving Data through RDD

Spark Key-Value Pair RDD

Map Reduce and Pair RDD Operations in Spark

Scala and Hadoop Integration Hands on

Shark - When Spark Meets Hive

Shark is a component of Spark, an open source, distributed and fault-tolerant, in-memory analytics system, that can be installed on the same cluster as Hadoop. This module of spark training, will give insights about Shark

Why Shark?

Installing Shark

Running Shark

Loading of Data

Hive Queries through Spark

Testing Tips in Scala

Performance Tuning Tips in Spark

Shared Variables: Broadcast Variables

Shared Variables: Accumulators



Curriculum for this course
0 Lessons 00:00:00 Hours
+ View more
Description
You need online training / explanation for this course?

1 to 1 Online Training contact instructor for demo :


+ View more

Other related courses
About the instructor
  • 0 Reviews
  • 1 Students
  • 160 Courses
Student feedback
0
Average rating
  • 0%
  • 0%
  • 0%
  • 0%
  • 0%
Reviews

Material price :

Free

1:1 Online Training Fee: 10000 /-
Contact instructor for demo :