UrbanPro

Big Data with Spark and Scala

LIVE
40 Hours

Course offered by Satendra Kumar

0 review

BIG DATA

Introduction to Big Data

  • Overview of Big Data technologies and their role in analytics
  • Big Data challenges & solutions
  • Data Science vs Data Engineering
  • The four V's of Big Data (volume, velocity, variety, veracity)

Unix & Java

  • Introduction to the UNIX shell
  • Basic UNIX commands: create, copy, move, delete, etc.
  • Basics of the Java programming language
  • Architecture: JVM, JRE, JIT
  • Control structures
  • OOP concepts in Java
  • String classes, arrays, and exception handling
  • Collection classes

Apache HDFS

  • Understanding the problem statement and the challenges of persisting such large data, to perceive the need for a distributed file system
  • Understanding how the HDFS architecture solves these problems
  • Understanding configuration and creating a directory structure to solve the given problem statement
  • Setting up appropriate permissions to secure data for the appropriate users
  • Setting up a Java development environment with the HDFS libraries to use the HDFS Java APIs

Apache MapReduce

  • What is MapReduce?
  • Input and output formats
  • Data types in MapReduce
  • Flow of MapReduce jobs
  • Word count in MapReduce
  • How to use custom input formats
  • Use case for structured data sets
  • Writing custom classes

Apache Hive

  • What is Hive?
  • Architecture of Hive
  • Tables in Hive with load functions
  • Query optimization
  • Partitioning and bucketing
  • Joins in Hive
  • Indexing in Hive
  • File formats in Hive
  • How to read JSON files in Hive

Apache Sqoop

  • What is Sqoop?
  • Relation between SQL & Hadoop
  • Performing a Sqoop import
  • Incremental and conditional imports
  • Performing a Sqoop export

Apache Pig

  • What are Pig & ETL?
  • Introduction to the Pig architecture
  • Introduction to Pig Latin
  • How to perform ETL on any kind of data (Pig eats everything)
  • Use cases of Pig
  • Joins in Pig
  • Co-grouping in Pig

Introduction to NoSQL Databases & Oozie

  • What is HBase?
  • Architecture of HBase
  • CRUD operations in HBase
  • Retrieval of HBase data
  • Introduction to Apache Oozie (scheduler tool)

Introduction to Programming in Scala

  • Basic data types and literals
  • Operators and methods used in Scala
  • Classes in Scala
  • Traits in Scala
  • Control structures in Scala
  • Collections in Scala
  • Libraries in Scala

Introduction to Spark

  • Limitations of MapReduce in Hadoop
  • Batch vs. real-time analytics
  • Applications of stream processing
  • Spark vs. the Hadoop ecosystem

Using RDD for Creating Applications in Spark

  • Features of RDDs
  • How to create RDDs
  • RDD operations and methods
  • Explain RDD functions and describe how to write the corresponding code in Scala

Running SQL Queries Using Spark SQL

  • Explain the importance and features of Spark SQL
  • Describe methods to convert RDDs to DataFrames
  • Explain concepts of Spark SQL
  • Describe the concept of Hive integration

Spark ML Programming

  • Explain the use cases and techniques of Machine Learning (ML)
  • Describe the key concepts of Spark ML
  • Explain the concepts of an ML Dataset, ML algorithms, and model selection via cross-validation

About the Trainer

Satendra Kumar

MCA

12 Years of Experience

Working IT professional
