What is Scala?
Why Scala for Spark?
Scala in other frameworks
Introduction to Scala REPL
Basic Scala operations
Variable Types in Scala
Control Structures in Scala
Foreach loop, Functions and Procedures
Collections in Scala- Array
ArrayBuffer, Map, Tuples, Lists, and more
Class in Scala
Getters and Setters
Custom Getters and Setters
Properties with only Getters
Auxiliary Constructor and Primary Constructor
Singletons
Extending a Class
Overriding Methods
Traits as Interfaces and Layered Traits
Programming
Higher Order Functions
Anonymous Functions, and more
What is Big Data?
Big Data Customer Scenarios
Limitations and Solutions of Existing Data Analytics Architecture with Uber Use Case
How Hadoop Solves the Big Data Problem
What is Hadoop?
Hadoop’s Key Characteristics
Hadoop Ecosystem and HDFS
Hadoop Core Components
Rack Awareness and Block Replication
HDFS Read/Write Mechanism
YARN and Its Advantage
Hadoop Cluster and Its Architecture
Hadoop: Different Cluster Modes
Data Loading using Sqoop
Big Data Analytics with Batch & Real-Time Processing
Why Spark is Needed?
What is Spark?
How Spark Differs from Its Competitors?
Spark at eBay
Spark’s Place in Hadoop Ecosystem
Spark Components & it’s Architecture
Running Programs on Scala IDE & Spark Shell
Spark Web UI
Configuring Spark Properties
Challenges in Existing Computing Methods
Probable Solution & How RDD Solves the Problem
What is RDD, It’s Functions, Transformations & Actions?
Data Loading and Saving Through RDDs
Key-Value Pair RDDs and Other Pair RDDs o RDD Lineage
RDD Persistence
WordCount Program Using RDD Concepts
RDD Partitioning & How It Helps Achieve Parallelization
Need for Spark SQL
What is Spark SQL?
Spark SQL Architecture
SQL Context in Spark SQL
Data Frames & Datasets
Interoperating with RDDs
JSON and Parquet File Formats
Loading Data through Different Sources
What is Machine Learning?
Where is Machine Learning Used?
Different Types of Machine Learning Techniques
Face Detection: USE CASE
Understanding MLlib
Features of Saprk MLlib and MLlib Tools
Various ML algorithms supported by Spark MLlib
K-Means Clustering & How It Works with MLlib
Analysis on US Election Data: K-Means Spark MLlib USE CASE
Need for Kafka
What is Kafka?
Core Concepts of Kafka
Kafka Architecture
Where is Kafka Used?
Understanding the Components of Kafka Cluster
Configuring Kafka Cluster
Producer and Consumer
Need of Apache Flume
What is Apache Flume
Basic Flume Architecture
Flume Sources
Flume Sinks
Flume Channels
Flume Configuration
Integrating Apache Flume and Apache Kafka
Drawbacks in Existing Computing Methods
Why Streaming is Necessary?
What is Spark Streaming?
Spark Streaming Features
Spark Streaming Workflow
How Uber Uses Streaming Data
Streaming Context & DStreams
Transformations on DStreams
WordCount Program using Spark Streaming
Describe Windowed Operators and Why it is Useful
Important Windowed Operators
Slice, Window and ReduceByWindow Operators
Stateful Operators
Perform Twitter Sentimental Analysis Using Spark Streaming
Big Data Applications for the Healthcare Industry with Apache Sqoop and Apache Solr
Created by team of both industry & academic experts having 20+ years of rich R&D experiance
In Online training, you will get
In Class room training, you will get
Top industry experts with rich 20+ years of R&D experience in mentoring students across the world.
Soft copy of the course material will be mailed to you.
In online instructor-led training, team of experts will train you with a group of our course learners for 25+ hours over online conferencing software like Zoom & Webminar. Online Classes will happen every day from Monday to Friday.
At the end, of course, you will work on a real-time project. Once you are done with the project (it will be reviewed by an expert), you will be awarded a certificate which you can share on LinkedIn.
Enrollment into course entails 30 days of free access to labs depending on date of enrollment. Can be extended based on permission.
Yes, you can renew your subscription anytime. Please choose your desired plan for the lab and make payment to renew your subscription
Mail our most dynamic & ever active director through email director@vaidehisoftware.com
Have more questions? Please contact us at director@vaidehisoftware.com