Live Class Practical Oriented Training
Timely Doubt Resolution
Dedicated Student Success Mentor
Certification & Job Assistance
Free Access to Workshop & Webinar
No Cost EMI Option
Fundamentals of Hadoop and YARN and write applications using them
Spark, Spark SQL, Streaming, Data Frame, RDD, GraphX and MLlib writing Spark applications
Set up different configurations of Hadoop cluster
Leverage Pig, Hive, Hbase, ZooKeeper, Sqoop, Flume, and other projects from the Apache Hadoop ecosystem
Practicing real-life projects using Hadoop and Apache Spark
Setting up pseudo-node and multi-node clusters on Amazon EC2
Hadoop administration activities like cluster managing, monitoring, administration and troubleshooting
Maintain and monitor Hadoop cluster by considering the optimal hardware and networking settings
Hadoop testing applications using MRUnit and other automation tools
Lecture-1 Introduction to Apache Hadoop and the Hadoop Ecosystem
Lecture-2 Apache Hadoop File Storage
Lecture-3 Distributed Processing on an Apache Hadoop Cluster
Lecture-4 Apache Spark Basics
Lecture-5 Working with DataFrames and Schemas
Lecture-6 Analyzing Data with DataFrame Queries
Lecture-7 RDD Overview
Lecture-8 Transforming Data with RDDS
Lecture-9 Aggregating Data with Pair RDDS
Lecture-10 Querying Tables and Views with SQL
Lecture-11 Working with Datasets in Scala
Lecture-12 Writing, Configuring, and Running Spark Applications
Lecture-13 Spark Distributed Processing
Lecture-14 Distributed Data Persistence
Lecture-15 Common Patterns in Spark Data Processing
Lecture-16 Introduction to Structured Streaming
Lecture-17 Structured Streaming with Apache Kafka
Lecture-18 Aggregating and Joining Streaming DataFrames
Lecture-19 Message Processing with Apache Kafka