Login

Register

Create an Account
Retrieve Password
Back to login/register

REGISTER FOR A FREE WEBINAR - OPPORTUNITIES IN BIG DATA ANALYTICS

BIG DATA & HADOOP

PREPARE YOURSELF FOR THE MOST SOUGHT AFTER CAREER OF THE 21ST CENTURY!

BIG DATA & HADOOP

Instructors With 20+ Years Of Combined Experience In Data Science & Big Data Analytics

Instructors Have Personally Trained 1,600+ Participants Across India

Participate From Your PC/Mobile With HD Video Live Streaming

Fully Interactive With Two-Way Live Communication

Access Recorded Classes For Easy Reference

Access To Instructor For 1 Month After Course Completion

REGISTER FOR A FREE WEBINAR - OPPORTUNITIES IN BIG DATA ANALYTICS

The Big Data Hadoop Certification Course from MyAbhyaas Academy has been designed to boost the participants’ career in the Big Data Analytics – one of the most sought after career opportunities in the 21st century. With ever-growing generation of data and need for effective interpretation of data – the demand for professionals with cutting edge knowledge on Hadoop is growing.

WHY BIG DATA COURSE FROM MYABHYAAS?

1. Everyday we are generating 2.5 quintillion bytes of data (source: IBM); businesses needs to analyze this data to develop intelligence and insights to take informed business decisions
2. As Hadoop offers software framework for storing and processing Big Data, the global Hadoop market is set to reach $84.6 billion by 2021 (Source: Allied Market Research)
3. There will be shortage of 1.5 million data experts by 2018 (Source: McKinsey)
4. Our program, which is developed by instructors with 20+ years of combined experience in Data Science field, will position you to benefit from the growing demand for Hadoop professionals.
5. The course provides exposure to high quality content, work activities and real-life projects, enabling best in class learning and exposure.

MyAbhyaas has specifically designed this course to make participants learn the practical aspects of using Hadoop in Big Data analytics. During this course, participants will learn about opportunities and challenges in Big Data, Hadoop ecosystem, HDFS and YARN architecture, MapReduce, Pig, Hive, Hadoop streaming using Python and basics of Apache Spark. The participants will work extensively on projects and assignments enabling them to prepare for real life scenarios.

Certification with grade will be issued based on the submitted assignments as well as performance in the final test.

KEY LEARNING FROM THIS PROGRAM

After successful completion of this course, the participants would be well versed in:

1. Conceptual understanding of the entire process of Big Data Analytics and Hadoop

2. Using Using HDFS, YARN, MapReduce, Hive, Pig for effective data analytics

3. Hadoop administration activities such as cluster monitoring, managing and troubleshooting

4. Practice real-life projects using Hadoop and Apache Spark

5. Business application of Big Data Analytics

WHO SHOULD REGISTER FOR THIS COURSE?

The course is suitable for beginners as well as experienced professionals who want to make career in ever growing Big Data analytics. While knowledge of Core Java and SQL will be beneficial, they are not necessary to master this program. To benefit from this program, all you need is analytical bent of mind.

Following set of people will find it useful:

1. IT & Software professionals looking to switch to big-data analytics

2. Professionals in data analytics and business analytics job

3. Engineers and MBAs looking to build career in big-data analytics

4. Statisticians and Business Analysts

5. Project Managers

6. Anyone with a genuine interest in the data science field

GET IN TOUCH TO KNOW MORE

Call Us: +91-90040-89006

BIG DATA & HADOOP COURSE BENEFITS

LEADING INSTRUCTORS

Learn from Instructors with 20+ years of combined experience in Big Data Analytics & Business Intelligence; have trained 1,600+ participants!

40+ TOPICS

Specially designed to provide the requisite conceptual knowledge, tools, and skills to become an effective Big Data analytics professional; focus on developing critical thinking abilities

LIVE INTERACTIVE CLASSES

Participate in 40 hours of instructor-led live training; fully interactive with two-way live communication; Designated forums to facilitate interaction and knowledge sharing among participants!

LEARN FROM ANYWHERE

Participate in this program from your PC/mobile with HD video live streaming; life time access to recorded classes for easy reference!

LEARN WITH ASSIGNMENTS

Online quiz and simulation activities to reinforce practical application of learned concepts; explanatory answers!

REAL LIFE APPLICATION

Focus on practical applications and hands-on training on Big Data analytics project; many industry specific projects during the training and 2 additional projects for practice

1-MONTH ACCESS

Access to the instructors for 1 month even after completing the course for guidance/support

CERTIFICATION

Certificate on completion with grades based on the performance in assignments and projects!

COURSE CURRICULUM

MODULE 1: INTRODUCTION TO BIG DATA (2 HOURS)

Characteristics of Big Data

Challenges for Big Data

Popular Tools Used to Store, Process, Analyze & Visualize Big Data

Use Cases for Big Data

ASSIGNMENT 01: Online simulation activity

MODULE 2: HADOOP ECO-SYSTEM & ARCHITECTURE (4 HOURS)

What is Hadoop?

Hadoop’s Key Characteristics

Hadoop Eco-system & Core Components

Where Hadoop Fits?

Traditional vs. Hadoop’s Data Analytics Architecture

When to Use & Not Use Hadoop?

Apache Hadoop & Distributions

Hadoop Job Trendss

ASSIGNMENT 02: Online simulation activity

MODULE 3: HDFS ARCHITECTURE (2 HOURS)

Introduction to Hadoop Distributed File System

HDFS Architecture and Features

Files and Data Blocks

Anatomy of a File Read/ Write on HDFS

Replication & Rack Awareness

ASSIGNMENT 03: Online simulation activity

MODULE 4: YARN ARCHITECTURE (2 HOURS)

YARN Architecture

Classic vs. YARN

YARN Daemons

Containers

Speculative Execution

HDFS Federation

Authentication & High Availability

ASSIGNMENT 04: Online simulation activity

MODULE 5: HADOOP SETUP (2 HOURS)

Hadoop Deployment Modes

Setting up a Pseudo-distributed Cluster

Hortonworks Sandbox Installation & Configuration

Linux Terminal Commands

Configuration Parameters and Values

ASSIGNMENT 05: Online simulation activity

MODULE 6: HADOOP SETUP PART 2 (2 HOURS)

HDFS File System Operations

Working with Hadoop Services using Ambari

HDFS, MapReduce and YARN Parameters

Hadoop Web UI

Filesystem & Linux Commands

ASSIGNMENT 06: Online simulation activity

MODULE 7: MAPREDUCE BASICS (2 HOURS)

What is MapReduce?

MapReduce Framework, Architecture and Use Cases

Input Splits

Hands on with MapReduce Programming

Packaging MapReduce Jobs in a JAR

ASSIGNMENT 07: Online simulation activity

MODULE 8: MAPREDUCE ADVANCED (2 HOURS)

Setting Mapper & Reducer Counts

Combiners

Partitioners & Custom Partitioners

Input & Output Formats

Sequence Files & Compressions

Distributed Cache

Map Side Join & Reduce Side Join

ASSIGNMENT 08: Online simulation activity

MODULE 9: USING PIG (6 HOURS)

Background

Pig Architecture

Pig Latin Basics

Pig Execution Modes

Pig Processing – Loading and Transforming Data

Pig Built-in Functions

Filtering, Grouping, Sorting Data

Relational Join Operators

Pig User Defined Functions

ASSIGNMENT 09: Online simulation activity

MODULE 10: USING HIVE (4 HOURS)

Background of Hive

Hive Architecture

Warehouse Directory & Metastore

Hive Query Language

Managed & External Tables

Data Processing – Loading Data into Tables

Using Hive Built-in Functions

Using Joins in Hive

Partitioning Data using Hive – Static & Dynamic

Bucketing in Hive

ASSIGNMENT 10: Online simulation activity

MODULE 11: HADOOP STREAMING USING PYTHON (4 HOURS)

Hadoop Streaming Concepts

Hadoop Streaming using Python

Demo: Writing Python Scripts for Streaming

Testing Python Scripts

Executing YARN Jar on Python Script

ASSIGNMENT 11: Online simulation activity

MODULE 12: BASICS OF APACHE SPARK (4 HOURS)

What is Apache Spark?

Using the Spark Shell

RDDs (Resilient Distributed Datasets)

Functional Programming in Spark

A Closer Look at RDDs

Key-Value Pair RDDs

Other Pair RDD Operations

Quiz

ASSIGNMENT 12: Online simulation activity

MODULE 13: RDDS IN SPARK – I (2 HOURS)

RDD Lineage

Caching Overview

Distributed Persistence

Storage Levels of RDD Persistence

Common Spark Use Cases

Iterative Algorithms in Spark

ASSIGNMENT 13: Online simulation activity

MODULE 14: RDDS IN SPARK – II (2 HOURS)

Machine Learning

Example: k-means

Quiz

Spark SQL and the SQL Context

Creating DataFrames

Transforming and Querying DataFrames

ASSIGNMENT 14: Online simulation activity

CAPSTONE PROJECTS

Successful execution of one of the following real-life, industry-based projects – where you will be using PIG, HIVE, HBase and MapReduce to perform Big Data analytics – is mandatory for certificate eligibility.

Project 1: Social Media Analytics
Project 2: Financial Services Data Analytics
Project 3: Ecommerce / Retail Data Analytics
Project 4: Aviation Sector Data Analytics
Sector 5: Media Sector Data Analytics

PARTICIPANTS' FEEDBACK

Harshita Tanwar

Data analytics course from MyAbhyaas is helping me to understand the concepts and its practical application. The guidance provided by the instructor is also helping me to understand key skills required to do better in this field and then acquire those skills!

Data Analyst, Wipro

2017-02-09T18:22:49+00:00

Data Analyst, Wipro

Data analytics course from MyAbhyaas is helping me to understand the concepts and its practical application. The guidance provided by the instructor is also helping me to understand key skills required to do better in this field and then acquire those skills!

Himanshu Chaubal

The Data Analytics with R-programming course by MyAbhyaas is an excellent course! It gave me a clear understanding of all the concepts and tools for business analytics needed to evaluate market trends for different global industries effectively.

Business Research & Advisory Professional, Mumbai

2017-02-10T10:26:57+00:00

Business Research & Advisory Professional, Mumbai

The Data Analytics with R-programming course by MyAbhyaas is an excellent course! It gave me a clear understanding of all the concepts and tools for business analytics needed to evaluate market trends for different global industries effectively.

Contact Us

Phone: +91-9004089006
Email: info@myabhyaas.com


Follow Us!

Learn on the Go!



top
© 2016-17 MyAbhyaas Education Services Private Limited.
MyAbhyaas Newsletter
Join over 10,000 fellow professionals who receive free email updates!

SUBSCRIBE 
We hate spam as much as you! Unsubscribe anytime!
close-link