Course Details | Cloudthat

Course Overview

In this course, you’ll learn about cloud-based big data solutions like Amazon EMR, Amazon Redshift, Amazon Kinesis, and the rest of the AWS big data platform. Learn to use Amazon EMR to process data using the broad ecosystem of Hadoop tools like Hive and Hue, create big data environments, work with Amazon DynamoDB, Amazon Redshift, Amazon QuickSight, Amazon Athena and Amazon Kinesis, and design big data environments for security and cost-effectiveness. The course comprises presentations, group exercises, and hands-on labs.

After completing this course, students will be able to:

  • Use Apache Hadoop with Amazon EMR
  • Launch and configure an Amazon EMR cluster
  • Use common programming frameworks for Amazon EMR, including Hive, Pig, and Streaming
  • Use Hue to improve the ease-of-use of Amazon EMR
  • Use in-memory analytics with Spark on Amazon EMR
  • Understand how services like AWS Glue, Amazon Kinesis, Amazon Redshift, Amazon Athena, and Amazon QuickSight can be used with big data workloads

Upcoming Batches

India Online Enroll

To be Decided

Key Features

  • Our training modules have 50% - 60% hands-on lab sessions to encourage Thinking-Based Learning (TBL)
  • Interactive-rich virtual and face-to-face classroom teaching to inculcate Problem-Based Learning (PBL)
  • AWS certified instructor-led training and mentoring sessions to develop Competency-Based Learning (CBL)
  • Well-structured use-cases to simulate challenges encountered in a Real-World environment
  • Being an authorized AWS Training Partner gives us an edge over competition

Who Should Attend

  • Individuals responsible for designing and implementing big data solutions, namely Solutions Architects and SysOps Administrators
  • Data Scientists and Data Analysts interested in learning about big data solutions on AWS

Prerequisites

We recommend that attendees of this course have:

  • Basic familiarity with big data technologies, including Apache Hadoop, HDFS, and SQL/NoSQL querying · Completed Data Analytics Fundamentals free digital training or equivalent experience
  • Working knowledge of core AWS services and public cloud implementation
  • Completed the AWS Technical Essentials classroom training or have equivalent experience
  • Basic understanding of data warehousing, relational database systems, and database design

Course Outline Download Course Outline

Day 1

Module 1: Overview of Big Data

  • What is big data
  • The big data pipeline
  • Big data architectural principals

Module 2: Big Data ingestion and transfer

  • Overview: Data ingestion
  • Transferring data

Module 3: Big data streaming and Amazon Kinesis

  • Stream processing of big data
  • Amazon Kinesis
  • Amazon Kinesis Data Firehose
  • Amazon Kinesis Video Streams
  • Amazon Kinesis Data Analytics
  • Hands-on lab 1: Streaming and Processing Apache Server Logs Using Amazon Kinesis

Module 4: Big data storage solutions

  • AWS data storage options
  • Storage solutions concepts
  • Factors in choosing a data store Module 5: Big data processing and analytics
  • Big data processing and analytics
  • Amazon Athena
  • Hands-on lab 2: Using Amazon Athena to Analyze Log Data

Day 2

Module 6: Apache Hadoop and Amazon EMR

  • Introduction to Amazon EMR and Apache Hadoop
  • Best practices for ingesting data
  • Amazon EMR
  • Amazon EMR architecture
  • Hands-on lab 3: Storing and Querying Data on Amazon DynamoDB

Module 7: Using Amazon EMR

  • Developing and running your application
  • Launching your cluster
  • Handling output from your completed jobs

Module 8: Hadoop programming frameworks 

  • Hadoop frameworks
  • Other frameworks for use on Amazon EMR
  • Hands-on lab 4: Processing Server Logs with Hive on Amazon EMR

Module 9: Web interfaces on Amazon EMR 

  • Hue on Amazon EMR
  • Monitoring your cluster
  • Hands-on lab 5: Running Pig Scripts in Hue on Amazon EMR

Module 10: Apache Spark on Amazon EMR 

  • Apache Spark
  • Using Spark
  • Hands-on lab 6: Processing NY Taxi Data Using Apache Spark

Day 3

Module 11: Using AWS Glue to automate ETL workloads

  • What is AWS Glue?
  • AWS Glue: Job orchestration

Module 12: Amazon Redshift and big data

  • Data warehouses vs. traditional databases
  • Amazon Redshift
  • Amazon Redshift architecture

Module 13: Securing your Amazon deployments

  • Securing your Amazon deployments
  • Amazon EMR security overview
  • AWS Identity and Access Management (IAM) overview
  • Securing data
  • Amazon Kinesis security overview
  • Amazon DynamoDB security overview
  • Amazon Redshift security overview

Module 14: Managing big data costs

  • Total cost considerations for Amazon EMR
  • Amazon EC2 pricing models
  • Amazon Kinesis pricing models
  • Cost considerations for Amazon DynamoDB
  • Cost considerations and pricing models for Amazon Redshift
  • Optimizing cost with AWS

Module 15: Visualizing and orchestrating big data

  • Visualizing big data
  • Amazon QuickSight
  • Orchestrating a big data workflow
  • Hands-on lab 7: Using TIBCO Spotfire to visualize data

Module 16: Big data design patterns

  • Common architectures

Module 17: Course wrap-up

  • What’s next?

Certification

    • By earning Big Data on AWS certification, you will show your future or current employer that you have knowledge of AWS Cloud concepts.
    • Big Data on AWS certification can be used to learn cloud-based big-data solutions on AWS platforms.
    • On successful completion of Big Data on AWS certification training aspirants receive a Course Completion Certificate from us
    • By successfully clearing the Big Data on AWS certification exams, aspirants earn AWS Certification

Our Top Trainers

Pavan Bhawsar

Pavan is a Microsoft Certified Trainer at CloudThat. He is an enthusiastic and passionate trainer, empathic observer towards the trending technologies with demonstrated skill in Azure and hybrid Cloud Administration. He has 6+ years of corporate experience, etc.

Vivek Kumar

Vivek has been involved in various large and complex projects with global clients. He has experience in AWS, GCP and Azure Cloud Platforms. He has experience in various software development fields like Image Processing, Web designing, Networking etc.

Jagadesh Gonnagar

He has been a part of several large and complex software development projects with global clients. He has worked in USA for over 11 years before relocating to India. He has expertise in Database design/development, Web development, etc.

Haris AK

Haris works as Cloud Solutions Architect in CloudThat technologies, being passionate about ever evolving technology. He is specialist on Docker, Kubernetes, Ansible, Git/Jenkins, Terraform and other DevOps Technologies. Haris Architects’ solutions on Cloud as well on-Premises using wide etc.

Devi Vara Prasad

A Microsoft Certified Trainer with more than 15+ years of Corporate, Online and Classroom Training Experience, well versed in AWS an Azure Cloud platform and have been delivering trainings for more than 5years. Also has a vast etc.

Ajay Kumar Lodha

Ajay is cloud obsessed and cloud addict, that's how he describes himself. Ajay has been working with all the major cloud computing platforms like AWS, Azure, and GCP for more than 5 years now. He is into etc.

Guruprasad Srinivasrao Venugopal

He is a Cloud enthusiast with demonstrated skills in Azure, AWS Hybrid-Cloud administration, Linux and DevOps. Alongside working on Azure Cloud deployment, administration and implementation, he is also engaged in planning, designing and executing various nice technology etc.

Prarthit Mehta

Prarthit has been involved in various large and complex projects with global clients. He has experience in Microsoft, MS office 365 & AWS Infrastructure technologies and Windows servers, designing Active Directory and managing various domain services. He etc.

Priyant Gupta

Priyant is working in the Microsoft Technology space for last 5 years and is sharing the knowledge gained on Azure Administration, Azure Data Engineering and Dynamics 365 CE Apps. He has trained 1000+ professionals as a corporate trainer at etc.

Lakhan Kriplani

He had involved in various client projects to set up infrastructure on Cloud for various Analytics applications, E-Commerce, setup CICD Pipeline using AWS services. He has experience in developing highly secure, scalable web applications using MVC architecture. etc.

Course Fee

Select Course date

Add to Wishlist

Course Price at

₹ 39900 + 18% GST

Enroll Now