Big Data Analytics | Youth Employment Program

Course Introduction

Getting intellectuals ready to become Big Data Experts!

This seven-day hands-on training, led by industry experts, aims to open up advanced career opportunities for attendees as SQL developers, data analysts, business intelligence specialists, developers, system architects, and database administrators. Attendees will get extensive hands-on practice with advanced Big Data tools and technologies such as Hadoop, Cloudera, Hive, and Sqoop.

This course will teach students:

Use Extract, Transform, Load (ETL) processes to move data from a MySQL database into HDFS using Sqoop.
Use Data Definition Language (DDL) statements to create or alter structures in the metastore for use by Hive and Impala.
Use query language statements in Hive and Impala to analyze data on a cluster.
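
As a rough illustration of the three skills listed above, the following Python sketch uses the standard-library sqlite3 module as a stand-in for both the MySQL source and the Hive/Impala warehouse. It is not Sqoop, Hive, or Impala code. The table names (orders, orders_warehouse), the columns, and the sample rows are hypothetical and chosen only to show the extract, DDL, load, and query steps in order.

```python
import sqlite3

# Stand-in for the source MySQL database (hypothetical table and rows).
source = sqlite3.connect(":memory:")
source.execute("CREATE TABLE orders (id INTEGER, city TEXT, amount REAL)")
source.executemany(
    "INSERT INTO orders VALUES (?, ?, ?)",
    [(1, "Peshawar", 120.0), (2, "Mardan", 80.0), (3, "Peshawar", 50.0)],
)

# Extract: read every row from the source table.
rows = source.execute("SELECT id, city, amount FROM orders").fetchall()

# Stand-in for the warehouse that Hive or Impala would query.
warehouse = sqlite3.connect(":memory:")

# DDL: define the target structure before loading any data.
warehouse.execute(
    "CREATE TABLE orders_warehouse (id INTEGER, city TEXT, amount REAL)"
)

# Load: copy the extracted rows into the target table.
warehouse.executemany("INSERT INTO orders_warehouse VALUES (?, ?, ?)", rows)

# Query: total order amount per city.
totals = warehouse.execute(
    "SELECT city, SUM(amount) FROM orders_warehouse GROUP BY city ORDER BY city"
).fetchall()
print(totals)
```

In the course setting, Sqoop would handle the extract and load steps against HDFS, and the DDL and query statements would be written for Hive or Impala rather than SQLite.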

Course Audience

This course is designed for:

Career newcomers, recent graduates, and third-year and final-year students from the Computer Science, IT, and Software Engineering disciplines.
Professionals from the computer science domain who want to move into Big Data, for example Business Intelligence experts, Data Scientists, and Data Analysts.
Executives who want to build initial knowledge of the impact of the Big Data ecosystem on their organizational growth.

Course Schedule

Syllabus - What you will learn from this course

Lesson 1

Introduction to big data

Introduction to Analytics & Architecture
What is High Performance Computing
What is streaming data
What is visualization
What is Big Data
Your first Big Data application on AWS

Lesson 2

Introduction to data analysis, Storage & processing solutions

Data analytics and data analysis concepts
Introduction to the challenges of data analytics
Introduction to Amazon S3
Introduction to data lakes

Lesson 3

Storage & processing solutions

Introduction to data storage methods
Introduction to data processing methods
Introduction to batch data processing
Introduction to stream data processing

Lesson 4

Data structure and types

Introduction to source data storage
Introduction to structured data stores
Introduction to semi-structured and unstructured data stores
Understanding data integrity
Understanding database consistency
Introduction to ETL process
Introduction to analyzing data
Introduction to visualizing data

Lesson 5

Big Data analytics & architecture

Big Data Analytics on Amazon Web Services (AWS)
Introduction to Amazon EMR
Getting started with real-time data analytics on AWS
Getting started with real-time streaming data in under 5 minutes
Big Data on AWS – structured, unstructured, and streaming
Evolving your Big Data use cases from batch to real-time
Building Big Data solutions with Amazon EMR and Amazon Redshift
Defining Big Data
Example Big Data Stacks
Big Data Framework | Hadoop Tutorial for Beginners
Big Data Architectural Patterns and Best Practices on AWS
Architectural Patterns for Big Data on AWS

Lesson 6

Big Data, HPC & Streaming

High Performance Computing (HPC) with Amazon Web Services
High Performance Computing in the Cloud with AWS and Cycle Computing
Large Scale Processing and Huge Data sets
What is a Data Stream?
What is Streaming Data?
What Is Amazon Kinesis Data Streams?
Perform Basic Stream Operations
Creating a Stream

Lesson 7

Software development & Platform Technologies

Architecture
DevOps
Programming Languages
Scripting Languages
Mobile Applications
Web Development
Software architecture
Software development processes and methodologies: definition
Software architecture and design
Introduction to programming
Overview of main programming languages
Introduction to C, C#, C++, .NET, Java, Python and others
Introduction to operating systems and virtualization
How is virtualization used in the cloud?

Trainers

Our Trainers

Bikram Adhikari

Mr. Bikram is an experienced software engineer with a demonstrated history of working in the computer software industry. He is skilled in PHP, Java, HTML, C#, and Web Applications. A…

Anjani Phuya

Mr. Anjani is on a mission to provide end-to-end product engineering and digital transformation services to companies and startups across Europe and Asia by…

FAQs

Frequently Asked Questions

What will be the duration and type of training?

The training will be instructor-led. It will run for seven full days (10 AM to 6 PM).

Do I need any prior knowledge to attend this training?

Attendees don’t need any prior knowledge of, or hands-on experience with, Big Data tools and platforms for this course. However, they should have basic knowledge of programming, databases, and SQL.

What is the cost of the training?

For the betterment of KP youth, this digital skills training is offered by KPITB free of cost. The cost of the CCA certification is not covered by the training. Trainees and KPITB will set the modalities for reimbursement of the certification fee.

What will be the training location?

Training is planned at multiple locations across Khyber Pakhtunkhwa. Details of the training venues will be communicated at the time of registration.

Who should attend this training?

Recent graduates, and third-year and final-year students from the computer science disciplines.
Professionals from the computer science domain who want to move into Big Data Analytics.
Executives who want to build initial knowledge of the impact of the Big Data ecosystem on organizational growth.

What will be the takeaways of the training?

This course presents the tools data professionals need to access, manipulate, transform, and analyze complex data sets using SQL and familiar scripting languages.
A SQL developer who completes this training will have the core competencies required to pull data and generate reports in Cloudera’s CDH environment using Impala and Hive.

Can I get a job after this training?

After this training, you will be eligible for the international CCA certification, which opens up opportunities at both the national and international level.

How much hands-on work will there be in this training?

Because our courses are led by industry experts, the course content is designed to be more than 70-75% hands-on, along with supporting theory.

Will I get a certification after this course?

Yes, you will be awarded a course completion certificate by Dice Analytics. You will also be eligible for the international Cloudera Certified Associate (CCA) certification.

Who are the instructors?

Ali Raza Anjum
Nauman Mir
Sufyan Nasir
Muhammad Muneeb
Nabeel Waqar Wyne