A Big Data Hadoop and Spark project for absolute beginners
A Big Data Hadoop and Spark project for absolute beginners, Udemy Coupon
Hadoop, Spark, Python,PySpark, Scala, Dataproc, AWS S3 Data Lake, Glue, Athena
Preview this Course GET COUPON CODE
Description
A bank is launching a new credit card and wants to identify prospects it can target in its marketing campaign.
It has received prospect data from various internal and 3rd party sources. The data has various issues such as missing or unknown values in certain fields.The data needs to be cleansed before any kind of analysis can be done.
Since the data is in huge volume with billions of records, the bank has asked you to use Big Data Hadoop and Spark technology to cleanse, transform and analyze this data.
What you will learn :
Big Data, Hadoop concepts
How to create a free Hadoop and Spark cluster using Google Dataproc
Hadoop hands-on - HDFS, Hive
Python basics
PySpark RDD - hands-on
PySpark SQL, DataFrame - hands-on
Project work using PySpark and Hive
Scala basics Spark Scala DataFrame
Project working using Spark Scala
Google Colab environment
Bonus project - Applying spark transformation on data stored in AWS S3 using Glue and viewing data using Athena
Prerequisites :
Some basic programming skills
Some knowledge of SQL queries
100% Off Udemy Coupon . Free Udemy Courses . Online Classes
Hadoop, Spark, Python,PySpark, Scala, Dataproc, AWS S3 Data Lake, Glue, Athena
- Created by FutureX Skill
- English
- Big Data Hadoop and Spark with Scala
- CCA 175 -Spark Developer Exam Preparation + Practice Tests
- Hive in Depth Training and Interview Preparation course
- Cloudera CCA 175 Spark Developer Certification: Hadoop Based
- From 0 to 1: Hive for Processing Big Data
Preview this Course GET COUPON CODE
Description
A bank is launching a new credit card and wants to identify prospects it can target in its marketing campaign.
It has received prospect data from various internal and 3rd party sources. The data has various issues such as missing or unknown values in certain fields.The data needs to be cleansed before any kind of analysis can be done.
Since the data is in huge volume with billions of records, the bank has asked you to use Big Data Hadoop and Spark technology to cleanse, transform and analyze this data.
What you will learn :
Big Data, Hadoop concepts
How to create a free Hadoop and Spark cluster using Google Dataproc
Hadoop hands-on - HDFS, Hive
Why there was a need for Spark
Python basics
PySpark RDD - hands-on
PySpark SQL, DataFrame - hands-on
Project work using PySpark and Hive
Scala basics Spark Scala DataFrame
Project working using Spark Scala
Google Colab environment
Bonus project - Applying spark transformation on data stored in AWS S3 using Glue and viewing data using Athena
Prerequisites :
Some basic programming skills
Some knowledge of SQL queries
100% Off Udemy Coupon . Free Udemy Courses . Online Classes