Skip to content

👷🌇 Set up and build a big data processing pipeline with Apache Spark, 📦 AWS services (S3, EMR, EC2, IAM, VPC, Redshift) and Terraform to setup the infrastructure🥊

License

Notifications You must be signed in to change notification settings

longNguyen010203/Spark-Processing-AWS

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

37 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

👷 Spark-Processing-AWS

In this project, I set up and build a big data processing pipeline using Apache Spark integrated with various AWS services, including S3, EMR, EC2, VPC, IAM, and Redshift and Terraform to setup the infrastructure

🔦 About Project

📦 Technologies

  • S3
  • EMR
  • EC2
  • Airflow
  • Redshift
  • Terraform
  • Spark
  • VPC
  • IAM

About

👷🌇 Set up and build a big data processing pipeline with Apache Spark, 📦 AWS services (S3, EMR, EC2, IAM, VPC, Redshift) and Terraform to setup the infrastructure🥊

Topics

Resources

License

Stars

Watchers

Forks