data-engineering-pipeline

💜🌈📊 A data engineering project that implements an ETL data pipeline using Dagster, Apache Spark, Streamlit, MinIO, Metabase, dbt, Polars, and Docker. Data comes from Kaggle and the YouTube API. 🌺

Updated Jun 18, 2024
Jupyter Notebook

DivineSamOfficial / Banking-Data-Warehouse-Pipeline

Banking Data Warehouse Pipeline

python aws datawarehousing dbt aws-redshift aws-glue data-engineering-pipeline

Updated Jun 12, 2024
Python

anki-code / xontrib-pipeliner

Let your pipe lines flow through the Python code in xonsh.

python shell pipeline pipe pipelines data-engineering xonsh xontrib data-engineering-pipeline

Updated Jun 7, 2024
Python

PATRICIAJUNQUEIRA / Airflow_Pipeline_Gera_Pasta

Automated data pipeline for extracting and storing weather forecasts for the tourism sector.

python api airflow tourism data-engineering weather-forecast data-pipeline data-engineering-pipeline

Updated Jun 5, 2024
Python

umairkarel / Amazon-Sales-Data-Engineering

Data Engineering Pipeline practice with Amazon Sales Data

python snowflake python3 data-engineering data-engineering-pipeline snowpark

Updated Jun 5, 2024
Python

waqarg2001 / Youtube-Data-Pipeline-AWS

An ETL pipeline built on AWS cloud services that transforms YouTube video statistics data. Data is downloaded from Kaggle, uploaded to an S3 bucket, and cataloged using AWS Glue for querying with Athena. AWS Lambda and Glue convert the data to Parquet format and store it in a cleansed S3 bucket. AWS QuickSight then visualizes the materialised data.

python aws spark aws-lambda etl aws-s3 pandas pyspark data-engineering aws-iam aws-cloudwatch data-pipeline etl-pipeline aws-glue data-engineering-workflows data-engineering-pipeline aws-lambda-layers aws-data-engineering-project data-engineering-project

Updated May 30, 2024
Python

julian506 / openweathermap-etl

A simple ETL for temperature data from the OpenWeatherMap API that stores it in an Azure SQL Database

python etl azure scheduler data-engineering azure-sql-database etl-pipeline data-engineering-pipeline

Updated May 29, 2024
Python

AyushRaiKhare / Ayush_Khare_Data_Engineering_Portfolio

Ayush @ Data Engineering Portfolio

jenkins data-science data data-visualization data-engineering dataflow dbt kubernetes-deployment data-engineer etl-pipeline data-engineering-pipeline mlops data-engineering-nanodegree

Updated May 27, 2024

Cognizant-Technology-Innovation / lakehouseops-sra-for-databricks

The Security Reference Architecture (SRA) implements, as Terraform templates, the typical security features that most high-security organizations deploy, and enforces controls for the largest risks that customers ask about most often.

data-engineering databricks data-engineering-pipeline

Updated Jun 18, 2024
HCL

AtharvTarte / Bing-News-Analysis

In this project, I created an end-to-end solution for analyzing the latest Bing news data, using Microsoft Fabric for all the tools.

spark fabric azure powerbi data-engineering-pipeline microsoft-fabric

Updated Apr 23, 2024
Jupyter Notebook

alfredzou / BoardGameGeek_Pipeline

Pipeline to automate the collection of board game and expansion data from BoardGameGeek's XML API2. Data is stored in Google Cloud Storage and BigQuery, and modelled in a star schema using dbt. (Terraform, GCP, Mage, Python, dbt)

board-game terraform gcp data-engineering boardgame mage dbt boardgamegeek board-games data-engineering-pipeline

Updated Apr 23, 2024
Python

data2al / dbt-tutorial-course

sql data-engineering-pipeline dbt-core

Updated Apr 2, 2024

prayagnshah / End-to-End-Pipeline

Zillow Data Pipeline: Extracts data from Zillow, transfers it through AWS services, and performs analytics. Uses Python scripts, AWS Lambda, S3, Amazon Redshift, and QuickSight. See docs/images for architecture visuals.

python aws-lambda aws-s3 aws-ec2 redshift dag zillow-api quicksight data-engineering-pipeline

Updated Mar 27, 2024
Python

yashksaini-coder / Python-for-Data-Engineering

Data Engineering 🛠️ is like the backbone of data processing 📊, managing data pipelines 🚀, warehouses 🏢, and lakes 🌊. It's the bridge 🌉 between raw data and actionable insights, powering businesses 🚀 with efficient data management and analytics 📈.

python aws data-science kafka data-engineering data-engineer data-engineering-pipeline

Updated Mar 26, 2024
Jupyter Notebook

data-engineering-pipeline

Here are 127 public repositories matching this topic...

JessicaHora / JessicaHora

jolly-io / Data_Engineering_Notes

povoaaires / data_project_model

vmware / versatile-data-kit

Semiu / data-engineering

gear5sh / Gear5

longNguyen010203 / Youtube-ETL-Pipeline

DivineSamOfficial / Banking-Data-Warehouse-Pipeline

anki-code / xontrib-pipeliner

PATRICIAJUNQUEIRA / Airflow_Pipeline_Gera_Pasta

umairkarel / Amazon-Sales-Data-Engineering

waqarg2001 / Youtube-Data-Pipeline-AWS

julian506 / openweathermap-etl

AyushRaiKhare / Ayush_Khare_Data_Engineering_Portfolio

Cognizant-Technology-Innovation / lakehouseops-sra-for-databricks

AtharvTarte / Bing-News-Analysis

alfredzou / BoardGameGeek_Pipeline

data2al / dbt-tutorial-course

prayagnshah / End-to-End-Pipeline

yashksaini-coder / Python-for-Data-Engineering
