-
Updated
Jun 26, 2024 - CSS
data-engineering-pipeline
Here are 127 public repositories matching this topic...
-
Updated
Jun 26, 2024
One framework to develop, deploy and operate data workflows with Python and SQL.
-
Updated
Jun 27, 2024 - Python
Introduction to Data Engineering
-
Updated
Jun 19, 2024 - Jupyter Notebook
high performance better alternative to Airbyte, Singer, Meltano
-
Updated
Jun 19, 2024 - Go
πππ A Data Engineering Project that implements an ETL data pipeline using Dagster, Apache Spark, Streamlit, MinIO, Metabase, Dbt, Polars, Docker. Data from kaggle and youtube-api πΊ
-
Updated
Jun 18, 2024 - Jupyter Notebook
Banking Data Warehouse Pipeline
-
Updated
Jun 12, 2024 - Python
Let your pipe lines flow thru the Python code in xonsh.
-
Updated
Jun 7, 2024 - Python
Pipeline de dados automatizado para extração e armazenamento de previsáes meteorológicas para o setor de turismo.
-
Updated
Jun 5, 2024 - Python
Data Engineering Pipeline practice with Amazon Sales Data
-
Updated
Jun 5, 2024 - Python
Leveraging AWS Cloud Services, an ETL pipeline transforms YouTube video statistics data. Data is downloaded from Kaggle, uploaded to an S3 bucket, and cataloged using AWS Glue for querying with Athena. AWS Lambda and Glue converts to Parquet format and stores it in a cleansed S3 bucket. AWS QuickSight then visualizes the materialised data.
-
Updated
May 30, 2024 - Python
A simple ETL for temperature data from the Openweathermap API, storing it into an Azure SQL Database
-
Updated
May 29, 2024 - Python
Ayush @ Data Engineering Portfolio
-
Updated
May 27, 2024
The Security Reference Architecture (SRA) implements typical security features as Terraform Templates that are deployed by most high-security organizations, and enforces controls for the largest risks that customers ask about most often.
-
Updated
Jun 18, 2024 - HCL
In this project, I have created an end to end solution for analyzing the bing latest news data. I have used the microsoft fabric for all the tools.
-
Updated
Apr 23, 2024 - Jupyter Notebook
Pipeline to automate the collection of board game and expansion data from BoardGameGeek's XML API2. Data is stored in Google Cloud Storage and BigQuery. Data is modelled using DBT in a star schema. (Terraform, GCP, Mage, Python, dbt)
-
Updated
Apr 23, 2024 - Python
-
Updated
Apr 2, 2024
Zillow Data Pipeline: Extracts data from Zillow, transfers it through AWS services, and performs analytics. Utilizes Python scripts, AWS Lambda, S3, Amazon RedShift, and QuickSight. Explore docs/images for architecture visuals.
-
Updated
Mar 27, 2024 - Python
Data Engineering π οΈ is like the backbone of data processing π, managing data pipelines π, warehouses π’, and lakes π. It's the bridge π between raw data and actionable insights, powering businesses π with efficient data management and analytics π.
-
Updated
Mar 26, 2024 - Jupyter Notebook
Improve this page
Add a description, image, and links to the data-engineering-pipeline topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with the data-engineering-pipeline topic, visit your repo's landing page and select "manage topics."