etl
Here are 74 public repositories matching this topic...
Jupyter notebooks for a range of purposes: social network web scraping, ETL, Selenium WebDriver for web testing, automation with Python, data wrangling, data transformation, data cleaning, stock market analysis, APIs, machine learning algorithms, and more.
Updated Aug 9, 2020 - Jupyter Notebook
Orchestration of data science and earth observation models in Apache Airflow, scaling up with the Celery Executor and experimenting with Jupyter Notebook, using a Docker container composition.
Updated Aug 23, 2022 - Python
A lightweight helper utility that lets developers build pipelines interactively by sharing one source code base between DLT runs and non-DLT interactive notebook runs.
Updated Dec 7, 2022 - Python
Perform the Extract, Transform and Load (ETL) process to create a data pipeline on movie datasets using Python, Pandas, Jupyter Notebook and PostgreSQL.
Updated Oct 12, 2022 - Jupyter Notebook
A starter repository for your next AWS Glue project. It comes with complete IaC, a CD pipeline, and a reusable common SDK. Set up Jupyter Notebook for AWS Glue locally.
Updated Sep 6, 2023 - Jupyter Notebook
Various statistics-based data analytics projects in notebook form.
Updated Mar 3, 2020 - HTML
ETL with Jupyter Notebooks, Pandas, and Azure Cosmos DB
Updated Oct 5, 2023 - Jupyter Notebook
👾 My old deep learning notebooks (e.g., TensorFlow examples, Caffe, deep art, NumPy)
Updated Nov 18, 2024 - Jupyter Notebook
A repository holding the files and notebooks produced throughout my Udacity Data Engineering Nanodegree program.
Updated Dec 5, 2022 - PLpgSQL
This repo contains all the notebooks and code from the Data Science Mentorship Program offered by the CampusX YouTube channel.
Updated Aug 26, 2023 - Jupyter Notebook
Common ETL patterns and utilities for PySpark. Notebooks tested on Databricks Community Edition.
Updated Sep 3, 2022 - Jupyter Notebook
Data science encompasses a wide range of areas, topics, and sub-domains such as Big Data, Machine & Deep learning (ETL, TensorFlow, Keras), Data Mining/Visualization (EDA), BI, Predictive Analytics, Statistical Analytics, etc.
Updated May 3, 2024
Using Kaggle data on the top restaurants of 2020, this project uses Python scripting in Jupyter Notebook to transform and clean the data, then loads the cleaned data frames into a PostgreSQL database.
Updated Mar 29, 2021 - Jupyter Notebook
My AI labs Jupyter notebooks repo
Updated Oct 29, 2025 - Jupyter Notebook
SEC Finance Data Engineering: an ETL process for SEC finance data of S&P 500 companies. Jupyter notebooks run the ETL workflows. The final dataset is hosted in MongoDB Atlas (cloud). The API is written in Python with the PyMongo and Flask libraries. The dashboards with charts are hosted in MongoDB Atlas.
Updated Mar 5, 2024 - Jupyter Notebook
Jupyter Notebook ETL from AWS S3 bucket
Updated Jul 3, 2022 - Jupyter Notebook
Sample notebooks on Azure Databricks for ETL
Updated May 20, 2023 - Scala