A project that extracts data, stores it in a modern data lakehouse, and presents it in a data visualization platform.
This project aims to implement a complete stack, from data extraction to presentation, using free tools, an open data lakehouse concept, and state-of-the-art technologies.
The following software is required to run the project:
- Docker
- Python > 3.8
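The prerequisites above can be checked before going further. The following is a minimal sketch, not part of the project itself: the function names are hypothetical, "Python > 3.8" is read as "3.9 or newer" (an assumption), and Docker is considered installed if a `docker` executable is found on the `PATH`.

```python
import shutil
import sys


def python_is_supported(version_info=sys.version_info):
    """Return True if the interpreter is newer than the 3.8 series.

    Assumption: "Python > 3.8" is read as 3.9 or newer.
    """
    return tuple(version_info[:2]) > (3, 8)


def docker_is_installed(which=shutil.which):
    """Return True if a `docker` executable is found on the PATH."""
    return which("docker") is not None


def missing_prerequisites(version_info=sys.version_info, which=shutil.which):
    """Return the names of the prerequisites that are not satisfied."""
    missing = []
    if not python_is_supported(version_info):
        missing.append("Python > 3.8")
    if not docker_is_installed(which):
        missing.append("Docker")
    return missing
```

Calling `missing_prerequisites()` with no arguments inspects the running interpreter and the current `PATH`; an empty list means both prerequisites were found.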
- Python - https://www.python.org/downloads/
- Docker - https://docs.docker.com/get-docker/
- Airbyte - https://docs.airbyte.io/
- Minio - https://min.io/docs/minio/container/index.html
cd config/containers/minio && docker-compose up -d
- MySQL
cd config/containers/mysql && docker-compose up -d
- Airflow - https://airflow.apache.org/docs/apache-airflow/stable/howto/docker-compose/index.html
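The Minio and MySQL steps above both run `docker-compose up -d` from a directory under `config/containers`, so they can be scripted. The following is a rough sketch under a few assumptions: it is run from the repository root, the `docker-compose` command is available, and the names `COMPOSE_DIRS` and `start_stacks` are hypothetical helpers, not part of the project. Airbyte and Airflow are left out because this README points to their own documentation for setup.

```python
import subprocess
from pathlib import Path

# Directories taken from the commands in this README.
COMPOSE_DIRS = ["config/containers/minio", "config/containers/mysql"]


def compose_up_command():
    """Return the command used in this README to start a stack in the background."""
    return ["docker-compose", "up", "-d"]


def start_stacks(repo_root=".", runner=subprocess.run):
    """Run `docker-compose up -d` in each compose directory, in order.

    `runner` defaults to subprocess.run; it is a parameter only so the
    function can be exercised without Docker (an illustrative choice).
    """
    started = []
    for relative_dir in COMPOSE_DIRS:
        workdir = Path(repo_root) / relative_dir
        # check=True raises CalledProcessError if the command fails.
        runner(compose_up_command(), cwd=workdir, check=True)
        started.append(str(workdir))
    return started
```

Calling `start_stacks()` from the repository root starts Minio first and then MySQL, and stops at the first failure.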
Danton Bertuol - [email protected]
https://airflow.apache.org/docs/apache-airflow/stable/howto/docker-compose/index.html
https://docs.airbyte.com/deploying-airbyte/local-deployment