Week 1 - Infrastructure and Configuration

  • Overview of Architecture
  • Technologies
    • Open-source data tools (Postgres, Iceberg, MinIO, Kafka, Spark, dbt, Airflow, Deequ, DuckDB…)
    • Docker
    • Terraform
    • AWS Cloud

Watch this video: https://youtu.be/S-RfuZGlLUQ

If you cannot download the file, the sample code is available here: GitHub repository

Docker + Postgres

Practice this code

What will you learn this week?

AWS + Terraform

Practice this code

  • Introduction to AWS (Amazon Web Services)
  • Introduction to Terraform Concepts & AWS Pre-Requisites
  • Creating AWS Infrastructure with Terraform

Environment setup

For the course you’ll need:

  • Python 3 (e.g. installed with Anaconda)
  • Docker with docker-compose
  • Terraform
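
Before starting, it can help to confirm that the tools listed above are actually on your PATH. The following is a minimal sketch, not part of the course material: the tool names in REQUIRED_TOOLS and the check_tools helper are assumptions for illustration. Recent Docker installs may ship Compose as the `docker compose` plugin instead of a separate `docker-compose` binary, in which case that entry will show as missing even though Compose works.

```python
import shutil

# Hypothetical list of executables to look for; adjust to your own setup.
REQUIRED_TOOLS = ["python3", "docker", "docker-compose", "terraform"]


def check_tools(tools=REQUIRED_TOOLS, which=shutil.which):
    """Return a dict mapping each tool name to its resolved path, or None if not found."""
    return {tool: which(tool) for tool in tools}


if __name__ == "__main__":
    # Print one line per tool so missing ones are easy to spot.
    for tool, path in check_tools().items():
        print(f"{tool:16} {path if path else 'NOT FOUND'}")
```

The `which` parameter is injectable only so the function can be exercised without depending on what is installed on the machine.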

Check out my dotfiles for setting up a data engineering environment

What makes you better?

  • Know and practice how Docker and Docker Compose work
  • Know and practice how to create, start, and pull Docker images
  • Know and practice how to create Docker containers using Docker Compose
  • Know and practice what IaC and Terraform are
  • Know and practice how to create services with Terraform
  • Know and practice how to write functions to extract, load, and transform data
  • Know and practice the basics of a data warehouse (e.g., Snowflake, or just Postgres)
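
As a starting point for the extract/load/transform item above, here is a minimal sketch of the three functions wired together. It is an illustration under stated assumptions, not the course's own code: the sample CSV, the `payments` table, and all function names are hypothetical, and an in-memory SQLite database stands in for Postgres so the example runs without Docker.

```python
import csv
import io
import sqlite3

# Hypothetical sample input: one row has a missing amount on purpose.
SAMPLE_CSV = "name,amount\n alice ,10.5\nbob,\ncarol,3\n"


def extract(csv_text):
    """Parse CSV text into a list of dicts, one per data row."""
    return list(csv.DictReader(io.StringIO(csv_text)))


def transform(rows):
    """Trim and title-case names, cast amounts to float, skip rows with no amount."""
    cleaned = []
    for row in rows:
        if not row["amount"].strip():
            continue
        cleaned.append((row["name"].strip().title(), float(row["amount"])))
    return cleaned


def load(conn, records):
    """Insert records into the payments table and return the total row count."""
    conn.execute("CREATE TABLE IF NOT EXISTS payments (name TEXT, amount REAL)")
    conn.executemany("INSERT INTO payments (name, amount) VALUES (?, ?)", records)
    conn.commit()
    return conn.execute("SELECT COUNT(*) FROM payments").fetchone()[0]


def run_pipeline(csv_text=SAMPLE_CSV):
    """Run extract -> transform -> load against a throwaway in-memory database."""
    conn = sqlite3.connect(":memory:")  # SQLite stand-in for Postgres (assumption)
    try:
        return load(conn, transform(extract(csv_text)))
    finally:
        conn.close()


if __name__ == "__main__":
    print(run_pipeline())
```

Keeping each step as a separate function makes it easy to swap the SQLite connection for a real Postgres connection later while leaving extract and transform unchanged.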