Data Engineering Handbook (DEH)

Long Bui * I am a typical engineer.

What is this Handbook?

Getting started with some questions:

  • Why do you need to learn data engineering, analytics engineering, or data science? Is it for a job, for your work, or out of passion?
  • What do you actually need to learn to become an awesome data engineer?
  • What do we need to know about data?
  • How do you deal with issues?
  • Where can I search for information about programming, data engineering, systems, and related knowledge?

Well, look no further. You’ll find it here!

If you are looking for AI algorithms and data science topics such as building ML models, this might not be for you.

This book documents and serves as a reference for the journey of learning data development. It follows the stages of any data pipeline you may know: collect, structure, shape, transform, store, visualize, enable, gain insight, decide, etc.

You will see the terms mentioned above throughout the book.

You can subscribe here to keep track of changes, new research, key notes, and more.

Subscribe to Newsletter

What can you find in this book?

Listen to me: before you start learning the Basic Skills or Advanced Skills, I strongly recommend reading the guideline How to use this book. It is like checking the nutrition facts before eating anything: without that prerequisite, you may feel lost, worried, and confused by the sheer volume of knowledge.

In Part 1, you will see the following items in the navigation section, which point you to the right resources. Chapter 1 begins with fundamental knowledge of software engineering, and Chapter 2: Basic Skills covers big data and data platform knowledge. After those two chapters on data software engineering, you get started with practice through side projects and learn how to build long-lived data applications.

There is no secret to reaching the top 1%. What we need to do is learn, and keep learning, using Chapter 3: Books and Courses to gain advanced information and awareness of the industry domain.

Take the adventure in Part 2, starting with Chapter 4: Advanced Skills, to become a better engineer with the skill set expected of a data engineer and data architect; it shows how to develop data applications by going through the 5 fundamental elements of any data processing framework. You will find Chapter 5: Case Studies and Best Practices, in which I describe real-world data platforms and data pipelines and how to design and implement them well.

As a side part on cloud and cloud/open-source software, you can practice data engineering by going camping in Chapter 6: Data Engineering Camping, where you will learn how to design and implement data pipelines, model data, automate data flows, and design dashboards with an understanding of the data. In Chapter 7: Hands-on, you will learn how to set up data infrastructure and build well-structured data pipelines, which helps you gain experience and knowledge before jumping into Chapter 8: Interview Questions, to prepare for interviews and for joining real-world projects.

Last but not least, Part 3 demonstrates how to create a second brain that structures and organizes your thoughts, notes, reading, and much more. It is good practice to package your knowledge and to maintain all your notes programmatically.

In the online version, you can find the Data Camping, where you can learn and practice your data engineering knowledge.

Plus, I highly recommend creating your own second brain. For an introduction, watch this video: Building longdatadevlog second brain

Summary of the book. Note: captured on 2024-12-07.


This handbook was created and maintained by Long Bui. Copyright © LongDataDevLog.com