Airflow

40 Hours

TDXAD-107

Course outline

This course teaches Apache Airflow, an open-source platform for orchestrating complex workflows in data pipelines. Airflow lets users schedule, monitor, and manage workflows, making it a powerful tool for data engineers, data scientists, and anyone who builds and maintains data pipelines.

Upcoming meetings

There are no upcoming meetings for this course.
Contact us to schedule this course, which will be customized specifically for your organization.
info@hackerupro.com

Modules

  • Overview of Airflow and its use cases
  • Airflow architecture and components
  • Setting up Airflow on a local machine
  • Defining tasks and dependencies
  • Setting up schedules and triggers
  • Using variables and templates
  • Types of operators and when to use them
  • Using hooks to interact with external systems
  • Creating custom operators and hooks
  • Using the Airflow UI to monitor DAGs
  • Logging and exception handling
  • Common errors and troubleshooting techniques
  • Airflow connections and authentication
  • Using Airflow with Kubernetes
  • Best practices and tips for scaling Airflow
  • Practice creating and running DAGs
  • Q&A with the instructor

Prerequisites

  • 01 Python Programming
  • 02 Understanding of Data Concepts, Command Line Interface (CLI)
  • 03 Version Control Systems (e.g., Git)
  • 04 SQL
  • 05 Familiarity with Linux/Unix Systems, Understanding of Distributed Systems, Experience with Data Pipelines or Workflow Management

Target Audience

  • Data Engineers, Data Scientists, Data Analysts
  • DevOps Engineers, Software Engineers, System Administrators
  • Business Analysts
  • Anyone Interested in Workflow Automation
  • Big Data Engineers
