Airflow on Kubernetes: Containerizing your workflows
Michael Hewitt
At Nielsen Digital we have been moving our ETLs to containerized environments managed by Kubernetes. We have successfully transferred some of our ETLs to this environment in production. In order to do this we used the following technologies: Helm to easily deploy Airflow on to Kubernetes; Airflow’s Kubernetes Executor to take full advantage Kubernetes features; and Airflow’s Kubernetes Pod Operator in order to execute our containerized Tasks within our DAGs. To automate a lot of the deployment process we also used Terraform. Lastly, Kubernetes features were used to gain much more fine grained control of Airflows infrastructure.
Видео Airflow on Kubernetes: Containerizing your workflows канала Apache Airflow
At Nielsen Digital we have been moving our ETLs to containerized environments managed by Kubernetes. We have successfully transferred some of our ETLs to this environment in production. In order to do this we used the following technologies: Helm to easily deploy Airflow on to Kubernetes; Airflow’s Kubernetes Executor to take full advantage Kubernetes features; and Airflow’s Kubernetes Pod Operator in order to execute our containerized Tasks within our DAGs. To automate a lot of the deployment process we also used Terraform. Lastly, Kubernetes features were used to gain much more fine grained control of Airflows infrastructure.
Видео Airflow on Kubernetes: Containerizing your workflows канала Apache Airflow
Показать
Комментарии отсутствуют
Информация о видео
Другие видео канала
Kubernetes Tutorial for Beginners [FULL COURSE in 4 Hours]Testing Airflow workflows - ensuring your DAGs work before going into productionRunning Apache Airflow Reliably with Kubernetes | AstronomerAirflow on Kubernetes: Dynamic Workflows Simplified - Daniel Imberman, Bloomberg & Barni SeetharamanThe Newcomer's Guide to Airflow's ArchitectureWhat is KubernetesKeynote: How large companies use Airflow for ML and ETL pipelinesHow to Create GKE Cluster Using TERRAFORM from Scratch? (GCP Shared VPC | Terraform GCP Kubernetes)Data Engineering Principles - Build frameworks not pipelines - Gatis SejaKubernetes Operator simply explained in 10 minsProduction Docker image for Apache AirflowWhat's coming in Airflow 2.0?Running Apache Airflow with the KubernetesExecutor on a multi-node Kubernetes cluster locallyAirflow DAG: Coding your first DAG for BeginnersAchieving Airflow observability with DatabandOrchestrating dbt with DagsterMachine Learning with Apache AirflowFocus On: The Apache Airflow PythonOperator, all you need in 20 mins!Deploy Kubernetes From Scratch On AWS In 5 Min! (Plus Intro To Kubernetes)Building reuseable and trustworthy ELT pipelines (A templated approach)