How to Monitor Airflow with Prometheus, StatsD, and Grafana

Get best practices from data experts to help you and your data engineering team:

  • Monitor your Airflow pipelines better
  • Configure an open-source observability dashboard
  • Discover how to trust your data
[Image: screenshot of a Grafana operational dashboard]

What's inside

Monitoring Airflow can be painful. To debug health problems or find the root cause of failures, a data engineer needs to hop between the Apache Airflow UI, DAG logs, various monitoring tools, and Python code.

It doesn’t have to be this way.

Operational dashboards give you a bird’s-eye view of your system, your clusters, and their overall health.

In this guide, we explore best practices for building an operational dashboard the open-source way, with Prometheus, StatsD, and Grafana.

The goal is to help you quickly answer questions like:

  • Is our cluster alive?
  • How many DAGs do we have in the DagBag?
  • Which operators succeeded and which failed lately?
  • How many tasks are running right now?
  • How long did it take for the DAG to complete?
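
To make a few of these questions concrete, here is a minimal sketch of how they might map to Prometheus queries. It is an illustration, not the guide's own setup: the metric names assume Airflow's StatsD metrics (`scheduler_heartbeat`, `dagbag_size`, `executor.running_tasks`) have been translated by an exporter into `airflow_`-prefixed Prometheus names, and `http://localhost:9090` is a placeholder Prometheus address. Your names will depend on your own exporter mapping.

```python
from urllib.parse import urlencode

# Assumed metric names: Airflow StatsD metrics as they might appear in
# Prometheus after an exporter adds an "airflow_" prefix and replaces dots
# with underscores. Adjust these to match your own mapping.
QUESTION_TO_PROMQL = {
    "Is our cluster alive?": "rate(airflow_scheduler_heartbeat[5m])",
    "How many DAGs do we have in the DagBag?": "airflow_dagbag_size",
    "How many tasks are running right now?": "airflow_executor_running_tasks",
}


def build_query_url(base_url: str, promql: str) -> str:
    """Build a Prometheus instant-query URL (/api/v1/query) for an expression."""
    return f"{base_url.rstrip('/')}/api/v1/query?{urlencode({'query': promql})}"


if __name__ == "__main__":
    # Placeholder address; point this at your own Prometheus server.
    for question, promql in QUESTION_TO_PROMQL.items():
        print(question)
        print("  ", build_query_url("http://localhost:9090", promql))
```

Each URL can be opened in a browser or fetched with any HTTP client, and the same expressions can be pasted into a Grafana panel backed by a Prometheus data source.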
