Data Engineering Podcast


This show goes behind the scenes for the tools, techniques, and difficulties associated with the discipline of data engineering. Databases, workflows, automation, and data manipulation are just some of the topics that you will find here.

Support the show!

Rewind 10 seconds
1X
Skip 30 seconds ahead
0:00/0:00

Listen in your favorite app:



More options

Here are shows you might like

See show recommendations
AI Engineering Podcast
Tobias Macey
The Python Podcast.__init__
Tobias Macey

459 Episodes

Bringing AI Into The Inner Loop of Data Engineering With Ascend - E458

Summary In this episode of the Data Engineering Podcast Sean Knapp, CEO of Ascend.io, explores the intersection of AI and data engineering. He discusses the evolution of data engineering and the role of AI in automating processes, alleviating burdens on data engineers, and enabling them to focus on complex tasks and innovation. The conversation…

Summary In this episode of the Data Engineering Podcast Sean Knapp, CEO of Ascend.io, explores the…

24 March 2025 | 00:52:47


Astronomer's Role in the Airflow Ecosystem: A Deep Dive with Pete DeJoy - E457

Summary In this episode of the Data Engineering Podcast Pete DeJoy, co-founder and product lead at Astronomer, talks about building and managing Airflow pipelines on Astronomer and the upcoming improvements in Airflow 3. Pete shares his journey into data engineering, discusses Astronomer's contributions to the Airflow project, and highlights the…

Summary In this episode of the Data Engineering Podcast Pete DeJoy, co-founder and product lead at…

16 March 2025 | 00:51:41


Accelerated Computing in Modern Data Centers With Datapelago - E456

Summary In this episode of the Data Engineering Podcast Rajan Goyal, CEO and co-founder of Datapelago, talks about improving efficiencies in data processing by reimagining system architecture. Rajan explains the shift from hyperconverged to disaggregated and composable infrastructure, highlighting the importance of accelerated computing in modern…

Summary In this episode of the Data Engineering Podcast Rajan Goyal, CEO and co-founder of…

08 March 2025 | 00:55:36


The Future of Data Engineering: AI, LLMs, and Automation - E455

Summary In this episode of the Data Engineering Podcast Gleb Mezhanskiy, CEO and co-founder of DataFold, talks about the intersection of AI and data engineering. He discusses the challenges and opportunities of integrating AI into data engineering, particularly using large language models (LLMs) to enhance productivity and reduce manual toil. The…

Summary In this episode of the Data Engineering Podcast Gleb Mezhanskiy, CEO and co-founder of…

26 February 2025 | 00:59:39


Evolving Responsibilities in AI Data Management - E454

Summary In this episode of the Data Engineering Podcast Bartosz Mikulski talks about preparing data for AI applications. Bartosz shares his journey from data engineering to MLOps and emphasizes the importance of data testing over software development in AI contexts. He discusses the types of data assets required for AI applications, including…

Summary In this episode of the Data Engineering Podcast Bartosz Mikulski talks about preparing data…

16 February 2025 | 00:38:57


CSVs Will Never Die And OneSchema Is Counting On It - E453

Summary In this episode of the Data Engineering Podcast Andrew Luo, CEO of OneSchema, talks about handling CSV data in business operations. Andrew shares his background in data engineering and CRM migration, which led to the creation of OneSchema, a platform designed to automate CSV imports and improve data validation processes. He discusses the…

Summary In this episode of the Data Engineering Podcast Andrew Luo, CEO of OneSchema, talks about…

13 January 2025 | 00:54:40


Breaking Down Data Silos: AI and ML in Master Data Management - E452

Summary In this episode of the Data Engineering Podcast Dan Bruckner, co-founder and CTO of Tamr, talks about the application of machine learning (ML) and artificial intelligence (AI) in master data management (MDM). Dan shares his journey from working at CERN to becoming a data expert and discusses the challenges of reconciling large-scale…

Summary In this episode of the Data Engineering Podcast Dan Bruckner, co-founder and CTO of Tamr,…

03 January 2025 | 00:57:30


Building a Data Vision Board: A Guide to Strategic Planning - E451

Summary In this episode of the Data Engineering Podcast Lior Barak shares his insights on developing a three-year strategic vision for data management. He discusses the importance of having a strategic plan for data, highlighting the need for data teams to focus on impact rather than just enablement. He introduces the concept of a "data vision…

Summary In this episode of the Data Engineering Podcast Lior Barak shares his insights on developing…

23 December 2024 | 00:49:59


How Orchestration Impacts Data Platform Architecture - E450

Summary The core task of data engineering is managing the flows of data through an organization. In order to ensure those flows are executing on schedule and without error is the role of the data orchestrator. Which orchestration engine you choose impacts the ways that you architect the rest of your data platform. In this episode Hugo Lu shares his…

Summary The core task of data engineering is managing the flows of data through an organization. In…

16 December 2024 | 00:59:39


An Exploration Of The Impediments To Reusable Data Pipelines - E449

Summary In this episode of the Data Engineering Podcast the inimitable Max Beauchemin talks about reusability in data pipelines. The conversation explores the "write everything twice" problem, where similar pipelines are built without code reuse, and discusses the challenges of managing different SQL dialects and relational databases. Max also…

Summary In this episode of the Data Engineering Podcast the inimitable Max Beauchemin talks about…

08 December 2024 | 00:51:32