DataNation - Podcast for Data Engineers, Analysts and Scientists

Alex Merced Podcasts

About

Welcome to "Datanation," the podcast where your host, Alex Merced, takes you on a captivating journey through the fascinating world of data. In each episode, we explore a wide range of data topics, from data engineering and data analytics to the art and science of data-driven decision-making.

In the age of information, data is the currency that drives innovation and progress. "Datanation" is your passport to this ever-evolving landscape, where we unravel the mysteries, dissect the trends, and celebrate the breakthroughs shaping the data-driven future.

Join Alex Merced, a seasoned data enthusiast and educator, as he engages in enlightening discussions, informative interviews, and thought-provoking explorations of data concepts and practices. Whether you're a seasoned data professional, a curious tech enthusiast, or someone simply intrigued by the power of data, this podcast offers valuable insights and knowledge.

Find all episodes at: https://host.alexmercedpodcast.com/series/datanation/

Follow Alex on Twitter @amdatalakehouse

Find article Alex has written on Data related topics at Dremio.com/Subsurface

Available on

Community

49 episodes

52 – Apache Iceberg, Dremio and PuppyGraph

Alex Merced discusses the benefits of Apache Iceberg’s open data ecosystem! Build a Data Lakehouse on Your Laptop Deploy Deploy into Production

1s
Mar 28
#1 – intro to catalogs, manifests and metadata. Oh my!

In this episode, Alex Merced introduces his new podcast “Catalogs, Manifests, and Metadata. Oh my!” covering open-source data projects like Apache Iceberg and others. Make sure to subscribe, this podcast will be showing up in podcast directories over the next week or so of the publishing of this episode. Follow Alex Merced, find all links […]

1s
Mar 25
51 – Open Data Standards (Apache Iceberg, Apache Parquet, Apache Arrow, Apache Ibis, Apach Substrait)

Alex Merced discusses many of the open source projects aiming to reduce the frictions the heavily fragmented data world. Follow me on Socials:https://bio.alexmerced.com/data

1s
Mar 18
48 – Understanding how Lakehouse Table Formats are Implemented in your Favorite Tools

Alex Merced discusses how formats like Apache Iceberg, Apache Hudi and Delta Lake work and are implemented into your favorite tools, distinguishing what is the responsibility of the format and there responsibility of the engine. Follow Alex on Social, find all links at:https://bio.alexmerced.com/data

1s
Feb 02
Bonus: New Youtube Channel, State of the Data Lakehouse

Find all my data resources below:https://bio.alexmerced.com/data Listen to the State of the Data Lakehouse Podcast Here:https://em360tech.com/podcast/dremio-state-data-lakehouse?utm_source=podcasts&utm_medium=podcast&utm_content=content&utm_campaign=alexmercedcontent&utm_term=iceberg+lakehouse+nessie

1s
Jan 20
2024 Preview – Data/Web Content

youtube.com/@alexmercedcoder youtube.com/@alexmerceddata twitter.com/alexmercedcoder twitter.com/amdatalakehouse

1s
Jan 09
46 – Apache Iceberg vs Delta Lake: Understanding the Table Format Debate

ZeroETL & Virtual Data Marts Presentation: https://www.youtube.com/watch?v=mDwpsg8btto Blog for getting hands on with Dremio on Laptop:https://www.dremio.com/blog/intro-to-dremio-nessie-and-apache-iceberg-on-your-laptop/

1s
Dec 08, 2023
45 – BI Dashboard Acceleration (Extracts, Cubes and Reflections)

Alex Merced discusses different techniques to speed up BI Dashboard performance.

1s
Nov 01, 2023
44 – Multi-Table Versioning and why Abstractions Matter

There is a reason the Git-for-Data Paradigm of Nessie catalogs is so essential, not only for the versioning features it provides but also the level of abstraction it provides them. In this episode, I discuss this more.

1s
Oct 19, 2023
43 – Building a Data Lakehouse on your Laptop

In just a few commands, you can have everything you need to practice ingestion and querying with popular data software. Just install Docker and then run the commands in the image. You can also follow the directions in this blog:https://lnkd.in/eDiC8fc6 Also try out this video series:https://lnkd.in/gp843ErM

1s
Aug 23, 2023
42 – Window Functions and Apache Iceberg Metadata Tables

Alex Merced describes what are window function, and how they can be applied to Apache Iceberg Metadata tables

1s
Jul 12, 2023
41 – Databricks’ “Open” Problem and the Need for an Agnostic Intermediate Data Lakehouse Table Format

Alex Merced discusses some of the fallout from Databricks’ UNIFormat announcement, and the innovation the industry needs to unlock the data lakehouse. Follow me on twitter @amdatalakehouse

1s
Jun 29, 2023
40 – Big Announcements for Apache Iceberg, Delta Lake and Apache Hudi from Snowflake and Databricks

Alex Merced discusses some of the big announcements from this weeks conferences. Make sure to checkout Gnarly Data Waves on your favorite podcast app.

1s
Jun 28, 2023
39 – What are Dremio’s Data Reflections and why are they so cool!

Alex Merced explains what are Dremio reflection and how they bring you speed, reduce storage costs, and do so while keeping things easy for your end users. Follow Alex on twitter @amdatalakehouse

1s
Jun 23, 2023
37 – Dremio, Data Lakehouses and Generative AI

Alex Merced discusses Dremio’s new generative AI Features and the future of Data Lakehouses. Follow Alex on twitter @amdatalakehouse

1s
Jun 16, 2023
36 – ELT & ETL: The Good, The Bad and the Ugly

Alex Merced reflects on a recent article from Lauren Balik on the topic of ELT. Here is the Article:https://medium.com/@laurengreerbalik/how-fivetran-dbt-actually-fail-3a20083b2506 Launren’s Twitter: @laurenbalik My Twitter handle: @amdatalakehouse

1s
May 23, 2023
35 – Data Lakehouse Statistics (Understanding Parquet and Iceberg)

Alex Merced helps explain how stats are collected and used when working with Parquet files and Apache Iceberg tables. Follow Alex on twitter @amdatalakehouse

1s
May 08, 2023
BONUS: What is Object Storage like AWS S3, Minio and more!

Alex Merced discusses what is Object Storage and the history of file systems. Join the community at datanation.click

1s
Apr 12, 2023
34 – What is a Vector Database?

Alex Merced explains what is a Vector Database Join the community at DataNation.click

1s
Apr 08, 2023
BONUS: The Big Picture at a Tech Company (Engineering, Product, Marketing, Sales)

Alex Merced discusses the different departments at a tech company and how they all fit together to create success. follow alex on twitter Web -> @alexmercedcoder Data -> @amdatalakehouse

1s
Mar 23, 2023
33 – CI/CD on the Data Lakhouse

Alex Merced discusses what is CI/CD and how to achieve CI/CD pipelines on the data lakehouse.

1s
Mar 21, 2023
32 – Data Versioning Solutions (Apache Iceberg, Project Nessie, LakeFS)

Alex Merced discusses the different Data Versioning Solutions and the approach different solutions have.

1s
Mar 10, 2023
31 – Optimizing MPP Workloads

Alex Merced discusses how MPP tools plan tasks and how understanding that can help you plan your writes better. dremio.com/subsurface <— Register for Subsurface

1s
Feb 22, 2023
30 – The Subsurface Live! Data Lakehouse Conference

Register for Subsurface at Dremio.com/subsurface Follow me on twitter @amdatalakehouse Subscribe to this and my other podcasts: Gnarly Data WavesSelect * from Data.Lake;Web Dev 101Web and Data: Interviews by Alex Merced

1s
Feb 15, 2023
29 – Optimizing Data Performance on Small Data and Big Data

Alex Merced discusses the different considerations with optimizing data and how no one tool can make every use case performant, but understanding which ones will solve which use cases is the key.

1s
Feb 07, 2023
28 – Reduce Data Warehouse costs with Dremio and DuckDB

Alex Merced discusses how you can really reduce your Data Warehouse costs by using Dremio to unify and organize your data lake and DuckDB for local ad hoc queries on data pulled through Dremio. Join the slack community at DataNation.click

1s
Feb 01, 2023