Alex Merced discusses the benefits of Apache Iceberg’s open data ecosystem! Build a Data Lakehouse on Your Laptop Deploy Deploy into Production
In this episode, Alex Merced introduces his new podcast “Catalogs, Manifests, and Metadata. Oh my!” covering open-source data projects like Apache Iceberg and others. Make sure to subscribe, this podcast will be showing up in podcast directories over the next week or so of the publishing of this episode. Follow Alex Merced, find all links […]
Alex Merced discusses many of the open source projects aiming to reduce the frictions the heavily fragmented data world. Follow me on Socials:https://bio.alexmerced.com/data
Alex thinks on the development of Real-time data pipelines.
Alex Merced discusses how formats like Apache Iceberg, Apache Hudi and Delta Lake work and are implemented into your favorite tools, distinguishing what is the responsibility of the format and there responsibility of the engine. Follow Alex on Social, find all links at:https://bio.alexmerced.com/data
Alex Merced discusses cloud costs Alex’s Links: https://bio.alexmerced/data
Find all my data resources below:https://bio.alexmerced.com/data Listen to the State of the Data Lakehouse Podcast Here:https://em360tech.com/podcast/dremio-state-data-lakehouse?utm_source=podcasts&utm_medium=podcast&utm_content=content&utm_campaign=alexmercedcontent&utm_term=iceberg+lakehouse+nessie
youtube.com/@alexmercedcoder youtube.com/@alexmerceddata twitter.com/alexmercedcoder twitter.com/amdatalakehouse
ZeroETL & Virtual Data Marts Presentation: https://www.youtube.com/watch?v=mDwpsg8btto Blog for getting hands on with Dremio on Laptop:https://www.dremio.com/blog/intro-to-dremio-nessie-and-apache-iceberg-on-your-laptop/
Alex Merced discusses different techniques to speed up BI Dashboard performance.
Submit your talks here: https://www.dremio.com/subsurface/
There is a reason the Git-for-Data Paradigm of Nessie catalogs is so essential, not only for the versioning features it provides but also the level of abstraction it provides them. In this episode, I discuss this more.
In just a few commands, you can have everything you need to practice ingestion and querying with popular data software. Just install Docker and then run the commands in the image. You can also follow the directions in this blog:https://lnkd.in/eDiC8fc6 Also try out this video series:https://lnkd.in/gp843ErM
Alex Merced describes what are window function, and how they can be applied to Apache Iceberg Metadata tables
Alex Merced discusses some of the fallout from Databricks’ UNIFormat announcement, and the innovation the industry needs to unlock the data lakehouse. Follow me on twitter @amdatalakehouse
Alex Merced discusses some of the big announcements from this weeks conferences. Make sure to checkout Gnarly Data Waves on your favorite podcast app.
Alex Merced explains what are Dremio reflection and how they bring you speed, reduce storage costs, and do so while keeping things easy for your end users. Follow Alex on twitter @amdatalakehouse
Alex Merced discusses Dremio’s new generative AI Features and the future of Data Lakehouses. Follow Alex on twitter @amdatalakehouse
Alex Merced reflects on a recent article from Lauren Balik on the topic of ELT. Here is the Article:https://medium.com/@laurengreerbalik/how-fivetran-dbt-actually-fail-3a20083b2506 Launren’s Twitter: @laurenbalik My Twitter handle: @amdatalakehouse
Alex Merced helps explain how stats are collected and used when working with Parquet files and Apache Iceberg tables. Follow Alex on twitter @amdatalakehouse
Alex Merced discusses what is Object Storage and the history of file systems. Join the community at datanation.click
Alex Merced explains what is a Vector Database Join the community at DataNation.click
Alex Merced discusses the different departments at a tech company and how they all fit together to create success. follow alex on twitter Web -> @alexmercedcoder Data -> @amdatalakehouse
Alex Merced discusses what is CI/CD and how to achieve CI/CD pipelines on the data lakehouse.
Alex Merced discusses the different Data Versioning Solutions and the approach different solutions have.
Alex Merced discusses how MPP tools plan tasks and how understanding that can help you plan your writes better. dremio.com/subsurface <— Register for Subsurface
Register for Subsurface at Dremio.com/subsurface Follow me on twitter @amdatalakehouse Subscribe to this and my other podcasts: Gnarly Data WavesSelect * from Data.Lake;Web Dev 101Web and Data: Interviews by Alex Merced
Alex Merced discusses the different considerations with optimizing data and how no one tool can make every use case performant, but understanding which ones will solve which use cases is the key.
Alex Merced discusses how you can really reduce your Data Warehouse costs by using Dremio to unify and organize your data lake and DuckDB for local ad hoc queries on data pulled through Dremio. Join the slack community at DataNation.click