-
The parts, pieces and future of composable data systems
Categories: OpinionThis week on The Data Stack Show, Eric and Kostas chat with a panel of experts as Wes McKinnyey (Cofounder,…
-
Introduction from the original creators of Iceberg
By Ryan Blue and Daniel Weeks, Iceberg PMC Members Apache Iceberg is now the de facto open format for analytic…
-
Why Apache Iceberg — for data warehouse users
Major data warehouse platforms such as Google BigQuery, Snowflake, AWS, and Databricks have all announced support for Apache Iceberg tables. Commercial warehouse engines seldom…
-
Why Apache Iceberg — for data lake users
INTRODUCTION If you have been working in a data lake, you’re probably very familiar with its drawbacks. You’re in luck:…
-
Iceberg and Hudi ACID Guarantees
When I talk about Apache Iceberg and open table formats, I have a strong preference to focus on just Iceberg.…
-
The Case for Independent Storage
Tabular has just secured a new round of funding, led by Altimeter Capital, with participation from Andreessen Horowitz and Zetta…
-
Zen and the art of CDC Performance
I’ve moved around for work quite a bit in my career, mostly when I worked for the government and was…
-
Securing the Data Lake – Part III
Categories: OpinionThe growing need for a secure data lake We’re experiencing two powerful business trends that seem at odds with each…
-
The CDC MERGE Pattern
Categories: OpinionThe MERGE pattern is the first design I discuss in this series that maintains data directly in a mirror table…
-
Iceberg in Modern Data Architecture
The last two weeks have seen several major announcements about Apache Iceberg: These are radical developments. Commercial warehouse engines seldom…
-
CDC Data Gremlins
Categories: OpinionThe first part of this series introduced CDC table mirroring using the change log pattern. It omitted an important part of…
-
Hello, World of CDC!
Categories: OpinionThis is the first in a series about mirroring transactional database tables into a data lake. Mirroring is an important…
-
Securing the Data Lake – Part II
Categories: OpinionMotivation Seemingly every day, more people across the business are using more data in more ways to deliver value. At…
-
Securing the Data Lake – Part I
Categories: OpinionMotivation Seemingly every day, more people across the business are using more data in more ways to deliver value. At…
-
The top 3 things data engineers can stop spending time on
One of the reasons we started Tabular is that data engineers are asked to do far too much unnecessary work.…