Data warehouse or data lake, which do I choose?

Ali LeClerc, Head of Community @ Ahana

Today’s data-driven companies have a choice to make – where do we store our data? As the move to the cloud continues to be a driving factor, the choice becomes either the data warehouse (Snowflake et al) or the data lake (AWS S3 et al). There are pros and cons to each approach. While the data warehouse will give you strong data management with analytics, they don’t do well with semi-structured and unstructured data with tightly coupled storage and compute, not to mention expensive vendor lock-in. On the other hand, data lakes allow you to store all kinds of data and are extremely affordable, but they’re only meant for storage and by themselves provide no direct value to an organization.

Enter the Open Data Lakehouse, the next evolution of the data stack that gives you the openness and flexibility of the data lake with the key aspects of the data warehouse like management and transaction support.

In this talk, you’ll hear from Ali LeClerc who will discuss the data landscape and why many companies are moving to an open data lakehouse. Ali will share more perspective on how you should think about what fits best based on your use case and workloads, and how real-world teams are using Presto, a SQL query engine, to bring analytics to the data lakehouse.

Where & when?

Data Stack Summit 2022 was held virtually on June 22nd, 2022. Stay tuned for announcements about future sessions!

What is the cost to attend the virtual sessions?

Data Stack Summit is always free and open for all to attend

What is Data Stack Summit?

Finding ways to efficiently conquer the modern data stack can become infinitely more possible when we’re able to gather together collaboratively as a community and discuss the tools and capabilities desired by future-forward organizations. 

Hear real-world perspectives from long time enterprise data visionaries, data engineers, data and cloud architects, DataOps and DevOps practitioners as they talk through topics like the building blocks of the modern data platform, open source considerations, best practices for enterprise data operations, migrations, data observability, and tuning data pipelines for performance at scale.

Who's coming to Data Stack Summit?

Data and cloud architects, data engineers, DevOps practitioners and managers, data and ITOps leaders

Join us for talks around:
  • Building blocks of the modern data platform
  • Building the modern data platform using open source
  • Implementing the modern data platform using Kubernetes
  • Modern best practices for enterprise operations
  • Platform observability
  • Migrations to modern data platforms
  • Optimizing high-performance big data for future-forward enterprises
  • Enterprise Spark deployments
Interested in speaking or sponsoring the next Data Stack Summit?

Please reach out to astronaut@solutionmonday.com.

Don't miss out on the next Data Stack Summit! Sign up below for access to future sessions.

A big thank you to our 2022 Data Stack Summit Sponsors for making the event possible