Apache Iceberg: The Definitive Guide: Data Lakehouse Functionality, Performance, and Scalability on the Data Lake

$69.99

113 in stock

Traditional data architecture patterns are severely limited. To use these patterns, you have to ETL data into each tool–a cost-prohibitive process for making warehouse features available to all of your data. The lack of flexibility with these patterns requires you to lock into a set of priority tools and formats, which creates data silos and data drift. This practical book shows you a better way.

Apache Iceberg provides the capabilities, performance, scalability, and savings that fulfill the promise of an open data lakehouse. By following the lessons in this book, you’ll be able to achieve interactive, batch, machine learning, and streaming analytics with this high-performance open source format. Authors Tomer Shiran, Jason Hughes, and Alex Merced from Dremio show you how to get started with Iceberg.

With this book, you’ll learn:

  • The architecture of Apache Iceberg tables
  • What happens under the hood when you perform operations on Iceberg tables
  • How to further optimize Iceberg tables for maximum performance
  • How to use Iceberg with popular data engines such as Apache Spark, Apache Flink, and Dremio

Discover why Apache Iceberg is a foundational technology for implementing an open data lakehouse.

Author: Tomer Shiran, Jason Hughes, Alex Merced
Binding Type: Paperback
Publisher: O’Reilly Media
Published: 06/11/2024
Pages: 341
Weight: 1.21lbs
Size: 9.19h x 7.00w x 0.72d
ISBN: 9781098148621

113 in stock

More Great AI Books

Ready to Lead the AI Revolution?

Don’t just keep up-get ahead. Break the scrolling habit.
Join The AI Book Club Today.

Sign up now and get 15% off your first box, plus exclusive access to our members-only content. Your journey to becoming an AI expert starts here—don’t miss out on the future.