Delta Lake

Author : Denny Lee
Total Pages : 84
Release : 2022
OCLC : 1247857155

Book Synopsis: Delta Lake by Denny Lee

Delta Lake, written by Denny Lee, was released in 2022 and runs 84 pages.

Book excerpt: Analysis and machine learning models are only as good as the data they're built on. Querying processed data and getting insights from it requires a robust data pipeline and an effective storage solution that ensures data quality, data integrity, and performance. This guide introduces you to Delta Lake, an open-source format that enables building a lakehouse architecture on top of existing storage systems such as S3, ADLS, GCS, and HDFS. Delta Lake enhances Apache Spark and makes it easy to store and manage massive amounts of complex data by supporting data integrity, data quality, and performance. Data engineers, data scientists, and data practitioners will learn how to build reliable data lakes and data pipelines at scale using Delta Lake.

Understand key data reliability challenges and how to tackle them.
Learn how to use Delta Lake to realize data reliability improvements.
Concurrently run streaming and batch jobs against your data lake.
Execute update, delete, and merge commands against your data lake.
Use time travel to roll back and examine previous versions of your data.
Learn best practices to build effective, high-quality end-to-end data pipelines for real-world use cases.
Integrate with other data technologies like Presto, Athena, Redshift, and other BI tools.
Learn how thousands of companies are processing exabytes of data per month with their lakehouse architecture using Delta Lake.


Delta Lake Related Books

Delta Lake
Language: en
Pages: 84
Authors: Denny Lee
Type: BOOK - Published: 2022


Analysis and machine learning models are only as good as the data they're built on. Querying processed data and getting insights from it requires a robust data
Data Engineering with Apache Spark, Delta Lake, and Lakehouse
Language: en
Pages: 480
Authors: Manoj Kukreja
Categories: Computers
Type: BOOK - Published: 2021-10-22 - Publisher: Packt Publishing Ltd


Understand the complexities of modern-day data engineering platforms and explore strategies to deal with them with the help of use case scenarios led by an indu
Trino: The Definitive Guide
Language: en
Pages: 310
Authors: Matt Fuller
Categories: Computers
Type: BOOK - Published: 2021-04-14 - Publisher: O'Reilly Media, Inc.


Perform fast interactive analytics against different data sources using the Trino high-performance distributed SQL query engine. With this practical guide, you'
Spark: The Definitive Guide
Language: en
Pages: 594
Authors: Bill Chambers
Categories: Computers
Type: BOOK - Published: 2018-02-08 - Publisher: O'Reilly Media, Inc.


Learn how to use, deploy, and maintain Apache Spark with this comprehensive guide, written by the creators of the open-source cluster-computing framework. With
Delta Lake: The Definitive Guide
Language: en
Pages: 0
Authors: Denny Lee
Categories: Computers
Type: BOOK - Published: 2024-11-30


Discover how Delta Lake simplifies the process of building data lakehouses and data pipelines at scale. With this practical guide, data engineers, data scientis