March 2022_virtual lab

HANDS-ON VIRTUAL LAB FOR DATA PLATFORM ENGINEERS:

Building an Open Data Lakehouse with Presto, Hudi, and AWS S3

flask

Learn how to build an open data lakehouse stack using Presto, Apache Hudi and AWS S3 in this 90 minute free hands-on lab.

This event has ended

Presented by: ahana trademarked site logo

In this 90 minute hands on-virtual lab we’ll show you how to build an Open Data Lakehouse stack with Presto, Apache Hudi, and AWS S3.

Come prepared to have your video on while following along with the instructor – this is an interactive session and we encourage participation from everyone attending!

This event has ended

To keep up-to-date on Ahana events sign up for our newsletter.

What you’ll learn:

  • A quick overview on the open data lakehouse stack, including what is Presto (query engine) and what is Apache Hudi (transaction layer)
  • How to get HUDI support on Presto
  • Querying HUDI data with Presto  
  • How to use Presto to query your AWS S3 Data Lake
  • Future – Whats additional HUDI support is coming to Presto

Course Outline:

  • Introduction to Presto
  • Walk through writing data using Spark in HUDI format
  • Querying data via Presto
  • How ACID compliance works in this stack
  • Out of the box support for Hudi on Presto

By the end of this lab, you’ll know how to run queries with Presto and Hudi to optimize your AWS S3 data lake.