
Presto Training & Learning Center
The Ahana Learning Center covers beginner to advanced level Presto topics, questions, and answers to help you learn Presto.
Topics
AWS Athena vs Snowflake
The High Level Overview Snowflake and Amazon Athena are both cloud analytics tools, but are significantly different in terms of their architecture. Athena is a serverless query engine based on … Continue reading AWS Athena vs Snowflake
Starburst vs Snowflake
The High Level Overview Starburst and Snowflake are both in the data analytics space but are significantly different in terms of their architecture and use cases. Starburst is the corporate … Continue reading Starburst vs Snowflake
Presto and ETL – Clarifying Common Misconceptions
Data integration and ETL are typically seen as crucial to gaining insights from data. But when it comes to Presto, things get trickier: Should you use Presto for ETL? Or … Continue reading Presto and ETL – Clarifying Common Misconceptions
S3 Select vs. AWS Athena – The Quick Comparison
Data analysts and data engineers need simpler ways to access business data stored on Amazon S3. Amazon Athena and S3 Select are two services that allow you to retrieve records … Continue reading S3 Select vs. AWS Athena – The Quick Comparison
4 Trino Alternatives for Better Price / Performance
Trino, a distributed SQL query engine, is known for its ability to process large amounts of semi-structured data using familiar SQL semantics. However, there are situations where an alternative may … Continue reading 4 Trino Alternatives for Better Price / Performance
Exploring Data Warehouse, Data Mesh, and Data Lakehouse: What’s right for you?
We’re hosting a free hands-on lab on building your own Data Lakehouse in AWS. You’ll get trained by Presto and Apache Hudi experts. When it comes to data management, there … Continue reading Exploring Data Warehouse, Data Mesh, and Data Lakehouse: What’s right for you?
AWS Athena vs. Databricks
In this article we’ll look at two different technologies in the data space and share more about which to use based on your use case and workloads. The High Level … Continue reading AWS Athena vs. Databricks
Starburst vs. Athena: Evaluating different Presto vendors
Starburst vs. Athena – what are the differences between these two Presto vendors and how should you choose based on your workloads and use case? In this article we’ll discuss both.
PrestoDB on AWS
What is Presto? Presto is an open-source distributed SQL query engine for running interactive analytic queries against all types of data sources. Learn more about PrestoDB on AWS.
What is Presto? | Presto Caching, Data Sources & Usage Intro
Learn more about what Presto is, how it was developed, and how to use it.
Hive vs Presto vs Spark for Data Analysis
Compare Presto to Hive and compare Presto to Spark for data analysis, Learn is best for you based on your data analytics goals.
Ahana Cloud for Presto Versus Amazon EMR
In this brief post, we’ll discuss some of the benefits of Ahana Cloud over Amazon Elastic MapReduce (EMR). While EMR offers optionality in the number of big data compute frameworks, … Continue reading Ahana Cloud for Presto Versus Amazon EMR
Presto Geospatial Functions Tutorial & Guide l Ahana
Presto Geospatial Functions Tutorial A question that often comes up is “how do I do geospatial queries and spatial joins in Presto?”. Fortunately Presto supports a wealth of functions and … Continue reading Presto Geospatial Functions Tutorial & Guide l Ahana
Python Connect to Presto | Python Script To Connect To Presto Cluster
How to Write a Python Script to Connect to a Presto Cluster If you need a quick tutorial on how to write a python script to connect to a Presto … Continue reading Python Connect to Presto | Python Script To Connect To Presto Cluster
Presto Resource Groups | Setting Up Presto Resource Group | Ahana
Setting-up Presto Resource Groups Before you start allowing users on your Presto cluster it’s best practice to configure resource groups. A resource group is a query admission control and workload … Continue reading Presto Resource Groups | Setting Up Presto Resource Group | Ahana
How To Connect To Presto | Connecting To Presto | Ahana
How To Connect To Presto Connections to and from Presto fall into three main categories: 1) The Presto connectors that provide access to source data in various databases and file … Continue reading How To Connect To Presto | Connecting To Presto | Ahana
How to Access Presto | Best Ways to Connect to Presto | Ahana
How To Access Presto There are several ways to access and use Presto. Here are 3 of the most common ways: Command Line Interface The Presto CLI provides a terminal-based … Continue reading How to Access Presto | Best Ways to Connect to Presto | Ahana
Presto Machine Learning | Machine Learning With Presto | Ahana
Machine Learning With Presto Presto is an invaluable tool for accessing and combining data, and performing feature engineering and feature extraction, in a scalable way ready for machine learning and … Continue reading Presto Machine Learning | Machine Learning With Presto | Ahana
How To Make Presto Query Engine Run Fastest | Ahana
How to Make The Presto Query Engine Run Fastest How do you make the Presto query engine run as fast as possible? The following list of things to consider, check … Continue reading How To Make Presto Query Engine Run Fastest | Ahana
Presto Connections & Connectors – JDBC, ODBC & More | Ahana
How do I connect to Presto? A Presto connection, connections to and from Presto, fall into two main categories: 1) The Presto connectors that provide access to source data in … Continue reading Presto Connections & Connectors – JDBC, ODBC & More | Ahana
How to Presto Data Share | Presto Data Sources & Usage | Ahana
How to Presto Data Share Presto is an open source distributed SQL query engine for running interactive analytic queries against data sources of all sizes ranging from gigabytes to petabytes. … Continue reading How to Presto Data Share | Presto Data Sources & Usage | Ahana
Presto Performance | Presto Speed, Optimization & Tuning | Ahana
Presto Performance: Speed, Optimization & Tuning Presto is an open source distributed ANSI SQL query engine for analytics. Presto supports the separation of compute and storage (i.e. it queries data … Continue reading Presto Performance | Presto Speed, Optimization & Tuning | Ahana
Presto REST API | Presto Client & REST API Examples | Ahana
Presto REST API Presto’s HTTP API (Presto Rest API) is the communication protocol between server and client. It’s used to send query statements for execution on the server and to … Continue reading Presto REST API | Presto Client & REST API Examples | Ahana
Presto Parquet | Presto Parquet Tools & Tutorial | Ahana
Presto Parquet Parquet is a columnar storage format for Hadoop, supported by Presto. Storing data as columns as opposed to rows enables Presto to fulfill queries more efficiently. By discarding … Continue reading Presto Parquet | Presto Parquet Tools & Tutorial | Ahana
Presto Speed: How Fast is Presto?
Presto Speed Presto is an open source distributed ANSI SQL query engine for analytics. Presto supports the separation of compute and storage (i.e. it queries data that is stored externally … Continue reading Presto Speed: How Fast is Presto?
Presto Tools & Presto Tool | Key Presto Libraries | Ahana
Presto Tools Several tools and libraries are available, some are bundled with core Presto and others are downloadable extras. See the documentation Resources page at https://prestodb.io/resources.html for details. Core Presto … Continue reading Presto Tools & Presto Tool | Key Presto Libraries | Ahana
Presto Documentation & Presto Docs
Presto Documentation Presto’s documentation is maintained by the Presto Foundation and the current version is available here: https://prestodb.io/docs/current/ Releases can be found at: https://github.com/prestodb and the Release Notes are at: … Continue reading Presto Documentation & Presto Docs
Presto Query Examples | Presto CTE Examples & SQL Syntax| Ahana
Presto Query Example Since Presto is an ANSI SQL query engine, Presto’s SQL will be very familiar to practically anyone who has used a database, despite the fact Presto is … Continue reading Presto Query Examples | Presto CTE Examples & SQL Syntax| Ahana
Presto Best Practices | What Are The Best Practices for Presto | Ahana
Presto Best Practices to Improve Performance and Reduce Costs We’ve put together a list of Presto best practices, based on our experience. We hope you find these helpful. Configuration Configure … Continue reading Presto Best Practices | What Are The Best Practices for Presto | Ahana
Apache Presto Architecture & Design | PrestoDB Architecture | Ahana
Presto (and PrestoDB) Architecture and Design The basics The Presto architecture and design seamlessly integrates with an existing data ecosystem without needing to modify it. It’s an additional faster data … Continue reading Apache Presto Architecture & Design | PrestoDB Architecture | Ahana
Presto Guide, Documentation & Support | Learn Presto | Ahana
Presto Guide, Documentation & Support Presto is a fast, flexible distributed SQL engine created and used by Facebook at scale. O’Reilly’s Learning and Operating Presto ebook is an approachable guide … Continue reading Presto Guide, Documentation & Support | Learn Presto | Ahana
Presto Data Sources | Cassandra Presto & Presto Accumlo | Ahana
Presto Data Sources Presto was designed and written from the ground up to efficiently query data against data sources of all sizes, ranging from gigabytes to petabytes. Presto connects to … Continue reading Presto Data Sources | Cassandra Presto & Presto Accumlo | Ahana
Presto Server Types | Presto Server | Ahana
Presto Server Types When it comes to Presto Server Types, Coordinators and workers are the two types of Presto servers. Here’s a look at each server type in more detail. … Continue reading Presto Server Types | Presto Server | Ahana
Is Presto Free | Sign Up For Presto Free | Presto Open source
Is Presto Free? Many folks may wonder “is Presto free”? In fact, PrestoDB is a free, open source federated, distributed SQL query engine used for ad hoc analytics. The PrestoDB … Continue reading Is Presto Free | Sign Up For Presto Free | Presto Open source
EMR Cluster with Spark vs Presto | Presto Elasticsearch & HBase | Ahana
Presto vs Spark With EMR Cluster Amazon EMR is a managed cluster platform that simplifies running big data frameworks, such as Apache Hadoop and Apache Spark, solely on AWS. An … Continue reading EMR Cluster with Spark vs Presto | Presto Elasticsearch & HBase | Ahana
Presto vs Spark SQL | Compare Spark SQL & Presto | Ahana
Spark SQL vs Presto Spark SQL and Presto, have become increasingly popular due to their capabilities in processing large amounts of data from various sources. In this blog post, we … Continue reading Presto vs Spark SQL | Compare Spark SQL & Presto | Ahana
Presto Catalogs & Presto Use Catalogs | Data Catalog Connector | Ahana
Presto Catalogs Presto has several important components that allow you to easily manage data. These components are catalogs, tables and schemas, and connectors. Presto accesses data via connectors; each data … Continue reading Presto Catalogs & Presto Use Catalogs | Data Catalog Connector | Ahana
Spark Streaming Alternatives | Alternatives to Using Apache Spark | Ahana
Spark Streaming Alternatives When researching Spark alternatives it really depends on your use case. Are you processing streaming data or batch data? Do you prefer an open or closed source/proprietary … Continue reading Spark Streaming Alternatives | Alternatives to Using Apache Spark | Ahana
Presto Engine & Database | Presto Query Execution Model | Ahana
Presto Engine and Database Explained What is Presto? PrestoDB is an open-source distributed SQL query engine for running interactive analytic queries against all types of data sources. It enables self-service … Continue reading Presto Engine & Database | Presto Query Execution Model | Ahana
Querying AWS S3 with Presto | Ahana
Querying AWS S3 with Presto? In this guide, you’ll learn how to query AWS S3 using Presto. Learn more about how to get started.
EMR Presto Optimization & Configuration | Amazon EMR Presto Versions
Optimize Presto on Amazon EMR What is Amazon EMR? Amazon Elastic MapReduce (EMR) simplifies running big data and analytics frameworks like Presto for scalable compute in the cloud. It provides … Continue reading EMR Presto Optimization & Configuration | Amazon EMR Presto Versions
Hadoop Presto | Using Presto With Hadoop Interactive Queries | Ahana
Using Presto with Hadoop How does Presto work with Hadoop? What is Presto Hadoop? You use Presto to run interactive queries on Hadoop. The difference between using Presto versus something … Continue reading Hadoop Presto | Using Presto With Hadoop Interactive Queries | Ahana
Presto and ETL – Clarifying Common Misconceptions
Data integration and ETL are typically seen as crucial to gaining insights from data. But when it comes to Presto, things get trickier: Should you use Presto for ETL? Or … Continue reading Presto and ETL – Clarifying Common Misconceptions
Presto REST API | Presto Client & REST API Examples | Ahana
Presto REST API Presto’s HTTP API (Presto Rest API) is the communication protocol between server and client. It’s used to send query statements for execution on the server and to … Continue reading Presto REST API | Presto Client & REST API Examples | Ahana
Apache Presto Architecture & Design | PrestoDB Architecture | Ahana
Presto (and PrestoDB) Architecture and Design The basics The Presto architecture and design seamlessly integrates with an existing data ecosystem without needing to modify it. It’s an additional faster data … Continue reading Apache Presto Architecture & Design | PrestoDB Architecture | Ahana
Presto Data Sources | Cassandra Presto & Presto Accumlo | Ahana
Presto Data Sources Presto was designed and written from the ground up to efficiently query data against data sources of all sizes, ranging from gigabytes to petabytes. Presto connects to … Continue reading Presto Data Sources | Cassandra Presto & Presto Accumlo | Ahana
Presto Server Types | Presto Server | Ahana
Presto Server Types When it comes to Presto Server Types, Coordinators and workers are the two types of Presto servers. Here’s a look at each server type in more detail. … Continue reading Presto Server Types | Presto Server | Ahana
Is Presto Free | Sign Up For Presto Free | Presto Open source
Is Presto Free? Many folks may wonder “is Presto free”? In fact, PrestoDB is a free, open source federated, distributed SQL query engine used for ad hoc analytics. The PrestoDB … Continue reading Is Presto Free | Sign Up For Presto Free | Presto Open source
Presto Catalogs & Presto Use Catalogs | Data Catalog Connector | Ahana
Presto Catalogs Presto has several important components that allow you to easily manage data. These components are catalogs, tables and schemas, and connectors. Presto accesses data via connectors; each data … Continue reading Presto Catalogs & Presto Use Catalogs | Data Catalog Connector | Ahana
AWS Athena vs Snowflake
The High Level Overview Snowflake and Amazon Athena are both cloud analytics tools, but are significantly different in terms of their architecture. Athena is a serverless query engine based on … Continue reading AWS Athena vs Snowflake
Starburst vs Snowflake
The High Level Overview Starburst and Snowflake are both in the data analytics space but are significantly different in terms of their architecture and use cases. Starburst is the corporate … Continue reading Starburst vs Snowflake
S3 Select vs. AWS Athena – The Quick Comparison
Data analysts and data engineers need simpler ways to access business data stored on Amazon S3. Amazon Athena and S3 Select are two services that allow you to retrieve records … Continue reading S3 Select vs. AWS Athena – The Quick Comparison
4 Trino Alternatives for Better Price / Performance
Trino, a distributed SQL query engine, is known for its ability to process large amounts of semi-structured data using familiar SQL semantics. However, there are situations where an alternative may … Continue reading 4 Trino Alternatives for Better Price / Performance
Exploring Data Warehouse, Data Mesh, and Data Lakehouse: What’s right for you?
We’re hosting a free hands-on lab on building your own Data Lakehouse in AWS. You’ll get trained by Presto and Apache Hudi experts. When it comes to data management, there … Continue reading Exploring Data Warehouse, Data Mesh, and Data Lakehouse: What’s right for you?
AWS Athena vs. Databricks
In this article we’ll look at two different technologies in the data space and share more about which to use based on your use case and workloads. The High Level … Continue reading AWS Athena vs. Databricks
Starburst vs. Athena: Evaluating different Presto vendors
Starburst vs. Athena – what are the differences between these two Presto vendors and how should you choose based on your workloads and use case? In this article we’ll discuss both.
Hive vs Presto vs Spark for Data Analysis
Compare Presto to Hive and compare Presto to Spark for data analysis, Learn is best for you based on your data analytics goals.
Ahana Cloud for Presto Versus Amazon EMR
In this brief post, we’ll discuss some of the benefits of Ahana Cloud over Amazon Elastic MapReduce (EMR). While EMR offers optionality in the number of big data compute frameworks, … Continue reading Ahana Cloud for Presto Versus Amazon EMR
EMR Cluster with Spark vs Presto | Presto Elasticsearch & HBase | Ahana
Presto vs Spark With EMR Cluster Amazon EMR is a managed cluster platform that simplifies running big data frameworks, such as Apache Hadoop and Apache Spark, solely on AWS. An … Continue reading EMR Cluster with Spark vs Presto | Presto Elasticsearch & HBase | Ahana
Presto vs Spark SQL | Compare Spark SQL & Presto | Ahana
Spark SQL vs Presto Spark SQL and Presto, have become increasingly popular due to their capabilities in processing large amounts of data from various sources. In this blog post, we … Continue reading Presto vs Spark SQL | Compare Spark SQL & Presto | Ahana
Spark Streaming Alternatives | Alternatives to Using Apache Spark | Ahana
Spark Streaming Alternatives When researching Spark alternatives it really depends on your use case. Are you processing streaming data or batch data? Do you prefer an open or closed source/proprietary … Continue reading Spark Streaming Alternatives | Alternatives to Using Apache Spark | Ahana
Presto Geospatial Functions Tutorial & Guide l Ahana
Presto Geospatial Functions Tutorial A question that often comes up is “how do I do geospatial queries and spatial joins in Presto?”. Fortunately Presto supports a wealth of functions and … Continue reading Presto Geospatial Functions Tutorial & Guide l Ahana
Python Connect to Presto | Python Script To Connect To Presto Cluster
How to Write a Python Script to Connect to a Presto Cluster If you need a quick tutorial on how to write a python script to connect to a Presto … Continue reading Python Connect to Presto | Python Script To Connect To Presto Cluster
Presto Resource Groups | Setting Up Presto Resource Group | Ahana
Setting-up Presto Resource Groups Before you start allowing users on your Presto cluster it’s best practice to configure resource groups. A resource group is a query admission control and workload … Continue reading Presto Resource Groups | Setting Up Presto Resource Group | Ahana
How To Connect To Presto | Connecting To Presto | Ahana
How To Connect To Presto Connections to and from Presto fall into three main categories: 1) The Presto connectors that provide access to source data in various databases and file … Continue reading How To Connect To Presto | Connecting To Presto | Ahana
How to Access Presto | Best Ways to Connect to Presto | Ahana
How To Access Presto There are several ways to access and use Presto. Here are 3 of the most common ways: Command Line Interface The Presto CLI provides a terminal-based … Continue reading How to Access Presto | Best Ways to Connect to Presto | Ahana
Presto Machine Learning | Machine Learning With Presto | Ahana
Machine Learning With Presto Presto is an invaluable tool for accessing and combining data, and performing feature engineering and feature extraction, in a scalable way ready for machine learning and … Continue reading Presto Machine Learning | Machine Learning With Presto | Ahana
How To Make Presto Query Engine Run Fastest | Ahana
How to Make The Presto Query Engine Run Fastest How do you make the Presto query engine run as fast as possible? The following list of things to consider, check … Continue reading How To Make Presto Query Engine Run Fastest | Ahana
Presto Connections & Connectors – JDBC, ODBC & More | Ahana
How do I connect to Presto? A Presto connection, connections to and from Presto, fall into two main categories: 1) The Presto connectors that provide access to source data in … Continue reading Presto Connections & Connectors – JDBC, ODBC & More | Ahana
How to Presto Data Share | Presto Data Sources & Usage | Ahana
How to Presto Data Share Presto is an open source distributed SQL query engine for running interactive analytic queries against data sources of all sizes ranging from gigabytes to petabytes. … Continue reading How to Presto Data Share | Presto Data Sources & Usage | Ahana
Presto Performance | Presto Speed, Optimization & Tuning | Ahana
Presto Performance: Speed, Optimization & Tuning Presto is an open source distributed ANSI SQL query engine for analytics. Presto supports the separation of compute and storage (i.e. it queries data … Continue reading Presto Performance | Presto Speed, Optimization & Tuning | Ahana
Presto REST API | Presto Client & REST API Examples | Ahana
Presto REST API Presto’s HTTP API (Presto Rest API) is the communication protocol between server and client. It’s used to send query statements for execution on the server and to … Continue reading Presto REST API | Presto Client & REST API Examples | Ahana
Presto Parquet | Presto Parquet Tools & Tutorial | Ahana
Presto Parquet Parquet is a columnar storage format for Hadoop, supported by Presto. Storing data as columns as opposed to rows enables Presto to fulfill queries more efficiently. By discarding … Continue reading Presto Parquet | Presto Parquet Tools & Tutorial | Ahana
Presto Speed: How Fast is Presto?
Presto Speed Presto is an open source distributed ANSI SQL query engine for analytics. Presto supports the separation of compute and storage (i.e. it queries data that is stored externally … Continue reading Presto Speed: How Fast is Presto?
Presto Tools & Presto Tool | Key Presto Libraries | Ahana
Presto Tools Several tools and libraries are available, some are bundled with core Presto and others are downloadable extras. See the documentation Resources page at https://prestodb.io/resources.html for details. Core Presto … Continue reading Presto Tools & Presto Tool | Key Presto Libraries | Ahana
Presto Documentation & Presto Docs
Presto Documentation Presto’s documentation is maintained by the Presto Foundation and the current version is available here: https://prestodb.io/docs/current/ Releases can be found at: https://github.com/prestodb and the Release Notes are at: … Continue reading Presto Documentation & Presto Docs
Presto Query Examples | Presto CTE Examples & SQL Syntax| Ahana
Presto Query Example Since Presto is an ANSI SQL query engine, Presto’s SQL will be very familiar to practically anyone who has used a database, despite the fact Presto is … Continue reading Presto Query Examples | Presto CTE Examples & SQL Syntax| Ahana
Presto Best Practices | What Are The Best Practices for Presto | Ahana
Presto Best Practices to Improve Performance and Reduce Costs We’ve put together a list of Presto best practices, based on our experience. We hope you find these helpful. Configuration Configure … Continue reading Presto Best Practices | What Are The Best Practices for Presto | Ahana
Presto Guide, Documentation & Support | Learn Presto | Ahana
Presto Guide, Documentation & Support Presto is a fast, flexible distributed SQL engine created and used by Facebook at scale. O’Reilly’s Learning and Operating Presto ebook is an approachable guide … Continue reading Presto Guide, Documentation & Support | Learn Presto | Ahana