Presto Training & Learning Center

The Ahana™ Learning Center covers beginner to advanced level Presto topics, questions, and answers to help you learn Presto.

Topics

Virtual Lab: Building an Open Data Lakehouse with Presto, Hudi, and AWS S3

Aug 12, 20222 min read

Learn how to build an open data lakehouse stack using Presto, Apache Hudi and AWS S3 in this free hands-on lab.

Ahana Awarded Many Industry Recognitions and Accolades for Big Data, Data Analytics and Presto Innovations

Aug 3, 20225 min read

Ahana, the only SaaS for Presto, today announced many new industry accolades in 1H 2022.

Amazon Redshift Spectrum vs Redshift: Key Differences

Aug 2, 20224 min read

Redshift vs Redshift Spectrum: A Complete Comparison Amazon Redshift is a cloud-based data warehouse service offered by Amazon. Redshift is a columnar database which is optimized to handle the sort … Continue reading Amazon Redshift Spectrum vs Redshift: Key Differences

Using AWS Redshift Spectrum in AWS Lake Formation | Ahana

Jul 25, 20225 min read

Lake Formation makes it easier to set up the data lake, and to incorporate Redshift as part of the compute layer alongside other analytics tools and services.

Ahana to Present About Presto on the Open Data Lakehouse at PrestoCon Day; Ahana Customer Blinkit to Discuss Its Presto on AWS Use Case

Jul 14, 20225 min read

Ahana, the only SaaS for Presto, today announced its participation in PrestoCon Day, a day dedicated to all things Presto taking place virtually on Thursday, July 21, 2022.

Data Warehouse: Understanding the Types & Architecture

Jul 5, 20225 min read

Data Warehouse: A Comprehensive Guide Introduction A data warehouse is a data repository that is typically used for analytic systems and Business Intelligence tools. It is typically composed of operational … Continue reading Data Warehouse: Understanding the Types & Architecture

Data Warehouse Concepts for Beginners | Ahana

Jul 5, 20225 min read

Data Warehouse Concepts for Beginners A data warehouse is a relational database that is designed for query and analysis rather than for transaction processing. Typically a data warehouse contains historical … Continue reading Data Warehouse Concepts for Beginners | Ahana

Ahana Will Co-Lead Session At Data & AI Summit About Presto Open Source SQL Query Engine

Jun 23, 20223 min read

Ahana Will Co-Lead Session At Data & AI Summit About Presto Open Source SQL Query Engine San Mateo, Calif. – June 23, 2022 — Ahana, the only SaaS for Presto, … Continue reading Ahana Will Co-Lead Session At Data & AI Summit About Presto Open Source SQL Query Engine

AWS Redshift Limitations | Redshift Pros and Cons

Jun 17, 20225 min read

Redshift is an Amazon petabyte-scale data warehouse product that is based on PostgreSQL version 8.0.2. While there are pros to Redshift, there are also cons. One of which is query limits. In this article we will dive deeper into the restrictions of Redshift including query limitations.

Ahana Announces Additional $7.2 Million Funding Led by Liberty Global Ventures and Debuts Free Community Edition of Ahana Cloud for Presto for the Open Data Lakehouse

Jun 16, 20225 min read

Ahana Announces Additional $7.2 Million Funding Led by Liberty Global Ventures and Debuts Free Community Edition of Ahana Cloud for Presto for the Open Data Lakehouse 

Ahana Will Co-Lead Session At Open Source Summit About Presto SQL Query Engine

Jun 14, 20223 min read

Ahana, the only SaaS for Presto, today announced that Rohan Pednekar, Ahana’s senior product manager, will co-lead a session with Meta Developer Advocate Philip Bell at the Linux Foundation’s Open Source Summit about Presto, the Meta-born open source high performance, distributed SQL query engine.

Announcing the Cube integration with Ahana: Querying multiple data sources with managed Presto and Cube

Jun 8, 202233 min read

See how Ahana and Cube work together to help you set up a Presto cluster and build a single source of truth for metrics without spending days reading cryptic docs

AWS Athena Limitations

Jun 7, 20226 min read

AWS Athena query limits cause problems, and data engineering teams spend hours diagnosing them. Learn what the limitations are and how to fix.

What is Amazon Redshift Spectrum? | Ahana

Jun 7, 20225 min read

Launched in 2017, Redshift Spectrum is a feature within Redshift that enables you to query data stored in AWS S3 using SQL. Spectrum allows you to do federated queries from within the Redshift SQL query editor to data in S3, while also being able to combine it with data in Redshift.

Building a Data Lake Using Lake Formation on AWS

Jun 7, 20225 min read

AWS lake formation helps users to build, manage and secure their data lakes in a very short amount of time, meaning days instead of months as is common with a traditional data lake approach.

Building an Open Data Lakehouse with Presto, Hudi and AWS S3

Jun 6, 20228 min read

Learn how you can start building an Open Data Lakehouse analytics stack using Presto, Hudi and AWS S3 and solve the challenges of a data warehouse.

Understanding AWS Redshift Pricing | Ahana

Jun 6, 20225 min read

AWS Redshift is a completely managed cloud data warehouse service with the ability to scale on-demand and is compatible with multitudes of AWS tools and technologies. AWS Redshift is considered the preferred cloud data warehouse of choice for most customers but the pricing is not simple, since it tries to accommodate different use cases and customers. Let us try to understand the pricing details of Amazon Redshift.

From Lake to Shining Lakehouse, A New Era in Data

Jun 3, 20222 min read

During this episode of DM Radio you will learn from experts Raj K of General Dynamics Information technology and Wen Phan of Ahana.

Presto on AWS | Run Presto on AWS Athena & EMR

May 30, 20224 min read

What is Presto? Presto is an open-source distributed SQL query engine for running interactive analytic queries against all types of data sources. Learn more about what Presto is, how it’s used, and how to get started.

What is Presto? | Presto Caching, Data Sources & Usage Intro

May 30, 20225 min read

Learn more about what Presto is, how it was developed, and how to use it.

Presto vs Snowflake: Data Warehousing Comparisons

May 27, 20228 min read

Snowflake is a cloud data warehouse that offers a cloud-based data storage and analytics service. Snowflake runs completely on cloud infrastructure.

AWS Redshift Data Warehouse Architecture | Ahana

May 24, 20228 min read

Amazon Redshift is a cloud data warehouse offered as a managed service by AWS, and a popular choice for business intelligence and reporting use cases

Data warehouse or Data Lake, which one do I choose? | Ahana

May 20, 20222 min read

In this webinar, you’ll hear from Ali LeClerc who will discuss the data landscape and why many companies are moving to an open data lakehouse.

Ahana Announces New Presto Query Analyzer to Bring Instant Insights into Presto Clusters

May 18, 20223 min read

With the Presto Query Analyzer, data platform teams can get instant insights into their Presto clusters including query performance, bandwidth bottlenecks, and much more. The Presto Query Analyzer was built for the Presto community and is free to use.

What Is AWS Redshift Used For | Redshift Use Cases

May 13, 20226 min read

Amazon Redshift is one of the most widely-used services in the AWS ecosystem and is a familiar component in many cloud architectures.

ETL and ELT | What are the differences between ETL and ELT

May 12, 20224 min read

ETL and ELT in Data Warehousing What is ETL and ELT? ETL, or Extract Transform Load, is when an ETL tool or series of homegrown programs extracts data from a … Continue reading ETL and ELT | What are the differences between ETL and ELT

How to run SQL queries with Presto on Amazon Redshift

May 10, 20229 min read

In this tutorial, we use a step-by-step approach so you can learn how to query redshift using SQL with Presto (running with Kubernetes).

The Differences Between AWS RedShift Spectrum vs Athena

May 1, 20225 min read

AWS Redshift Spectrum vs Athena: Redshift is the storage and Redshift Spectrum is a SQL engine extension. Athena is a standalone SQL engine

How Much Does Amazon Athena Cost? | Ahana

Apr 25, 20224 min read

Understanding AWS Athena Costs with Examples What Is Amazon Athena?  Since you’re reading this to understand Athena costs, you likely already know, so we’ll just very briefly touch on what … Continue reading How Much Does Amazon Athena Cost? | Ahana

5 Components of Data Warehouse Architecture | Ahana

Apr 25, 20225 min read

In this article we’ll look at the contextual requirements of a data warehouse, which are the five components of a data warehouse.

What is a Data Lakehouse Architecture?

Apr 16, 20225 min read

Overview The term Data Lakehouse has become very popular over the last year or so, especially as more customers are migrating their workloads to the cloud. This article will help … Continue reading What is a Data Lakehouse Architecture?

An introduction to Ahana Cloud for Presto on AWS

Apr 14, 20222 min read

During this webinar we will share how to build an open data lakehouse with Presto and AWS S3 using Ahana Cloud.

Price-Performance Ratio of Athena vs Ahana

Apr 13, 20225 min read

Price-Performance Ratio of AWS Athena Presto vs Ahana Cloud for Presto Understand the price-performance ratio of Amazon Athena vs. Ahana. Both AWS Athena and Ahana Cloud are based on the … Continue reading Price-Performance Ratio of Athena vs Ahana

AWS Lake Formation for Enterprise Data Lakes | Ahana Cloud

Apr 6, 20223 min read

AWS Lake Formation is a service that makes it easy to set up a secure data lake very quickly (in a matter of days), providing a governance layer for data lakes on AWS S3. 

How to build an Open Data Lakehouse Analytics stack

Mar 31, 20222 min read

During this webinar we’ll show you how you can build an open data lakehouse stack. At the heart of this stack is Presto, the open source SQL query engine for the data lake, and the transaction manager / governance layer, which includes technologies like Apache Hudi, Delta Lake, and AWS Lake Formation.

How to Use AWS Athena to Query JSON Data | Ahana

Mar 30, 20224 min read

A popular use case is to use Athena to query Parquet, ORC, CSV and JSON files that are typically used for querying directly, or transformed and loaded into a data warehouse.

Tutorial: How to run SQL queries with Presto on BigQuery

Mar 20, 20227 min read

Presto has evolved into a unified SQL engine on top of cloud data lakes for both interactive queries as well as batch workloads with multiple data sources. This tutorial is … Continue reading Tutorial: How to run SQL queries with Presto on BigQuery

Unlocking the Business Value of the Data Lake

Mar 17, 20222 min read

During this webinar we’ll discuss how nearly three-fifths of organizations have gained competitive advantage from their data lake initiatives. That includes unleashing the intelligence-generating potential of a data lake that enables ad hoc data discovery and analytics in an open and flexible manner. We’ll cover:

What is an Open Data Lake in the Cloud?

Mar 16, 20223 min read

The Open Data Lake in the cloud is the solution to the massive data problem. Many companies are adopting that architecture because of better price-performance, scale, and non-proprietary architecture.

The Differences Between AWS Athena and AWS Glue | Ahana

Mar 16, 20224 min read

Here, we are going to talk about AWS Athena vs Glue, which is an interesting pairing as they are both complementary and competitive. So, what are they exactly?

Best Practices for Resource Management in PrestoDB

Mar 11, 20229 min read

Resource management in databases allows administrators to have control over resources and assign a priority to sessions, ensuring the most important transactions get the major share of system resources. Resource management in a distributed environment makes accessibility of data easier and manages resources over the network of autonomous computers (i.e. Distributed System). The basis of resource management in the distributed system is also resource sharing.

How to Query Parquet Files using Amazon Athena | Ahana

Mar 9, 20224 min read

Querying Parquet Files using AWS Amazon Athena Parquet is one of the latest file formats with many advantages over some of the more commonly used formats like CSV and JSON. … Continue reading How to Query Parquet Files using Amazon Athena | Ahana

Configuring RaptorX – a multi-level caching with Presto

Mar 8, 202211 min read

RaptorX Background and Context Meta introduced a multi-level cache at PrestoCon 2021. Code-named the “RaptorX Project,” it aims to make Presto 10x faster on Meta- scale petabyte workloads. Here at … Continue reading Configuring RaptorX – a multi-level caching with Presto

AWS Lake Formation Blueprints | Amazon Blueprint Types

Mar 7, 20222 min read

This article is focused on the first step and how AWS Lake Formation Blueprints can make that easy and automated. Before you can run analytics to get insights, you need your data continuously pooling into your lake!

Ahana Announces New Security Capabilities to Bring Next Level of Security to the Data Lake

Feb 23, 20226 min read

Ahana, the only SaaS for Presto, today announced significant new security features added to its Ahana Cloud for Presto managed service. They include multi-user support for Presto and Ahana, fine-grained access control for data lakes with deep Apache Ranger integration, and audit support for all access.

Difference Between AWS Lake Formation vs AWS Glue

Feb 22, 20223 min read

AWS Lake Formation vs AWS Glue – What are the differences? As you start building your analytics stack in AWS, there are several AWS technologies to understand as you begin. … Continue reading Difference Between AWS Lake Formation vs AWS Glue

Limitations of Amazon S3 Select | AWS Select Capabilities

Feb 2, 20226 min read

Amazon S3 Select Limitations What is Amazon S3 Select? Amazon S3 Select allows you to use simple structured query language (SQL) statements to filter the contents of an Amazon S3 … Continue reading Limitations of Amazon S3 Select | AWS Select Capabilities

How To Query Data in AWS S3 Using Athena | Ahana

Feb 1, 20224 min read

Learn how to use Athena to query Amazon S3 and start running queries on your S3 data lake. Query JSON, Apache Parquet, Apache ORC, CSV, and more.

What is AWS Lake Formation? | Amazon S3 Lake formation

Jan 31, 20222 min read

What is AWS Lake Formation? For AWS users who want to get governance on their data lake, AWS Lake Formation is a service that makes it easy to set up … Continue reading What is AWS Lake Formation? | Amazon S3 Lake formation

How does Presto work with LDAP | Presto LDAP Authentication

Jan 28, 20223 min read

LDAP is an industry standard application used for directory services authentication. Learn how does Presto work with LDAP

Virtual Lab: Building an Open Data Lakehouse with Presto, Hudi, and AWS S3

Aug 12, 20222 min read

Learn how to build an open data lakehouse stack using Presto, Apache Hudi and AWS S3 in this free hands-on lab.

Ahana Awarded Many Industry Recognitions and Accolades for Big Data, Data Analytics and Presto Innovations

Aug 3, 20225 min read

Ahana, the only SaaS for Presto, today announced many new industry accolades in 1H 2022.

Amazon Redshift Spectrum vs Redshift: Key Differences

Aug 2, 20224 min read

Redshift vs Redshift Spectrum: A Complete Comparison Amazon Redshift is a cloud-based data warehouse service offered by Amazon. Redshift is a columnar database which is optimized to handle the sort … Continue reading Amazon Redshift Spectrum vs Redshift: Key Differences

Using AWS Redshift Spectrum in AWS Lake Formation | Ahana

Jul 25, 20225 min read

Lake Formation makes it easier to set up the data lake, and to incorporate Redshift as part of the compute layer alongside other analytics tools and services.

Ahana to Present About Presto on the Open Data Lakehouse at PrestoCon Day; Ahana Customer Blinkit to Discuss Its Presto on AWS Use Case

Jul 14, 20225 min read

Ahana, the only SaaS for Presto, today announced its participation in PrestoCon Day, a day dedicated to all things Presto taking place virtually on Thursday, July 21, 2022.

Data Warehouse: Understanding the Types & Architecture

Jul 5, 20225 min read

Data Warehouse: A Comprehensive Guide Introduction A data warehouse is a data repository that is typically used for analytic systems and Business Intelligence tools. It is typically composed of operational … Continue reading Data Warehouse: Understanding the Types & Architecture

Data Warehouse Concepts for Beginners | Ahana

Jul 5, 20225 min read

Data Warehouse Concepts for Beginners A data warehouse is a relational database that is designed for query and analysis rather than for transaction processing. Typically a data warehouse contains historical … Continue reading Data Warehouse Concepts for Beginners | Ahana

Ahana Will Co-Lead Session At Data & AI Summit About Presto Open Source SQL Query Engine

Jun 23, 20223 min read

Ahana Will Co-Lead Session At Data & AI Summit About Presto Open Source SQL Query Engine San Mateo, Calif. – June 23, 2022 — Ahana, the only SaaS for Presto, … Continue reading Ahana Will Co-Lead Session At Data & AI Summit About Presto Open Source SQL Query Engine

AWS Redshift Limitations | Redshift Pros and Cons

Jun 17, 20225 min read

Redshift is an Amazon petabyte-scale data warehouse product that is based on PostgreSQL version 8.0.2. While there are pros to Redshift, there are also cons. One of which is query limits. In this article we will dive deeper into the restrictions of Redshift including query limitations.

Ahana Announces Additional $7.2 Million Funding Led by Liberty Global Ventures and Debuts Free Community Edition of Ahana Cloud for Presto for the Open Data Lakehouse

Jun 16, 20225 min read

Ahana Announces Additional $7.2 Million Funding Led by Liberty Global Ventures and Debuts Free Community Edition of Ahana Cloud for Presto for the Open Data Lakehouse 

Ahana Will Co-Lead Session At Open Source Summit About Presto SQL Query Engine

Jun 14, 20223 min read

Ahana, the only SaaS for Presto, today announced that Rohan Pednekar, Ahana’s senior product manager, will co-lead a session with Meta Developer Advocate Philip Bell at the Linux Foundation’s Open Source Summit about Presto, the Meta-born open source high performance, distributed SQL query engine.

Announcing the Cube integration with Ahana: Querying multiple data sources with managed Presto and Cube

Jun 8, 202233 min read

See how Ahana and Cube work together to help you set up a Presto cluster and build a single source of truth for metrics without spending days reading cryptic docs

AWS Athena Limitations

Jun 7, 20226 min read

AWS Athena query limits cause problems, and data engineering teams spend hours diagnosing them. Learn what the limitations are and how to fix.

What is Amazon Redshift Spectrum? | Ahana

Jun 7, 20225 min read

Launched in 2017, Redshift Spectrum is a feature within Redshift that enables you to query data stored in AWS S3 using SQL. Spectrum allows you to do federated queries from within the Redshift SQL query editor to data in S3, while also being able to combine it with data in Redshift.

Building a Data Lake Using Lake Formation on AWS

Jun 7, 20225 min read

AWS lake formation helps users to build, manage and secure their data lakes in a very short amount of time, meaning days instead of months as is common with a traditional data lake approach.

Building an Open Data Lakehouse with Presto, Hudi and AWS S3

Jun 6, 20228 min read

Learn how you can start building an Open Data Lakehouse analytics stack using Presto, Hudi and AWS S3 and solve the challenges of a data warehouse.

Understanding AWS Redshift Pricing | Ahana

Jun 6, 20225 min read

AWS Redshift is a completely managed cloud data warehouse service with the ability to scale on-demand and is compatible with multitudes of AWS tools and technologies. AWS Redshift is considered the preferred cloud data warehouse of choice for most customers but the pricing is not simple, since it tries to accommodate different use cases and customers. Let us try to understand the pricing details of Amazon Redshift.

From Lake to Shining Lakehouse, A New Era in Data

Jun 3, 20222 min read

During this episode of DM Radio you will learn from experts Raj K of General Dynamics Information technology and Wen Phan of Ahana.

Presto on AWS | Run Presto on AWS Athena & EMR

May 30, 20224 min read

What is Presto? Presto is an open-source distributed SQL query engine for running interactive analytic queries against all types of data sources. Learn more about what Presto is, how it’s used, and how to get started.

What is Presto? | Presto Caching, Data Sources & Usage Intro

May 30, 20225 min read

Learn more about what Presto is, how it was developed, and how to use it.

Presto vs Snowflake: Data Warehousing Comparisons

May 27, 20228 min read

Snowflake is a cloud data warehouse that offers a cloud-based data storage and analytics service. Snowflake runs completely on cloud infrastructure.

AWS Redshift Data Warehouse Architecture | Ahana

May 24, 20228 min read

Amazon Redshift is a cloud data warehouse offered as a managed service by AWS, and a popular choice for business intelligence and reporting use cases

Data warehouse or Data Lake, which one do I choose? | Ahana

May 20, 20222 min read

In this webinar, you’ll hear from Ali LeClerc who will discuss the data landscape and why many companies are moving to an open data lakehouse.

Ahana Announces New Presto Query Analyzer to Bring Instant Insights into Presto Clusters

May 18, 20223 min read

With the Presto Query Analyzer, data platform teams can get instant insights into their Presto clusters including query performance, bandwidth bottlenecks, and much more. The Presto Query Analyzer was built for the Presto community and is free to use.

What Is AWS Redshift Used For | Redshift Use Cases

May 13, 20226 min read

Amazon Redshift is one of the most widely-used services in the AWS ecosystem and is a familiar component in many cloud architectures.

ETL and ELT | What are the differences between ETL and ELT

May 12, 20224 min read

ETL and ELT in Data Warehousing What is ETL and ELT? ETL, or Extract Transform Load, is when an ETL tool or series of homegrown programs extracts data from a … Continue reading ETL and ELT | What are the differences between ETL and ELT

How to run SQL queries with Presto on Amazon Redshift

May 10, 20229 min read

In this tutorial, we use a step-by-step approach so you can learn how to query redshift using SQL with Presto (running with Kubernetes).

The Differences Between AWS RedShift Spectrum vs Athena

May 1, 20225 min read

AWS Redshift Spectrum vs Athena: Redshift is the storage and Redshift Spectrum is a SQL engine extension. Athena is a standalone SQL engine

How Much Does Amazon Athena Cost? | Ahana

Apr 25, 20224 min read

Understanding AWS Athena Costs with Examples What Is Amazon Athena?  Since you’re reading this to understand Athena costs, you likely already know, so we’ll just very briefly touch on what … Continue reading How Much Does Amazon Athena Cost? | Ahana

5 Components of Data Warehouse Architecture | Ahana

Apr 25, 20225 min read

In this article we’ll look at the contextual requirements of a data warehouse, which are the five components of a data warehouse.

What is a Data Lakehouse Architecture?

Apr 16, 20225 min read

Overview The term Data Lakehouse has become very popular over the last year or so, especially as more customers are migrating their workloads to the cloud. This article will help … Continue reading What is a Data Lakehouse Architecture?

An introduction to Ahana Cloud for Presto on AWS

Apr 14, 20222 min read

During this webinar we will share how to build an open data lakehouse with Presto and AWS S3 using Ahana Cloud.

Price-Performance Ratio of Athena vs Ahana

Apr 13, 20225 min read

Price-Performance Ratio of AWS Athena Presto vs Ahana Cloud for Presto Understand the price-performance ratio of Amazon Athena vs. Ahana. Both AWS Athena and Ahana Cloud are based on the … Continue reading Price-Performance Ratio of Athena vs Ahana

AWS Lake Formation for Enterprise Data Lakes | Ahana Cloud

Apr 6, 20223 min read

AWS Lake Formation is a service that makes it easy to set up a secure data lake very quickly (in a matter of days), providing a governance layer for data lakes on AWS S3. 

How to build an Open Data Lakehouse Analytics stack

Mar 31, 20222 min read

During this webinar we’ll show you how you can build an open data lakehouse stack. At the heart of this stack is Presto, the open source SQL query engine for the data lake, and the transaction manager / governance layer, which includes technologies like Apache Hudi, Delta Lake, and AWS Lake Formation.

How to Use AWS Athena to Query JSON Data | Ahana

Mar 30, 20224 min read

A popular use case is to use Athena to query Parquet, ORC, CSV and JSON files that are typically used for querying directly, or transformed and loaded into a data warehouse.

Tutorial: How to run SQL queries with Presto on BigQuery

Mar 20, 20227 min read

Presto has evolved into a unified SQL engine on top of cloud data lakes for both interactive queries as well as batch workloads with multiple data sources. This tutorial is … Continue reading Tutorial: How to run SQL queries with Presto on BigQuery

Unlocking the Business Value of the Data Lake

Mar 17, 20222 min read

During this webinar we’ll discuss how nearly three-fifths of organizations have gained competitive advantage from their data lake initiatives. That includes unleashing the intelligence-generating potential of a data lake that enables ad hoc data discovery and analytics in an open and flexible manner. We’ll cover:

What is an Open Data Lake in the Cloud?

Mar 16, 20223 min read

The Open Data Lake in the cloud is the solution to the massive data problem. Many companies are adopting that architecture because of better price-performance, scale, and non-proprietary architecture.

The Differences Between AWS Athena and AWS Glue | Ahana

Mar 16, 20224 min read

Here, we are going to talk about AWS Athena vs Glue, which is an interesting pairing as they are both complementary and competitive. So, what are they exactly?

Best Practices for Resource Management in PrestoDB

Mar 11, 20229 min read

Resource management in databases allows administrators to have control over resources and assign a priority to sessions, ensuring the most important transactions get the major share of system resources. Resource management in a distributed environment makes accessibility of data easier and manages resources over the network of autonomous computers (i.e. Distributed System). The basis of resource management in the distributed system is also resource sharing.

How to Query Parquet Files using Amazon Athena | Ahana

Mar 9, 20224 min read

Querying Parquet Files using AWS Amazon Athena Parquet is one of the latest file formats with many advantages over some of the more commonly used formats like CSV and JSON. … Continue reading How to Query Parquet Files using Amazon Athena | Ahana

Configuring RaptorX – a multi-level caching with Presto

Mar 8, 202211 min read

RaptorX Background and Context Meta introduced a multi-level cache at PrestoCon 2021. Code-named the “RaptorX Project,” it aims to make Presto 10x faster on Meta- scale petabyte workloads. Here at … Continue reading Configuring RaptorX – a multi-level caching with Presto

AWS Lake Formation Blueprints | Amazon Blueprint Types

Mar 7, 20222 min read

This article is focused on the first step and how AWS Lake Formation Blueprints can make that easy and automated. Before you can run analytics to get insights, you need your data continuously pooling into your lake!

Ahana Announces New Security Capabilities to Bring Next Level of Security to the Data Lake

Feb 23, 20226 min read

Ahana, the only SaaS for Presto, today announced significant new security features added to its Ahana Cloud for Presto managed service. They include multi-user support for Presto and Ahana, fine-grained access control for data lakes with deep Apache Ranger integration, and audit support for all access.

Difference Between AWS Lake Formation vs AWS Glue

Feb 22, 20223 min read

AWS Lake Formation vs AWS Glue – What are the differences? As you start building your analytics stack in AWS, there are several AWS technologies to understand as you begin. … Continue reading Difference Between AWS Lake Formation vs AWS Glue

Limitations of Amazon S3 Select | AWS Select Capabilities

Feb 2, 20226 min read

Amazon S3 Select Limitations What is Amazon S3 Select? Amazon S3 Select allows you to use simple structured query language (SQL) statements to filter the contents of an Amazon S3 … Continue reading Limitations of Amazon S3 Select | AWS Select Capabilities

How To Query Data in AWS S3 Using Athena | Ahana

Feb 1, 20224 min read

Learn how to use Athena to query Amazon S3 and start running queries on your S3 data lake. Query JSON, Apache Parquet, Apache ORC, CSV, and more.

What is AWS Lake Formation? | Amazon S3 Lake formation

Jan 31, 20222 min read

What is AWS Lake Formation? For AWS users who want to get governance on their data lake, AWS Lake Formation is a service that makes it easy to set up … Continue reading What is AWS Lake Formation? | Amazon S3 Lake formation

How does Presto work with LDAP | Presto LDAP Authentication

Jan 28, 20223 min read

LDAP is an industry standard application used for directory services authentication. Learn how does Presto work with LDAP

Virtual Lab: Building an Open Data Lakehouse with Presto, Hudi, and AWS S3

Aug 12, 20222 min read

Learn how to build an open data lakehouse stack using Presto, Apache Hudi and AWS S3 in this free hands-on lab.

Ahana Awarded Many Industry Recognitions and Accolades for Big Data, Data Analytics and Presto Innovations

Aug 3, 20225 min read

Ahana, the only SaaS for Presto, today announced many new industry accolades in 1H 2022.

Amazon Redshift Spectrum vs Redshift: Key Differences

Aug 2, 20224 min read

Redshift vs Redshift Spectrum: A Complete Comparison Amazon Redshift is a cloud-based data warehouse service offered by Amazon. Redshift is a columnar database which is optimized to handle the sort … Continue reading Amazon Redshift Spectrum vs Redshift: Key Differences

Using AWS Redshift Spectrum in AWS Lake Formation | Ahana

Jul 25, 20225 min read

Lake Formation makes it easier to set up the data lake, and to incorporate Redshift as part of the compute layer alongside other analytics tools and services.

Ahana to Present About Presto on the Open Data Lakehouse at PrestoCon Day; Ahana Customer Blinkit to Discuss Its Presto on AWS Use Case

Jul 14, 20225 min read

Ahana, the only SaaS for Presto, today announced its participation in PrestoCon Day, a day dedicated to all things Presto taking place virtually on Thursday, July 21, 2022.

Data Warehouse: Understanding the Types & Architecture

Jul 5, 20225 min read

Data Warehouse: A Comprehensive Guide Introduction A data warehouse is a data repository that is typically used for analytic systems and Business Intelligence tools. It is typically composed of operational … Continue reading Data Warehouse: Understanding the Types & Architecture

Data Warehouse Concepts for Beginners | Ahana

Jul 5, 20225 min read

Data Warehouse Concepts for Beginners A data warehouse is a relational database that is designed for query and analysis rather than for transaction processing. Typically a data warehouse contains historical … Continue reading Data Warehouse Concepts for Beginners | Ahana

Ahana Will Co-Lead Session At Data & AI Summit About Presto Open Source SQL Query Engine

Jun 23, 20223 min read

Ahana Will Co-Lead Session At Data & AI Summit About Presto Open Source SQL Query Engine San Mateo, Calif. – June 23, 2022 — Ahana, the only SaaS for Presto, … Continue reading Ahana Will Co-Lead Session At Data & AI Summit About Presto Open Source SQL Query Engine

AWS Redshift Limitations | Redshift Pros and Cons

Jun 17, 20225 min read

Redshift is an Amazon petabyte-scale data warehouse product that is based on PostgreSQL version 8.0.2. While there are pros to Redshift, there are also cons. One of which is query limits. In this article we will dive deeper into the restrictions of Redshift including query limitations.

Ahana Announces Additional $7.2 Million Funding Led by Liberty Global Ventures and Debuts Free Community Edition of Ahana Cloud for Presto for the Open Data Lakehouse

Jun 16, 20225 min read

Ahana Announces Additional $7.2 Million Funding Led by Liberty Global Ventures and Debuts Free Community Edition of Ahana Cloud for Presto for the Open Data Lakehouse 

Ahana Will Co-Lead Session At Open Source Summit About Presto SQL Query Engine

Jun 14, 20223 min read

Ahana, the only SaaS for Presto, today announced that Rohan Pednekar, Ahana’s senior product manager, will co-lead a session with Meta Developer Advocate Philip Bell at the Linux Foundation’s Open Source Summit about Presto, the Meta-born open source high performance, distributed SQL query engine.

Announcing the Cube integration with Ahana: Querying multiple data sources with managed Presto and Cube

Jun 8, 202233 min read

See how Ahana and Cube work together to help you set up a Presto cluster and build a single source of truth for metrics without spending days reading cryptic docs

AWS Athena Limitations

Jun 7, 20226 min read

AWS Athena query limits cause problems, and data engineering teams spend hours diagnosing them. Learn what the limitations are and how to fix.

What is Amazon Redshift Spectrum? | Ahana

Jun 7, 20225 min read

Launched in 2017, Redshift Spectrum is a feature within Redshift that enables you to query data stored in AWS S3 using SQL. Spectrum allows you to do federated queries from within the Redshift SQL query editor to data in S3, while also being able to combine it with data in Redshift.

Building a Data Lake Using Lake Formation on AWS

Jun 7, 20225 min read

AWS lake formation helps users to build, manage and secure their data lakes in a very short amount of time, meaning days instead of months as is common with a traditional data lake approach.

Building an Open Data Lakehouse with Presto, Hudi and AWS S3

Jun 6, 20228 min read

Learn how you can start building an Open Data Lakehouse analytics stack using Presto, Hudi and AWS S3 and solve the challenges of a data warehouse.

Understanding AWS Redshift Pricing | Ahana

Jun 6, 20225 min read

AWS Redshift is a completely managed cloud data warehouse service with the ability to scale on-demand and is compatible with multitudes of AWS tools and technologies. AWS Redshift is considered the preferred cloud data warehouse of choice for most customers but the pricing is not simple, since it tries to accommodate different use cases and customers. Let us try to understand the pricing details of Amazon Redshift.

From Lake to Shining Lakehouse, A New Era in Data

Jun 3, 20222 min read

During this episode of DM Radio you will learn from experts Raj K of General Dynamics Information technology and Wen Phan of Ahana.

Presto on AWS | Run Presto on AWS Athena & EMR

May 30, 20224 min read

What is Presto? Presto is an open-source distributed SQL query engine for running interactive analytic queries against all types of data sources. Learn more about what Presto is, how it’s used, and how to get started.

What is Presto? | Presto Caching, Data Sources & Usage Intro

May 30, 20225 min read

Learn more about what Presto is, how it was developed, and how to use it.

Presto vs Snowflake: Data Warehousing Comparisons

May 27, 20228 min read

Snowflake is a cloud data warehouse that offers a cloud-based data storage and analytics service. Snowflake runs completely on cloud infrastructure.

AWS Redshift Data Warehouse Architecture | Ahana

May 24, 20228 min read

Amazon Redshift is a cloud data warehouse offered as a managed service by AWS, and a popular choice for business intelligence and reporting use cases

Data warehouse or Data Lake, which one do I choose? | Ahana

May 20, 20222 min read

In this webinar, you’ll hear from Ali LeClerc who will discuss the data landscape and why many companies are moving to an open data lakehouse.

Ahana Announces New Presto Query Analyzer to Bring Instant Insights into Presto Clusters

May 18, 20223 min read

With the Presto Query Analyzer, data platform teams can get instant insights into their Presto clusters including query performance, bandwidth bottlenecks, and much more. The Presto Query Analyzer was built for the Presto community and is free to use.

What Is AWS Redshift Used For | Redshift Use Cases

May 13, 20226 min read

Amazon Redshift is one of the most widely-used services in the AWS ecosystem and is a familiar component in many cloud architectures.

ETL and ELT | What are the differences between ETL and ELT

May 12, 20224 min read

ETL and ELT in Data Warehousing What is ETL and ELT? ETL, or Extract Transform Load, is when an ETL tool or series of homegrown programs extracts data from a … Continue reading ETL and ELT | What are the differences between ETL and ELT

How to run SQL queries with Presto on Amazon Redshift

May 10, 20229 min read

In this tutorial, we use a step-by-step approach so you can learn how to query redshift using SQL with Presto (running with Kubernetes).

The Differences Between AWS RedShift Spectrum vs Athena

May 1, 20225 min read

AWS Redshift Spectrum vs Athena: Redshift is the storage and Redshift Spectrum is a SQL engine extension. Athena is a standalone SQL engine

How Much Does Amazon Athena Cost? | Ahana

Apr 25, 20224 min read

Understanding AWS Athena Costs with Examples What Is Amazon Athena?  Since you’re reading this to understand Athena costs, you likely already know, so we’ll just very briefly touch on what … Continue reading How Much Does Amazon Athena Cost? | Ahana

5 Components of Data Warehouse Architecture | Ahana

Apr 25, 20225 min read

In this article we’ll look at the contextual requirements of a data warehouse, which are the five components of a data warehouse.

What is a Data Lakehouse Architecture?

Apr 16, 20225 min read

Overview The term Data Lakehouse has become very popular over the last year or so, especially as more customers are migrating their workloads to the cloud. This article will help … Continue reading What is a Data Lakehouse Architecture?

An introduction to Ahana Cloud for Presto on AWS

Apr 14, 20222 min read

During this webinar we will share how to build an open data lakehouse with Presto and AWS S3 using Ahana Cloud.

Price-Performance Ratio of Athena vs Ahana

Apr 13, 20225 min read

Price-Performance Ratio of AWS Athena Presto vs Ahana Cloud for Presto Understand the price-performance ratio of Amazon Athena vs. Ahana. Both AWS Athena and Ahana Cloud are based on the … Continue reading Price-Performance Ratio of Athena vs Ahana

AWS Lake Formation for Enterprise Data Lakes | Ahana Cloud

Apr 6, 20223 min read

AWS Lake Formation is a service that makes it easy to set up a secure data lake very quickly (in a matter of days), providing a governance layer for data lakes on AWS S3. 

How to build an Open Data Lakehouse Analytics stack

Mar 31, 20222 min read

During this webinar we’ll show you how you can build an open data lakehouse stack. At the heart of this stack is Presto, the open source SQL query engine for the data lake, and the transaction manager / governance layer, which includes technologies like Apache Hudi, Delta Lake, and AWS Lake Formation.

How to Use AWS Athena to Query JSON Data | Ahana

Mar 30, 20224 min read

A popular use case is to use Athena to query Parquet, ORC, CSV and JSON files that are typically used for querying directly, or transformed and loaded into a data warehouse.

Tutorial: How to run SQL queries with Presto on BigQuery

Mar 20, 20227 min read

Presto has evolved into a unified SQL engine on top of cloud data lakes for both interactive queries as well as batch workloads with multiple data sources. This tutorial is … Continue reading Tutorial: How to run SQL queries with Presto on BigQuery

Unlocking the Business Value of the Data Lake

Mar 17, 20222 min read

During this webinar we’ll discuss how nearly three-fifths of organizations have gained competitive advantage from their data lake initiatives. That includes unleashing the intelligence-generating potential of a data lake that enables ad hoc data discovery and analytics in an open and flexible manner. We’ll cover:

What is an Open Data Lake in the Cloud?

Mar 16, 20223 min read

The Open Data Lake in the cloud is the solution to the massive data problem. Many companies are adopting that architecture because of better price-performance, scale, and non-proprietary architecture.

The Differences Between AWS Athena and AWS Glue | Ahana

Mar 16, 20224 min read

Here, we are going to talk about AWS Athena vs Glue, which is an interesting pairing as they are both complementary and competitive. So, what are they exactly?

Best Practices for Resource Management in PrestoDB

Mar 11, 20229 min read

Resource management in databases allows administrators to have control over resources and assign a priority to sessions, ensuring the most important transactions get the major share of system resources. Resource management in a distributed environment makes accessibility of data easier and manages resources over the network of autonomous computers (i.e. Distributed System). The basis of resource management in the distributed system is also resource sharing.

How to Query Parquet Files using Amazon Athena | Ahana

Mar 9, 20224 min read

Querying Parquet Files using AWS Amazon Athena Parquet is one of the latest file formats with many advantages over some of the more commonly used formats like CSV and JSON. … Continue reading How to Query Parquet Files using Amazon Athena | Ahana

Configuring RaptorX – a multi-level caching with Presto

Mar 8, 202211 min read

RaptorX Background and Context Meta introduced a multi-level cache at PrestoCon 2021. Code-named the “RaptorX Project,” it aims to make Presto 10x faster on Meta- scale petabyte workloads. Here at … Continue reading Configuring RaptorX – a multi-level caching with Presto

AWS Lake Formation Blueprints | Amazon Blueprint Types

Mar 7, 20222 min read

This article is focused on the first step and how AWS Lake Formation Blueprints can make that easy and automated. Before you can run analytics to get insights, you need your data continuously pooling into your lake!

Ahana Announces New Security Capabilities to Bring Next Level of Security to the Data Lake

Feb 23, 20226 min read

Ahana, the only SaaS for Presto, today announced significant new security features added to its Ahana Cloud for Presto managed service. They include multi-user support for Presto and Ahana, fine-grained access control for data lakes with deep Apache Ranger integration, and audit support for all access.

Difference Between AWS Lake Formation vs AWS Glue

Feb 22, 20223 min read

AWS Lake Formation vs AWS Glue – What are the differences? As you start building your analytics stack in AWS, there are several AWS technologies to understand as you begin. … Continue reading Difference Between AWS Lake Formation vs AWS Glue

Limitations of Amazon S3 Select | AWS Select Capabilities

Feb 2, 20226 min read

Amazon S3 Select Limitations What is Amazon S3 Select? Amazon S3 Select allows you to use simple structured query language (SQL) statements to filter the contents of an Amazon S3 … Continue reading Limitations of Amazon S3 Select | AWS Select Capabilities

How To Query Data in AWS S3 Using Athena | Ahana

Feb 1, 20224 min read

Learn how to use Athena to query Amazon S3 and start running queries on your S3 data lake. Query JSON, Apache Parquet, Apache ORC, CSV, and more.

What is AWS Lake Formation? | Amazon S3 Lake formation

Jan 31, 20222 min read

What is AWS Lake Formation? For AWS users who want to get governance on their data lake, AWS Lake Formation is a service that makes it easy to set up … Continue reading What is AWS Lake Formation? | Amazon S3 Lake formation

How does Presto work with LDAP | Presto LDAP Authentication

Jan 28, 20223 min read

LDAP is an industry standard application used for directory services authentication. Learn how does Presto work with LDAP