
Ahana blog for Presto
All the latest from Ahana and the Presto community
Joining forces to further Presto together: Ahana acquired by IBM
Today I am thrilled to finally share more about Ahana joining forces with IBM. A few weeks ago, we shared with the Linux Foundation Presto open source community that IBM … Continue reading Joining forces to further Presto together: Ahana acquired by IBM
The top 5 reasons a data engineer should use Presto
Open source software has become very widely used over the last decade or so. It’s not just developers and engineers adopting open source – companies ranging from the Fortune 500 … Continue reading The top 5 reasons a data engineer should use Presto
How to Cut the Cost of Your Lakehouse Solution
Lakehouse solutions are becoming more and more popular as an augmentation or replacement for expensive locked-in data warehouses. However, many organizations still struggle with the cost of these implementations. Let’s … Continue reading How to Cut the Cost of Your Lakehouse Solution
The Case for Unbundling Your Lakehouse
When you are looking for a lakehouse, do you think about getting an all in one solution from one vendor? If so, you may be missing out on a great … Continue reading The Case for Unbundling Your Lakehouse
AWS Athena Alternatives: Best Amazon Athena Alternatives
Looking for Athena alternatives? Ahana gives you the scale and performance of PrestoDB – the same underlying technology that powers Athena, and which is used for petabyte-scale analytics at Meta … Continue reading AWS Athena Alternatives: Best Amazon Athena Alternatives
Using JMeter with Presto
Apache JMeter is an open source application written in Java that is designed for load testing. This article presents how to install JMeter, and create and run a test plan for testing SQL workloads on Presto clusters.
Hands-on Presto Tutorial: How to run Presto on Kubernetes
In this tutorial we’ll explain how to run Presto on Kubernetes. We’ll cover how to get started, what you need, and what to expect.
Announcing the Cube integration with Ahana: Querying multiple data sources with managed Presto and Cube
See how Ahana and Cube work together to help you set up a Presto cluster and build a single source of truth for metrics without spending days reading cryptic docs
AWS Athena Limitations
AWS Athena query limits cause problems, and data engineering teams spend hours diagnosing them. Learn what the limitations are and how to fix.
Building an Open Data Lakehouse with Presto, Hudi and AWS S3
Learn how to build an Open Data Lakehouse stack using Presto, Hudi and AWS S3.
PrestoDB on AWS
What is Presto? Presto is an open-source distributed SQL query engine for running interactive analytic queries against all types of data sources. Learn more about PrestoDB on AWS.
How to run SQL queries with Presto on Amazon Redshift
Just learning or looking for a refresher about running SQL queries, then this guide is for you. We’ll focus on SQL with Presto and Redshift.
Tutorial: How to run SQL queries with Presto on BigQuery
Presto has evolved into a unified SQL engine on top of cloud data lakes for both interactive queries as well as batch workloads with multiple data sources. This tutorial is … Continue reading Tutorial: How to run SQL queries with Presto on BigQuery
What is an Open Data Lake in the Cloud?
The Open Data Lake in the cloud is the solution to the massive data problem. Many companies are adopting that architecture because of better price-performance, scale, and non-proprietary architecture.
Best Practices for Resource Management in PrestoDB
Resource management in databases allows administrators to have control over resources and assign a priority to sessions, ensuring the most important transactions get the major share of system resources. Resource management in a distributed environment makes accessibility of data easier and manages resources over the network of autonomous computers (i.e. Distributed System). The basis of resource management in the distributed system is also resource sharing.
Configuring RaptorX – a multi-level caching with Presto
RaptorX Background and Context Meta introduced a multi-level cache at PrestoCon 2021. Code-named the “RaptorX Project,” it aims to make Presto 10x faster on Meta- scale petabyte workloads. Here at … Continue reading Configuring RaptorX – a multi-level caching with Presto
Benchmark Presto | Benchmarking Warehouse Workloads
For businesses looking to understand how to evaluate Presto performance, this document will help to learn how to benchmark Presto.
Announcing the Ahana Cloud for Presto integration with AWS Lake Formation
We’re excited to announce that Ahana Cloud for Presto now integrates with AWS Lake Formation, including support for the recent general availability of row-level security. AWS Lake Formation is a … Continue reading Announcing the Ahana Cloud for Presto integration with AWS Lake Formation
What is Presto on Spark?
Overview Presto was originally designed to run interactive queries against data warehouses. However, now it has evolved into a unified SQL engine on top of open data lake analytics for … Continue reading What is Presto on Spark?
Tutorial: How to define SQL functions with Presto across all connectors
Presto supports many native functions, which are usually sufficient for most use cases. To simplify this, Presto allows users to define expressions as SQL functions.
Presto 105: Running Presto with AWS Glue as catalog on your Laptop
Introduction This is the 5th tutorial in our Getting Started with Presto series. To recap, here are the first 4 tutorials: Presto 101: Installing & Configuring Presto locally Presto 102: … Continue reading Presto 105: Running Presto with AWS Glue as catalog on your Laptop
Presto 104: Running Presto with Hive Metastore on your Laptop
Introduction This is the 4th tutorial in our Getting Started with Presto series. To recap, here are the first 3 tutorials: Presto 101: Installing & Configuring Presto locally Presto 102: … Continue reading Presto 104: Running Presto with Hive Metastore on your Laptop
Connect Superset to Presto
Curious how to connect Superset to Presto? This article will break down the steps needed to use Presto with Superset.
Presto Tutorial 103: PrestoDB cluster on GCP
Introduction This tutorial is Part III of our Getting started with PrestoDB series. As a reminder, Prestodb is an open source distributed SQL query engine. In tutorial 102 we covered … Continue reading Presto Tutorial 103: PrestoDB cluster on GCP
Announcing the workload profile feature in Ahana Cloud
Ahana Cloud for Presto is the first fully integrated, cloud native managed service that simplifies the ability of cloud and data platform teams. With the managed Presto service, we provide … Continue reading Announcing the workload profile feature in Ahana Cloud
Announcing the Ahana $20M Series A – Furthering our Vision of Open Data Lake Analytics with Presto
I’m very excited to announce that Ahana, the SaaS for Presto company, has raised a jumbo $20M Series A round from lead investor Third Point Ventures. Our SaaS managed service … Continue reading Announcing the Ahana $20M Series A – Furthering our Vision of Open Data Lake Analytics with Presto
Autoscale your Presto cluster in Ahana Cloud
One of the challenges of running a Presto cluster is to make the right decision in terms of the number of worker nodes. Learn here how to autoscale
Snowflake may not be the silver bullet you wanted for your long term data strategy… here’s why
Since COVID, every business has pivoted and moved everything online, accelerating digital transformation with data and AI. Self-service, accelerated analytics has become more and more critical for businesses and Snowflake … Continue reading Snowflake may not be the silver bullet you wanted for your long term data strategy… here’s why
Presto 102 Tutorial: Install PrestoDB on a Laptop or PC
Summary Prestodb is an open source distributed parallel query SQL engine. In tutorial 101 we walk through manual installation and configuration on a bare metal server or on a VM. It … Continue reading Presto 102 Tutorial: Install PrestoDB on a Laptop or PC
Enabling spill to disk for optimal price per performance
Presto was born out of the need for low-latency interactive queries on large scale data, and hence, continually optimized for that use case. In such scenarios, the best practice is … Continue reading Enabling spill to disk for optimal price per performance
Presto 101 Tutorial: Installing & Configuring Presto
Installing & Configuring Presto locally Presto Installation Presto can be installed manually or using docker images on: Single Node: Both co-ordinator and workers run on the same machine. or even … Continue reading Presto 101 Tutorial: Installing & Configuring Presto
Why I’m betting on PrestoDB, and why you should too!
By Dipti Borkar, Ahana Cofounder, Chief Product Officer & Chief Evangelist I’ve been in open source software companies and communities for over 10 years now, and in the database industry … Continue reading Why I’m betting on PrestoDB, and why you should too!
5 main reasons Data Engineers move from AWS Athena to Ahana Cloud
In this brief post, we’ll discuss the 5 main reasons why data platform engineers decide to move their data analytics workloads from Amazon Athena to Ahana Cloud for Presto. While … Continue reading 5 main reasons Data Engineers move from AWS Athena to Ahana Cloud
Presto Foundation compared to Single Vendor-Driven Open Source Projects
Why Project Neutrality Matters Steven Mih, Cofounder and CEO, Ahana PrestoCon Day was last week on March 24, and it was an incredible event with lots of great speakers, great … Continue reading Presto Foundation compared to Single Vendor-Driven Open Source Projects
Upsolver and Ahana Announce Technology Partnership to Accelerate Adoption of Open Data Lake Analytics in the Cloud with Presto
Todd Odess, Head of Partnerships at Upsolver | Ali LeClerc, Head of Marketing at Ahana Today we’re thrilled to announce our joint technology partnership. Upsolver, the first no-code, high-speed cloud … Continue reading Upsolver and Ahana Announce Technology Partnership to Accelerate Adoption of Open Data Lake Analytics in the Cloud with Presto
Athena Partition Limits | Comparing AWS Athena & PrestoDB
This is our 3rd blog in the comparing AWS Athena to PrestoDB series. If you missed the first two, you can find them here: Part 1: AWS Athena vs. PrestoDB … Continue reading Athena Partition Limits | Comparing AWS Athena & PrestoDB
Athena Query Limits | Comparing AWS Athena & PrestoDB
Welcome to the 2nd blog in our blog series on comparing AWS Athena, a serverless Presto service, to open source PrestoDB. In this series we’ll discuss Amazon’s Athena service versus … Continue reading Athena Query Limits | Comparing AWS Athena & PrestoDB
Ahana Cloud for Presto 🌤 – Now “Open” for “Open Analytics”
I’m thrilled to share that today Ahana Cloud for Presto on AWS is generally available and open for Open Analytics. Ahana Cloud 🌤 is the first fully integrated, cloud-native managed … Continue reading Ahana Cloud for Presto 🌤 – Now “Open” for “Open Analytics”
5 reasons why you should use PrestoDB
Data platform teams are increasingly using the federated SQL query engine PrestoDB to run such analytics for a variety of use cases across a wide range of data lakes and databases in-place, without the need to move data. Here’s a look at some important characteristics of Presto that account for its growing adoption.
Building Managed Services: Architecting Ahana Cloud for Presto with the In-VPC Deployment Model
This blog post discusses how AWS users have evolved their big data requirements and how the Ahana team architected its managed service offering, highlighting the best practice of providing an “In-VPC” deployment.
Ahana Cloud for Presto: Analytics for Disaggregated Stack
Today I am incredibly excited to announce Ahana Cloud for Presto, the first fully integrated, cloud-native managed service for Presto – that simplifies the ability of cloud and data platform teams of all sizes to provide self-service, SQL analytics for their data analysts and scientists.
Bridging the gap to bring two Presto communities together: Welcoming Starburst Data to the Presto Foundation
Why are there two Presto projects and how many do we need? For the benefit of the community, I hope we can see the confusion end with: 1 foundation, 1 primary code base for new development, and 1 community.
Getting Perspective on Open Source Projects and Forks with Ted Dunning, Apache Software Expert
We spent some time with Apache Software Foundation board member, PMC member, and open source expert Ted Dunning to get his thoughts on the state of open source software.
Introducing Ahana
Curious about Ahana? In this article you’ll earn what Ahana is, what it’s used for, and how Ahana got started.