Running Presto Queries – Do I need to move data first?

Presto Queries

Do I need to move data when running Presto queries?

No, when you are running Presto queries your data stays in-place for the SQL query. Because of this, you will not need to move your data before executing Presto queries. If you’re using AWS S3 for your data lake, for example, you wouldn’t need to ingest it to query as you would if you were using a data warehouse like AWS Redshift. 

To bring Presto compute to your data, you can leverage Ahana Cloud.

With Ahana Cloud, it’s very easy to leverage the power of Presto to query AWS S3. You just connect your data source to Ahana and everything continues to run in your cloud account (called in-vpc). It’s just a click of a button to add data sources to Ahana Cloud for querying. Ahana Cloud would be a replacement for Amazon Athena, EMR Presto, or in the event that you’re running Presto on your own in AWS. It’s a managed service for Presto that takes care of all the configuration, tuning, deployment, managing, attaching/detaching data sources, etc.
You can learn more about Ahana Cloud, and you can also sign up to trial it too.

running Presto queries

Data Warehouse: A Comprehensive Guide

A data warehouse is a data repository, typically used for analytic systems and Business Intelligence tools. Take a look at this article to get a better understand of what it is and how it’s used.

Data Warehouse Concepts for Beginners

A relational database that is designed for query and analysis rather than for transaction processing. Learn more here.