Presto Data Sources
Presto was designed from the ground up to query data efficiently across data sources of all sizes, from gigabytes to petabytes. It connects to a wide variety of data sources, from HDFS and traditional relational databases to NoSQL stores such as Cassandra. Presto is particularly well suited to running many concurrent interactive queries against a data source.
Presto is a great fit for companies that have disparate data sources. For organizations that can’t consolidate all of their data into one centralized store, Presto’s data federation capabilities provide a unified query layer that lets you combine data from different sources in a single query. Because Presto can query many data sources at once, the combined volume of data within its reach can be very large.
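As a rough illustration of the federation idea, the sketch below builds a single SQL statement that joins tables from two different sources. Presto addresses tables as catalog.schema.table; the catalog, schema, table, and column names used here (hive, cassandra, sales, web, orders, clicks, customer_id) are hypothetical, and the helper functions are illustrative only, not part of any Presto client library.

```python
def qualified(catalog: str, schema: str, table: str) -> str:
    """Return a fully qualified table name in catalog.schema.table form."""
    return f"{catalog}.{schema}.{table}"


def federated_join_sql(left: str, right: str, key: str) -> str:
    """Build a join between two tables that may live in different catalogs."""
    return (
        f"SELECT l.{key}, count(*) AS events\n"
        f"FROM {left} AS l\n"
        f"JOIN {right} AS r ON l.{key} = r.{key}\n"
        f"GROUP BY l.{key}"
    )


# Hypothetical names: one table in a Hive catalog, one in a Cassandra catalog.
orders = qualified("hive", "sales", "orders")
clicks = qualified("cassandra", "web", "clicks")
query = federated_join_sql(orders, clicks, "customer_id")
print(query)
```

The resulting string could be submitted through whichever Presto client you already use; the point is only that both sources appear in one query.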
Some popular combinations include Presto with Amazon S3, Presto with Cassandra, and Presto with Accumulo. Here is a fuller list of data sources Presto connects to:
Accumulo
Alluxio
Amazon Redshift
Amazon S3
Cassandra
Druid
Elasticsearch
HDFS
Kafka
Kudu
Microsoft SQL Server
MongoDB
Phoenix
Pinot
RDS PostgreSQL
RDS MySQL
Redis
Teradata
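Each data source is typically registered with Presto through a catalog properties file that names the connector and its connection settings. The sketch below renders the text of such a file for a MySQL source; the host, user, and password values are placeholders, and catalog_properties is a hypothetical helper written for this example rather than a Presto API.

```python
def catalog_properties(connector: str, settings: dict) -> str:
    """Render catalog properties text: connector.name first, then each setting."""
    lines = [f"connector.name={connector}"]
    lines += [f"{key}={value}" for key, value in settings.items()]
    return "\n".join(lines) + "\n"


# Placeholder connection details; replace with values for your own database.
mysql_props = catalog_properties(
    "mysql",
    {
        "connection-url": "jdbc:mysql://example.net:3306",
        "connection-user": "presto_user",
        "connection-password": "changeme",
    },
)
print(mysql_props)
```

Saved as a file such as etc/catalog/mysql.properties, this text would make the source available under the catalog name mysql, assuming a standard Presto installation layout.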
If you want to get up and running with Presto quickly, check out Ahana Cloud, a SaaS offering for Presto.