Presto Data Sources
Presto was designed and written from the ground up to efficiently query data against data sources of all sizes, ranging from gigabytes to petabytes. Presto connects to a wide variety of data sources, from HDFS to traditional relational databases, as well as NoSQL data sources such as Cassandra. Presto is particularly equipped to perform multiple concurrent interactive queries against a data source.
Presto is obviously a great fit for companies that have disparate data sources. For those organizations that can’t consolidate all of their data into one centralized store, Presto’s data federation capabilities can create a unified query layer that enables you to blend your data across different data sources together. With Presto, you can leverage many data sources at once, which means Presto can handle very large volumes of data.
Here’s a list of the most popular data sources that Presto connects to:
Accumulo
Alluxio
Amazon Redshift
Amazon S3
Cassandra
Druid
Elastic
HDFS
Kafka
Kudu
Microsoft SQL Server
MongoDB
Phoenix
Pinto
RDS PostgreSQL
RDS MySQL
Redis
Teradata