Demystifying Presto: PrestoDB and PrestoSQL

Despite their similar names, PrestoDB and PrestoSQL are two different GitHub repos. We hope this page highlights the principles that make open source communities like Presto thrive and explains the history of the two projects. Ahana is a premier member of the Presto Foundation, which oversees PrestoDB.


PrestoDB runs at Facebook

“At Facebook alone, over a thousand employees use Presto, running several million queries and processing petabytes of data per day. After creating Presto we open sourced it to see if other companies were having the same issues and wanted to collaborate. It turns out many other companies were interested and so under The Linux Foundation, we believe the project can engage others and grow the community for the benefit of all.”

Kathy Kam, Head of Open Source at Facebook.

PrestoDB Community-driven Innovations

To hear about the PrestoDB roadmap, watch this video from the head of the Technical Steering Committee and this one from Biswapesh Chattopadhyay of Facebook. Come join us in developing Presto; we welcome community participation. Below are some of the recent and current innovations.

Project Aria – PrestoDB can now push down entire expressions to the data source for some file formats like ORC. Blog Design

Project Presto Unlimited – Introduced exchange materialization to create temporary in-memory bucketed tables to use significantly less memory. PR Blog

User Defined Functions – Support for dynamic SQL functions is now available in experimental mode. Docs

Apache Pinot and Druid Connectors Docs

Presto-on-Spark – Runs Presto code as a library within the Spark executor. Design Docs

Disaggregated Coordinator (a.k.a. Fireball) – Scale out the coordinator horizontally and revamp the RPC stack. Beta in Q4 2020. Issues

Ahana is an ongoing contributor to PrestoDB.

Here are some of the open source features we are working on now:

  • CSV support for Hive Connector
  • Upgrade Hive support to 3.x
  • Apache Iceberg connector 
  • Apache Ranger plugin for Presto
  • Explore efficacy of caching strategies at different levels of abstraction
  • Better AWS IAM support

Linux Foundation and Open Source

The Linux Foundation is dedicated to building sustainable ecosystems around open source projects to accelerate technology development and industry adoption. Founded in 2000, the Linux Foundation provides unparalleled support for open source communities through financial and intellectual resources, infrastructure, services, events, and training. Working together, the Linux Foundation and its projects form the most ambitious and successful investment in the creation of shared technology.

Open source benefits developers, technologists, and organizations. But to foster collaboration, developers prefer projects with clear governance and transparency. The Presto Foundation, under the Linux Foundation, brings both to PrestoDB.


Presto Foundation: Community-driven Presto

The Presto Foundation is guided by three main principles, defined here:

1. One open, neutral and united Presto community
2. Open, transparent technical leadership and direction
3. No one individual or company is greater than the project and its community

Every member in the Presto Foundation works together towards these principles.

The Value of the Presto Foundation

A Brief History of Presto

2012: Facebook develops Presto

2013: Facebook open sources PrestoDB under the Apache License

Jan 2019: A Presto Software Foundation around the PrestoSQL fork is announced, led by the original creators of Presto who left Facebook

Sept 2019: Facebook donates the original project to the Linux Foundation and establishes the Presto Foundation, which includes founding members Facebook, Uber, Twitter, and Alibaba

Sept 2020: Coinciding with the one-year anniversary of the Presto Foundation, the first PrestoCon draws 650+ attendees, with all sessions available on the Presto Foundation’s YouTube channel. Hear about the innovations for PrestoDB over the past year in the keynote, what Facebook has planned, and the future vision as discussed by a panel of experts.

“The Linux Foundation is dedicated to building sustainable, governed, open source ecosystems to greatly benefit users globally, like what we have achieved with CNCF and the Kubernetes project. We are excited to welcome Ahana to the Presto Foundation as the first and only company focused on supporting Presto of the Presto Foundation.”

Chris Aniszczyk, Vice President of Developer Relations, The Linux Foundation


Presto Rising

Presto is one of the fastest growing open source projects in the data systems space.

The Star history tool is fun. You can look back in time to see the star history of various open source projects. Try charting your favorite GitHub repos. Link