As promised earlier this year, Pivotal released the code for Greenplum Database into Open Source.
Greenplum Database is based on PostgreSQL (was forked from PG 8.2), and features a massive parallel processing system (MPP) to run SQL queries on very large data sets. The code base is licensed under the Apache 2.0 license, and available on GitHub. You can fork the project from there, or submit patches and new features.
One of the main goals of the engineering team is to merge the existing code base with a recent PostgreSQL version. Although many features from newer PostgreSQL versions made it into Greenplum, there are many differences in terms of code. Also Greenplum offers unique features (new query optimizer, SQL support for partitioning, append-optimized tables, columnar storage, storage compression and many more), which over time will be ported to PostgreSQL and submitted for community review.
Most of the development will move into the public (except some internal customer related work), and will be managed using newly created mailinglists on the greenplum.org website.
As Josh Berkus blogged before, Pivotal is hosting the inaugural South Bay PostgreSQL Meetup in the Palo Alto office.
We were able to secure two very good speaker:
- Mason Sharp, from TransLattice: Scale-out PostgreSQL with PostgreSQL-XL
- Peter van Hardenberg, from Heroku: Postgres 9.4, new opportunities for users, admins and hackers
Please sign up at the Meetup website, if you plan to attend.
Date: Monday, February 16th, 2015
Location: 3495 Deer Creek Road Palo Alto, CA 94304 United States
Happy to announce that Maxence Ahlouche has been accepted by both the PostgreSQL Project and by Google to implement clustering algorithms in MADlib during the Google Summer of Code 2014.
This project is mentored by Atri Sharma (former GSoC student) and me, with technical help from Pivotal (product owner of MADlib).
Looking forward to another successful GSoC year!