As promised earlier this year, Pivotal released the code for Greenplum Database into Open Source.
Greenplum Database is based on PostgreSQL (was forked from PG 8.2), and features a massive parallel processing system (MPP) to run SQL queries on very large data sets. The code base is licensed under the Apache 2.0 license, and available on GitHub. You can fork the project from there, or submit patches and new features.
One of the main goals of the engineering team is to merge the existing code base with a recent PostgreSQL version. Although many features from newer PostgreSQL versions made it into Greenplum, there are many differences in terms of code. Also Greenplum offers unique features (new query optimizer, SQL support for partitioning, append-optimized tables, columnar storage, storage compression and many more), which over time will be ported to PostgreSQL and submitted for community review.
Most of the development will move into the public (except some internal customer related work), and will be managed using newly created mailinglists on the greenplum.org website.
Pivotal hosted the September PostgreSQL Conference in Beijing. I posted about this event in the Pivotal blog
What happens, if you ask the Chinese PostgreSQL community for a Meetup-like event, one or two speakers? You end up getting a full day conference ;-)
On September 12, Saturday, a full day PostgreSQL conference will take place at the Pivotal office in Beijing. If you want to attend, please sign up here.
The full address is: 17/F, South Block, Tower C, Raycom Info Tech Park, NO.2, Kexueyuan South Road, Beijing 100190, China.
Continue reading "PostgreSQL Day in Beijing"
As Josh Berkus blogged before, Pivotal is hosting the inaugural South Bay PostgreSQL Meetup in the Palo Alto office.
We were able to secure two very good speaker:
- Mason Sharp, from TransLattice: Scale-out PostgreSQL with PostgreSQL-XL
- Peter van Hardenberg, from Heroku: Postgres 9.4, new opportunities for users, admins and hackers
Please sign up at the Meetup website, if you plan to attend.
Date: Monday, February 16th, 2015
Location: 3495 Deer Creek Road Palo Alto, CA 94304 United States
Google Summer of Code 2014 is wrapped up: Maxence Ahlouche did an excellent job implementing one new algorithm for MADlib and refactored the code base for another one.
I posted a more detailled explanation in the Pivotal blog.
Blogged about how Pivotal Greenplum Database is using all available CPU resources when executing queries.
More in the Pivotal Blog: CPU Usage in Massively Distributed Analytic Data Warehouses
Together with Atri Sharma (former GSoC student) and Pivotal Engineer Hai Qian I'm mentoring Maxence Ahlouche in his Google Summer of Code MADlib project.
In the Pivotal Blog I've posted a more detailed explanation.