Skip to content

Greenplum Database is Open Source

As promised earlier this year, Pivotal released the code for Greenplum Database into Open Source.

Greenplum Database is based on PostgreSQL (was forked from PG 8.2), and features a massive parallel processing system (MPP) to run SQL queries on very large data sets. The code base is licensed under the Apache 2.0 license, and available on GitHub. You can fork the project from there, or submit patches and new features.

One of the main goals of the engineering team is to merge the existing code base with a recent PostgreSQL version. Although many features from newer PostgreSQL versions made it into Greenplum, there are many differences in terms of code. Also Greenplum offers unique features (new query optimizer, SQL support for partitioning, append-optimized tables, columnar storage, storage compression and many more), which over time will be ported to PostgreSQL and submitted for community review.

Most of the development will move into the public (except some internal customer related work), and will be managed using newly created mailinglists on the greenplum.org website.

  • Twitter
  • Bookmark Greenplum Database is Open Source at del.icio.us
  • Facebook
  • Google Bookmarks
  • FriendFeed
  • Digg Greenplum Database is Open Source
  • Bloglines Greenplum Database is Open Source
  • Technorati Greenplum Database is Open Source
  • Fark this: Greenplum Database is Open Source
  • Bookmark Greenplum Database is Open Source at YahooMyWeb
  • Bookmark Greenplum Database is Open Source at Furl.net
  • Bookmark Greenplum Database is Open Source at reddit.com
  • Bookmark Greenplum Database is Open Source at blinklist.com
  • Bookmark Greenplum Database is Open Source at Spurl.net
  • Bookmark Greenplum Database is Open Source at Simpy.com
  • Bookmark Greenplum Database is Open Source at blogmarks
  • Bookmark Greenplum Database is Open Source with wists
  • wong it!
  • Bookmark using any bookmark manager!
  • Stumble It!
  • Identi.ca

Wrap-up: MADlib Google Summer of Code

Google Summer of Code 2014 is wrapped up: Maxence Ahlouche did an excellent job implementing one new algorithm for MADlib and refactored the code base for another one.

I posted a more detailled explanation in the Pivotal blog.

  • Twitter
  • Bookmark Wrap-up: MADlib Google Summer of Code at del.icio.us
  • Facebook
  • Google Bookmarks
  • FriendFeed
  • Digg Wrap-up: MADlib Google Summer of Code
  • Bloglines Wrap-up: MADlib Google Summer of Code
  • Technorati Wrap-up: MADlib Google Summer of Code
  • Fark this: Wrap-up: MADlib Google Summer of Code
  • Bookmark Wrap-up: MADlib Google Summer of Code at YahooMyWeb
  • Bookmark Wrap-up: MADlib Google Summer of Code at Furl.net
  • Bookmark Wrap-up: MADlib Google Summer of Code at reddit.com
  • Bookmark Wrap-up: MADlib Google Summer of Code at blinklist.com
  • Bookmark Wrap-up: MADlib Google Summer of Code at Spurl.net
  • Bookmark Wrap-up: MADlib Google Summer of Code at Simpy.com
  • Bookmark Wrap-up: MADlib Google Summer of Code at blogmarks
  • Bookmark Wrap-up: MADlib Google Summer of Code with wists
  • wong it!
  • Bookmark using any bookmark manager!
  • Stumble It!
  • Identi.ca

CPU Usage in Pivotal Greenplum Database

Blogged about how Pivotal Greenplum Database is using all available CPU resources when executing queries.

 

More in the Pivotal Blog: CPU Usage in Massively Distributed Analytic Data Warehouses

  • Twitter
  • Bookmark CPU Usage in Pivotal Greenplum Database at del.icio.us
  • Facebook
  • Google Bookmarks
  • FriendFeed
  • Digg CPU Usage in Pivotal Greenplum Database
  • Bloglines CPU Usage in Pivotal Greenplum Database
  • Technorati CPU Usage in Pivotal Greenplum Database
  • Fark this: CPU Usage in Pivotal Greenplum Database
  • Bookmark CPU Usage in Pivotal Greenplum Database at YahooMyWeb
  • Bookmark CPU Usage in Pivotal Greenplum Database at Furl.net
  • Bookmark CPU Usage in Pivotal Greenplum Database at reddit.com
  • Bookmark CPU Usage in Pivotal Greenplum Database at blinklist.com
  • Bookmark CPU Usage in Pivotal Greenplum Database at Spurl.net
  • Bookmark CPU Usage in Pivotal Greenplum Database at Simpy.com
  • Bookmark CPU Usage in Pivotal Greenplum Database at blogmarks
  • Bookmark CPU Usage in Pivotal Greenplum Database with wists
  • wong it!
  • Bookmark using any bookmark manager!
  • Stumble It!
  • Identi.ca

Pivotal's Google Summer of Code 2014: Implementing Clustering Algorithms in MADlib

Together with Atri Sharma (former GSoC student) and Pivotal Engineer Hai Qian I'm mentoring Maxence Ahlouche in his Google Summer of Code MADlib project.

 

In the Pivotal Blog I've posted a more detailed explanation.

  • Twitter
  • Bookmark Pivotal's Google Summer of Code 2014: Implementing Clustering Algorithms in MADlib at del.icio.us
  • Facebook
  • Google Bookmarks
  • FriendFeed
  • Digg Pivotal's Google Summer of Code 2014: Implementing Clustering Algorithms in MADlib
  • Bloglines Pivotal's Google Summer of Code 2014: Implementing Clustering Algorithms in MADlib
  • Technorati Pivotal's Google Summer of Code 2014: Implementing Clustering Algorithms in MADlib
  • Fark this: Pivotal's Google Summer of Code 2014: Implementing Clustering Algorithms in MADlib
  • Bookmark Pivotal's Google Summer of Code 2014: Implementing Clustering Algorithms in MADlib at YahooMyWeb
  • Bookmark Pivotal's Google Summer of Code 2014: Implementing Clustering Algorithms in MADlib at Furl.net
  • Bookmark Pivotal's Google Summer of Code 2014: Implementing Clustering Algorithms in MADlib at reddit.com
  • Bookmark Pivotal's Google Summer of Code 2014: Implementing Clustering Algorithms in MADlib at blinklist.com
  • Bookmark Pivotal's Google Summer of Code 2014: Implementing Clustering Algorithms in MADlib at Spurl.net
  • Bookmark Pivotal's Google Summer of Code 2014: Implementing Clustering Algorithms in MADlib at Simpy.com
  • Bookmark Pivotal's Google Summer of Code 2014: Implementing Clustering Algorithms in MADlib at blogmarks
  • Bookmark Pivotal's Google Summer of Code 2014: Implementing Clustering Algorithms in MADlib with wists
  • wong it!
  • Bookmark using any bookmark manager!
  • Stumble It!
  • Identi.ca

How to not export data in Ispirer SQLWays

Ispirer SQLWays is a nice (although commercial) tool to convert DDL and data from one supported database to another. We are using it regularly, the list of supported databases is impressive: PostgreSQL, Greenplum, Oracle, SQL Server, IBM DB2, MySQL, Sybase, Informix, Teradata, Netezza and some more.

There's just one thing which I always forget: SQLways exports all data by default, which makes the export unnecessary big and slow.


Continue reading "How to not export data in Ispirer SQLWays"
  • Twitter
  • Bookmark How to not export data in Ispirer SQLWays at del.icio.us
  • Facebook
  • Google Bookmarks
  • FriendFeed
  • Digg How to not export data in Ispirer SQLWays
  • Bloglines How to not export data in Ispirer SQLWays
  • Technorati How to not export data in Ispirer SQLWays
  • Fark this: How to not export data in Ispirer SQLWays
  • Bookmark How to not export data in Ispirer SQLWays at YahooMyWeb
  • Bookmark How to not export data in Ispirer SQLWays at Furl.net
  • Bookmark How to not export data in Ispirer SQLWays at reddit.com
  • Bookmark How to not export data in Ispirer SQLWays at blinklist.com
  • Bookmark How to not export data in Ispirer SQLWays at Spurl.net
  • Bookmark How to not export data in Ispirer SQLWays at Simpy.com
  • Bookmark How to not export data in Ispirer SQLWays at blogmarks
  • Bookmark How to not export data in Ispirer SQLWays with wists
  • wong it!
  • Bookmark using any bookmark manager!
  • Stumble It!
  • Identi.ca

Greenplum Community Edition - Connections from the host system into the virtual machine

If you download the Greenplum Community Edition (GP CE) from the Greenplum website and if you want to connect tools like pgAdminIII from your host machine to the database inside the virtual machine, you have to disable the firewall. The CE is installed on CentOS and by default the firewall is enabled and blocks connections from outside to most ports. Continue reading "Greenplum Community Edition - Connections from the host system into the virtual machine"
  • Twitter
  • Bookmark Greenplum Community Edition - Connections from the host system into the virtual machine at del.icio.us
  • Facebook
  • Google Bookmarks
  • FriendFeed
  • Digg Greenplum Community Edition - Connections from the host system into the virtual machine
  • Bloglines Greenplum Community Edition - Connections from the host system into the virtual machine
  • Technorati Greenplum Community Edition - Connections from the host system into the virtual machine
  • Fark this: Greenplum Community Edition - Connections from the host system into the virtual machine
  • Bookmark Greenplum Community Edition - Connections from the host system into the virtual machine at YahooMyWeb
  • Bookmark Greenplum Community Edition - Connections from the host system into the virtual machine at Furl.net
  • Bookmark Greenplum Community Edition - Connections from the host system into the virtual machine at reddit.com
  • Bookmark Greenplum Community Edition - Connections from the host system into the virtual machine at blinklist.com
  • Bookmark Greenplum Community Edition - Connections from the host system into the virtual machine at Spurl.net
  • Bookmark Greenplum Community Edition - Connections from the host system into the virtual machine at Simpy.com
  • Bookmark Greenplum Community Edition - Connections from the host system into the virtual machine at blogmarks
  • Bookmark Greenplum Community Edition - Connections from the host system into the virtual machine with wists
  • wong it!
  • Bookmark using any bookmark manager!
  • Stumble It!
  • Identi.ca