Thursday, October 16, 2008

A petabyte-sized database

Came across a very good article today about the Greenplum Database. It's an open source database that supports databases of a petabyte or more (that's 1024 terabytes). All for free.

Who needs something like this? It's currently geared towards data warehouses and Business Intelligence (BI) solutions. I might use it someday to implement a BI solution, as those databases tend to get huge. For those unaware of what a BI solution does: suppose you have a company that uses multiple applications, each with its own database and no integration between them. Now you want to run some analysis or generate reports that need data from these different databases. A BI solution can help you bring together the data from each database, correlate it, and then generate your report.

A BI solution actually ends up creating a database of its own, which gets bigger as you integrate more and more databases. Greenplum can help your BI solution store this newly acquired data.
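
To make the idea concrete, here is a tiny sketch in Python of what "bringing the data together" looks like. Everything in it is made up for illustration: the two source databases (a sales app and a CRM app), the table names, and the sample rows are all assumptions, and in-memory SQLite stands in for the real databases. A real BI tool does far more than this, but the shape is the same: copy data out of each source into one warehouse, then report from the warehouse.

```python
import sqlite3


def build_sources():
    """Create two hypothetical application databases with no integration."""
    # Hypothetical sales application: it only knows customer ids.
    sales = sqlite3.connect(":memory:")
    sales.execute("CREATE TABLE orders (customer_id INTEGER, amount REAL)")
    sales.executemany(
        "INSERT INTO orders VALUES (?, ?)",
        [(1, 120.0), (2, 80.0), (1, 40.0)],
    )

    # Hypothetical CRM application: it only knows customer names.
    crm = sqlite3.connect(":memory:")
    crm.execute("CREATE TABLE customers (id INTEGER, name TEXT)")
    crm.executemany(
        "INSERT INTO customers VALUES (?, ?)",
        [(1, "Acme"), (2, "Globex")],
    )
    return sales, crm


def load_warehouse(sales, crm):
    """Copy rows from every source into the BI layer's own database."""
    warehouse = sqlite3.connect(":memory:")
    warehouse.execute("CREATE TABLE orders (customer_id INTEGER, amount REAL)")
    warehouse.execute("CREATE TABLE customers (id INTEGER, name TEXT)")
    warehouse.executemany(
        "INSERT INTO orders VALUES (?, ?)",
        sales.execute("SELECT customer_id, amount FROM orders"),
    )
    warehouse.executemany(
        "INSERT INTO customers VALUES (?, ?)",
        crm.execute("SELECT id, name FROM customers"),
    )
    return warehouse


def revenue_report(warehouse):
    """Correlate the two sources: total order amount per customer name."""
    query = """
        SELECT c.name, SUM(o.amount)
        FROM customers AS c
        JOIN orders AS o ON o.customer_id = c.id
        GROUP BY c.name
        ORDER BY c.name
    """
    return warehouse.execute(query).fetchall()


sales, crm = build_sources()
warehouse = load_warehouse(sales, crm)
report = revenue_report(warehouse)
for name, total in report:
    print(name, total)
```

Notice that the warehouse holds a copy of every table from every source, which is exactly why it keeps growing as more applications are integrated.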

A good open source BI solution is from Pentaho.

1 comment:

Unknown said...

If you want to share this, more on the petabyte database from Greenplum can be found here: http://www.greenplum.com/products/greenplum-database/