Comments on: VectorWise, Ingres, and MonetDB

By: Snowflake Computing | DBMS 2 : DataBase Management System Services

Wed, 22 Oct 2014 08:45:54 +0000

[…] 2 techie founders out of Oracle, plus Marcin Zukowski. […]

By: Actian Vector Hadoop Edition | DBMS 2 : DataBase Management System Services

Tue, 30 Sep 2014 05:50:34 +0000

[…] Peter Boncz isn’t exactly an Actian employee. Rather, he’s the professor who supervised Marcin Zukowski’s PhD thesis that became Vectorwise, and I chatted with Peter by Skype while he was at home in Amsterdam. I believe his assurances that […]

By: Ingres VectorWise technical highlights | DBMS2 -- DataBase Management System Services

Fri, 11 Jun 2010 11:28:25 +0000

[…] caught up with me for a regrettably brief call. Peter gave me the strong impression that what I’d written in the past about VectorWise had been and remained accurate, so I focused on filling in the gaps. Highlights […]

By: Martin Kersten on issues in scientific data management | DBMS2 -- DataBase Management System Services

Sat, 03 Oct 2009 10:33:55 +0000

[…] Martin Kersten emailed a response to my post on issues in scientific data management. With his permission, I’ve lightly edited it, and am posting it below. […]

By: HadoopDB | DBMS2 -- DataBase Management System Services

Sun, 20 Sep 2009 00:05:20 +0000

[…] where X=2. Column-store guru Abadi has repeatedly signaled his intention to try out HadoopDB with VectorWise at the nodes instead. (Recall that VectorWise is shared-everything.) It will be interesting to see […]

By: Do hash tables work in constant time?

Tue, 18 Aug 2009 14:15:21 +0000

[…] Am I being pedantic? Does the time required to multiply integers on modern machine depend on the size of the integers? It certainly does if you are using vectorization. And vectorization is used in commercial databases! […]

By: Edward

Wed, 05 Aug 2009 02:03:04 +0000

There’s a 2008 talk by Peter Boncz about the MonetDB/X100 project that illustrates principles that seem to be used by VectorWise’s DBMS:

http://www.youtube.com/watch?v=yrLd-3lnZ58

Cool stuff,
E.

By: Marcin Zukowski

Tue, 04 Aug 2009 19:36:02 +0000

@Daniel

One thing to note is that the opinion that working on compressed data sets is mostly useful for the major ordering columns applies only to RLE compression. As you write, in cases with large domain cardinality, RLE won’t do much for non-sorted data.
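To make the RLE point concrete, here is a minimal sketch in Python. It is only an illustration: the helper names `rle_encode` and `count_equal` and the sample column are made up for this example and are not taken from VectorWise or any other product. It shows that the same values produce many more runs when unsorted, and that a simple count can be answered on the runs without expanding them.

```python
from itertools import groupby

def rle_encode(values):
    """Return a list of (value, run_length) pairs for consecutive equal values."""
    return [(v, sum(1 for _ in run)) for v, run in groupby(values)]

def count_equal(runs, target):
    """Count rows equal to target directly on the runs, without expanding them."""
    return sum(length for value, length in runs if value == target)

# Hypothetical column: the same nine values, unsorted and sorted.
unsorted_col = [3, 1, 2, 3, 1, 2, 3, 1, 2]
sorted_col = sorted(unsorted_col)

unsorted_runs = rle_encode(unsorted_col)  # one run per row: no gain
sorted_runs = rle_encode(sorted_col)      # one run per distinct value
```

On the sorted column the query touches three runs; on the unsorted one it touches as many runs as there are rows, which is why RLE mostly pays off on the major ordering columns.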

Still, other forms of compression can be used and data compressed with those can be analyzed without decompressing, see e.g. http://scholar.google.com/scholar?q=%22The+Implementation+and+Performance+of+Compressed+Databases.%22
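As a rough illustration of analyzing data without decompressing it, the sketch below uses plain dictionary encoding in Python. This is an assumption chosen for simplicity, not necessarily the technique of the work linked above, and the names `dict_encode` and `select_equal` are hypothetical. The equality predicate is translated to a code once and then evaluated on the integer codes, so the column values themselves are never rebuilt, and the column does not need to be sorted.

```python
def dict_encode(values):
    """Map each distinct value to a small integer code."""
    dictionary = sorted(set(values))
    code_of = {v: i for i, v in enumerate(dictionary)}
    return dictionary, [code_of[v] for v in values]

def select_equal(dictionary, codes, target):
    """Return row positions where the column equals target, comparing codes only."""
    try:
        target_code = dictionary.index(target)  # one lookup per query
    except ValueError:
        return []  # value not in the column at all
    return [row for row, code in enumerate(codes) if code == target_code]

# Hypothetical unsorted string column.
cities = ["Oslo", "Lima", "Oslo", "Cairo", "Lima", "Oslo"]
dictionary, codes = dict_encode(cities)
matches = select_equal(dictionary, codes, "Oslo")
```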

By: Curt Monash

Tue, 04 Aug 2009 15:58:26 +0000

Thanks, Marcin!

I edited in two corrections (Ph.D, CPU cycles).

Best,

CAM

By: Marcin Zukowski

Tue, 04 Aug 2009 15:31:40 +0000

Hi Curt,

Thank you for a nice writeup on VectorWise. While it is generally correct, here are some clarifications:

– the VectorWise technology belongs fully to our company (no academic institution, including CWI, can control it)

– the MonetDB open-source system originated from the PhD research of Peter Boncz under the supervision of Martin Kersten, while the VectorWise database engine is a technology generation later and came out of my own PhD (not MSc) research, supervised in turn by Peter Boncz. Other CWI group members have also made significant contributions to both projects.

– we do hope to make VectorWise technology available as early as possible, and 2010 is very possible, but please do not treat it as an official plan

– as for the string compression, we use something called PDICT, which is a new – outlier resistant – form of dictionary encoding.

– like you wrote, the main thing about the compression methods in VectorWise is that they are much faster than existing methods. As for the performance, we take a few “CPU cycles” (not “steps”) for one element. Links to publications with more technical info can be found on: http://www.vectorwise.com/index_js.php?page=company_origins

– the place to visit for more info on the Ingres VectorWise project is http://www.ingres.com/vectorwise
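For readers wondering what an outlier-resistant dictionary encoding might look like, here is a minimal Python sketch of one possible approach. It assumes that only the most frequent values get dictionary codes and that rare values are stored separately as exceptions; the actual PDICT design may differ, and the names `encode_with_exceptions`, `decode`, and `dict_size` are hypothetical.

```python
from collections import Counter

def encode_with_exceptions(values, dict_size):
    """Dictionary-encode the dict_size most frequent values; keep the rest as exceptions."""
    dictionary = [v for v, _ in Counter(values).most_common(dict_size)]
    code_of = {v: i for i, v in enumerate(dictionary)}
    escape = len(dictionary)  # reserved code meaning "look in exceptions"
    codes, exceptions = [], {}
    for row, v in enumerate(values):
        if v in code_of:
            codes.append(code_of[v])
        else:
            codes.append(escape)
            exceptions[row] = v  # rare value stored outside the dictionary
    return dictionary, codes, exceptions

def decode(dictionary, codes, exceptions):
    """Rebuild the original column from codes plus the exception table."""
    escape = len(dictionary)
    return [exceptions[row] if c == escape else dictionary[c]
            for row, c in enumerate(codes)]

# Hypothetical column with one rare value.
values = ["a", "b", "a", "zzz-rare", "b", "a"]
dictionary, codes, exceptions = encode_with_exceptions(values, dict_size=2)
```

The point of the design is that a handful of rare values cannot inflate the dictionary, so the codes stay small however many outliers appear.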

Best regards,
Marcin Zukowski