February 16, 2008

Mike Stonebraker’s DBMS taxonomy

In a response to my recent five-part series on DBMS diversity, Mike Stonebraker has proposed his own taxonomy of data management technologies over on Vertica’s Database Column blog. (Edit: Some good stuff disappeared when Vertica nuked that blog.)

OLTP DBMSs focused on fast, reliable transaction processing

Analytic/Data Warehouse DBMSs focused on efficient load and ad-hoc query performance

Science DBMSs — after all MatLab does not scale to disk-sized arrays

RDF stores focused on efficiently storing semi-structured data in this format

XML stores focused on semi-structured data in this format

Search engines — the big players all use proprietary engines in this area

Stream Processing Engines focused on real-time StreamSQL

“Lean and Mean,” less-than-a-database engines focused on doing a small number of things very well (embedded databases are probably in this category)

MapReduce and Hadoop — after all Google has enough “throw weight” to define a category

He goes on to say that each will be architected differently, except that — as he already convinced me back in July — RDF will be well-managed by specialty data warehouse DBMS.

I must confess that I didn’t explicitly mention array-based data stores, whether scientific ones, nor the remaining native MOLAP (Multi-Dimensional OnLine Analytic Processing) engines, nor the sui generis SAS Intelligence Storage relational data warehouse product. So great catch there. On the not-so-great side, I think Mike’s definitions of categories #8 and #9 are a bit fuzzy (embedded DBMS tend to be full DBMS, but MapReduce is less than a DBMS). And of course any finite list like his will make over-general assumptions (e.g., it’s not obvious the StreamSQL-based CEP vendors will blow away rule-oriented Apama) and omit edge cases.

But there’s really only one point on which we have meaningful disagreement — Mike dumps all OLTP and general-purpose relational DBMS into a single bucket. Considering that such products currently represent a large majority of the multi-billion dollar DBMS market, I think some finer distinctions are in order. At a minimum, let’s break them into two categories — high-end vs. mid-range. High-end systems have maximum robustness, whether because there’s a real application need or because it just makes their owners feel good. Mid-range systems do everything high-end systems did in the 1990s, and are a cheaper/better alternative for ever more database management tasks.

The series on database diversity (more links at the bottom of Part 1):

Categories: Data types, Database diversity, Michael Stonebraker, Mid-range, OLTP, RDF and graphs, Theory and architecture

Subscribe to our complete feed!

Comments

6 Responses to “Mike Stonebraker’s DBMS taxonomy”

My own data management software taxonomy | DBMS2 -- DataBase Management System Services on June 26th, 2008 3:36 am

[…] a recent webcast, I presented an 11-node data management software taxonomy, updating a post commenting on Mike Stonebraker’s. It […]
Mike Stonebraker may be oversimplifying data warehousing just a tad | DBMS2 -- DataBase Management System Services on June 26th, 2008 3:39 am

[…] Earlier I thought Mike was forgetting about the distinction between high-end and mid-range RDBMS. Naturally, that didn’t last long. He’s actually calling the mid-range systems “open source”, but that’s a decent first approximation to a hard-to-define category. […]
Database diversity revisited | DBMS 2 : DataBase Management System Services on July 8th, 2012 8:55 pm

[…] build a little taxonomy for the variety in database technology. One effort was 4 1/2 years ago, in a pre-planned exchange with Mike Stonebraker (his side, alas, has since been taken down). A year ago I spelled out eight kinds of analytic […]
One database to rule them all? | DBMS 2 : DataBase Management System Services on February 28th, 2013 10:05 pm

[…] DBMS attempts with Postgres and Illustra/Informix, then more recently suggesting the world needs 9 or so kinds of database technology. As for me — well, I agreed with Mike both […]
YouTube on May 9th, 2014 3:58 pm

That is really interesting, You are a very professional blogger.
I’ve joined your rss feed and stay up for looking for more of your magnificent
post. Also, I have shared your website in my social networks
law on May 16th, 2014 8:42 pm

Hey I know this is off topic but I was wondering if you knew of any widgets I could add to my blog that automatically tweet my newest twitter updates.
I’ve been looking for a plug-in like this for quite some time and
was hoping maybe you would have some experience with something like this.
Please let me know if you run into anything. I truly enjoy reading
your blog and I look forward to your new updates.

Search our blogs and white papers

Monash Research blogs

DBMS 2 covers database management, analytics, and related technologies.
Text Technologies covers text mining, search, and social software.
Strategic Messaging analyzes marketing and messaging strategy.
The Monash Report examines technology and public policy issues.
Software Memories recounts the history of the software industry.

User consulting

Building a short list? Refining your strategic plan? We can help.

Vendor advisory

We tell vendors what's happening -- and, more important, what they should do about it.

Monash Research highlights

Learn about white papers, webcasts, and blog highlights, by RSS or email.

Links
- Monash Research
- White Papers
Admin
- Log in

Mike Stonebraker’s DBMS taxonomy

Comments

Search our blogs and white papers

Monash Research blogs

User consulting

Vendor advisory

Monash Research highlights

Recent posts

Categories

Date archives

Admin