August 7, 2012

Notes on some basic database terminology

In a call Monday with a prominent company, I was told:

That, to put it mildly, is not accurate. So I shall try, yet again, to set the record straight.

In an industry where people often call a DBMS just a “database” — so that a database is something that manages a database! — one may wonder why I bother. Anyhow …

1. The products commonly known as Oracle, Exadata, DB2, Sybase, SQL Server, Teradata, Sybase IQ, Netezza, Vertica, Greenplum, Aster, Infobright, SAND, ParAccel, Exasol, Kognitio et al. all either are or incorporate relational database management systems, aka RDBMS or relational DBMS.

2. In principle, there can be difficulties in judging whether or not a DBMS is “relational”. In practice, those difficulties don’t arise — yet. Every significant DBMS still falls into one of two categories:

*I expect the distinction to get more confusing soon, at which point I’ll adopt terms more precise than “relational things” and “relational stuff”.

3. There are two chief kinds of relational DBMS:

* “General-purpose” is usually a better term than “OLTP”; most OLTP DBMS can handle at least basic reporting, and the leading ones go well beyond that.

4. Some analytic RDBMS were designed to be columnar. Some were designed to be row-based. Multiple systems from both groups now offer both column- and row-based storage options. But they’re all equally relational.

And once again, I remind you that columnar storage and columnar compression are not the same thing.

5. An appliance can include a DBMS, and indeed exist for no purpose other than to run a DBMS; but a DBMS is not an appliance. At a minimum, a data warehouse appliance is a computing system (hardware, storage, operating system, etc.) with an analytic RDBMS preinstalled.

Occasionally somebody suggests that a “virtual appliance” doesn’t have to have hardware included, but they usually draw little attention.

However, reasonable people can disagree about pickier questions, such as:

For example, I think:

Again, reasonable people can disagree — just so long as they don’t slap the label “appliance” onto software-only analytic RDBMS.

Comments

14 Responses to “Notes on some basic database terminology”

  1. Three PIpe Problem on August 7th, 2012 7:36 am

    Forgive the metonymists

  2. Thomas W Dinsmore on August 7th, 2012 8:07 am

    And EMC’s Greenplum DCA is not an appliance, but a rack of commodity blades with a factory-installed instance of Greenplum database.

  3. Curt Monash on August 7th, 2012 10:46 am

    Thomas,

    I’m OK with that being called an appliance.

    I suspect your reasoning for saying it’s not an appliance would have more force if you instead were arguing that it’s not a GOOD (enough) appliance.

  4. Thomas W Dinsmore on August 7th, 2012 11:40 am

    Curt,

    If Greenplum DCA is an appliance, anything is an appliance.

    TD

  5. Ofir Manor on August 7th, 2012 3:03 pm

    not sure what’s the semantic issue of Thomas from Netezza … But you can rest assured that Greenplum’s Data Computing Appliance (DCA) is an appliance, just check the acronym :)
    BTW, the DCA is not a rack of blades (there are no blade servers in it). It is a rack of servers and network equipment, with all the efforts to make it the best and easiest Greenplum environment to deploy – from call home, to regular monitoring (like unified SNMP support for HW and SW), to pre-optimized setup on all layers (network, firmware, O/S, DB etc), to well tested operational procedures (from expansion to field service to advanced networking to customers network) to tested integration with other products (like EMC Data Domain) to support for mixing GPDB and GPHD (Hadoop) in one appliance, to unified admin web GUI for GPDB, GPHD and HW etc etc.
    All this making it easy, fast way to deploy, run and get best end-to-end support (your box is the same as the engineering box), which is what appliances are all about.
    BTW – I find it “purpose built” without any specialized HW. I like that – this way you can run your dev environments on a VM and get all the same features, or deploy a software-only version on commodity servers from any vendor, if you want (maybe want to run GP on IBM servers? we have a reference config for you)
    Anyway, to each his own.
    Ofir (Greenplum)

  6. Curt Monash on August 7th, 2012 9:36 pm

    No, Thomas.

    For example, something without hardware included isn’t an appliance.

  7. just_me on August 8th, 2012 5:21 am

    the main point is not is it an appliance or not, but which kind of appliance is it. While netezza is a datawarehouse appliance, Oracle Exadata is a database appliance.
    The first one runs a DWH out of the box, the second runs a DB out of the box, which can be tuned to run a DWH.

  8. Carl Buellwinkle on August 12th, 2012 12:58 pm

    An appliance is just hardware and software bundled together to make it easier for a customer to have a successful implementation with the least amount of effort and maintenance.

    It’s easy to call Netezza an appliance because they use proprietary hardware and that’s the only way to buy it and their model of an appliance is disposability, sort of like a having a 2 slice toaster, your family grows out of and you need a 4 slice toaster, you throw the 2 slice toaster out.

    Teradata typically considered an appliance because it’s software on commodity hardware is bundle together but can be purchased as software only and run on customer hardware or even downloaded for free from their website. Calling Greenplum DCA not an appliance is the same as calling Teradata not an appliance.

    A gray area would be software like SAP and HANA or Microsoft PDW that only works on “reference hardware”. To me it’s a cop out because while it couples the technology, it decouples the support and responsibility between two companies.

  9. Paul Johnson on August 13th, 2012 5:16 am

    Curt, I admire your restraint, and the quality of the article.

    The most interesting piece of this entire thread would be the name of the clearly misguided ‘prominent company’…which I’m sure we’ll never get to know. Shame!

  10. Data warehouse appliance — analytic glossary draft entry | DBMS 2 : DataBase Management System Services on August 19th, 2012 5:32 pm

    […] Opinions differ as to whether a system has to have some of these further advantages to properly be called an “appliance”. […]

  11. adamo on August 31st, 2012 4:23 pm

    If your Systems Administrator calls it an Appliance, then it is an Appliance. This has a lot to do with how much effort he invested in building it, writing detailed specifications for it and of course the depth of detail he’s determined to go into to understand the inner workings of the combination of the software and hardware at hand.

    Appliance is an abstraction that is more meaningful to system administrators and DBAs than anybody else.

  12. Curt Monash on August 31st, 2012 6:37 pm

    @adamo,

    I love your first sentence! Indeed, your whole argument is good.

  13. lokesh on February 10th, 2013 11:59 pm

    terms of dbms with definition

  14. ramya on November 25th, 2014 10:50 pm

    Can you please explain about basic database terminology like entity attribute field etc.in detail

Leave a Reply




Feed: DBMS (database management system), DW (data warehousing), BI (business intelligence), and analytics technology Subscribe to the Monash Research feed via RSS or email:

Login

Search our blogs and white papers

Monash Research blogs

User consulting

Building a short list? Refining your strategic plan? We can help.

Vendor advisory

We tell vendors what's happening -- and, more important, what they should do about it.

Monash Research highlights

Learn about white papers, webcasts, and blog highlights, by RSS or email.