Comments on: Interpreting the results of data warehouse proofs-of-concept (POCs)

By: Database Customer Benchmarketing Reports | Structured Data

Database Customer Benchmarketing Reports | Structured Data — Fri, 12 Dec 2008 09:00:29 +0000

[…] few weeks ago I read Curt Monash’s report on interpreting the results of data warehouse proofs-of-concept (POCs) and I have to say, I’m quite surprised that this topic hasn’t been covered more by […]

By: Oracle Exadata Storage Server: 485x Faster Than…Oracle Exadata Storage Server. Part I. « Kevin Closson’s Oracle Blog: Platform, Storage & Clustering Topics Related to Oracle Databases

Wed, 10 Dec 2008 23:40:21 +0000

[…] Published December 10, 2008 oracle I recently read an article by Curt Monash entitled Interpreting the results of data warehouse proofs-of-concept (POCs). Curt’s post touched on a topic that continually mystifies me. I’m not sure when the […]

By: Curt Monash

Curt Monash — Sun, 07 Dec 2008 08:15:23 +0000

Dominika,

1. I believe so.
2. Seconds.

CAM

By: Dominika

Dominika — Sun, 07 Dec 2008 02:16:47 +0000

Curt-

Are the numbers with the heading “Old running time” the numbers from the current production environment? I’m wondering if this is another case of comparing new product on new hardware with new vendor supervision/assistance to current product on old hardware w/o vendor assistance.

Are the units minutes or seconds?

By: Curt Monash

Curt Monash — Thu, 20 Nov 2008 13:58:38 +0000

Good comments!

I’ll update the post and spreadsheet w/ a geometric option as soon as I can.

Best,

CAM

By: Andy E

Andy E — Thu, 20 Nov 2008 12:21:48 +0000

Great comment Chris; I couldn’t agree more. As a rule, Vertica uses geometric mean query time (gmqt) to calculate the “nX-times faster” summary (e.g., http://www.vertica.com/benchmarks displays that, although we drop the word “geometric” in the tables to save space for cosmetic reasons).

We’ll make an exception to this if a customer uses another metric (like ‘average query time’)–if that’s what mattered to the customer/evaluator, then who are we to override them.

IT’S ALL ABOUT SLAs…

Another metric we often see, and I think this is what REALLY matters to customers, is related to the service level agreement (SLA) the database application must meet.

It’s the % of queries that run under ‘n’ seconds (or some other unit of time, usually).

In a recent POC, Vertica outperformed a competitive database by 19x (gmqt)–that’s solid, but not very flashy from a marketing perspective.

But the gmqt comparison didn’t matter at all to the prospect. What did matter was that 100% of the queries were answered in under 10 seconds (their performance SLA) vs. 0% for the competitor.

We also see compression (Raw data volume : DB size) measured and compared quite often in POCs. It directly relates to cost of ownership over time.

To sum up, the value of the DBMS to the customer and the motivation to buy it is often based on SLAs (as it should be). Factoring in SLAs (e.g., % of queries that meet SLA) puts the POC results into a context business people will understand (and fund, hopefully).

my 2 cents…

By: Christopher Browne

Christopher Browne — Thu, 20 Nov 2008 01:43:54 +0000

Just as a thought, the natural form of a “mean” for a set of things that are indicating factors/multiples would be a *geometric* mean, computed as…

————————–
/
M = n / f_1 * f_2 * … * f_n
\ /
+

This still suffers from the typical problems of a “mean,” but at least it’s suited to the thing being measured.

The need for artificial weighting goes away; this provides a relatively unbiased measure of the “midpoint” of the multipliers.

Of course the *real* improvement is to have a valuation metric tied to what you’re actually using the system for.

Thus, if you’ve got 15 tests, 14 of which found immaterial effects on their runtimes from the tests, and the 15th of which is Truly Essential to run in under 24 hours, where it doesn’t, now, then *really*, the evaluation of goodness properly ought to be almost totally based on that 15th report’s runtime.

Now, that’s TOTALLY context sensitive. If the DW tool spectacularly improves performance on that one report of yours, there is no reason to expect it to have any particular relevant effect on *my* workload.

What we’d like is some way to be able to generalize from performance on your workload to assert something about expected performance on my workload.

Unfortunately, I’m not sure there’s anything other than a POC that can really get at the low level factors that contribute to the actual behaviour. A geometric mean may be better than an arithmetic one by some small margin, but it seems to me you need *way* more parameters than one number/factor in order to do any generalization.