<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
		>
<channel>
	<title>Comments on: Compare/constrast of Vertica, ParAccel, and Exasol</title>
	<atom:link href="http://www.dbms2.com/2008/08/12/vertica-paraccel-exasol/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.dbms2.com/2008/08/12/vertica-paraccel-exasol/</link>
	<description>Choices in data management and analysis</description>
	<lastBuildDate>Thu, 09 Feb 2012 09:19:33 +0000</lastBuildDate>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.0.3</generator>
	<item>
		<title>By: Marv</title>
		<link>http://www.dbms2.com/2008/08/12/vertica-paraccel-exasol/#comment-116701</link>
		<dc:creator>Marv</dc:creator>
		<pubDate>Mon, 13 Apr 2009 03:35:15 +0000</pubDate>
		<guid isPermaLink="false">http://www.dbms2.com/?p=485#comment-116701</guid>
		<description>Interested to know more about your views on these columnar DB&#039;s and SybaseIQ. Even though everyone bags Sybase, it seems some are holding tightly to SybaseIQ as they have had it for aeon&#039;s in comparison to these new comers.</description>
		<content:encoded><![CDATA[<p>Interested to know more about your views on these columnar DB&#8217;s and SybaseIQ. Even though everyone bags Sybase, it seems some are holding tightly to SybaseIQ as they have had it for aeon&#8217;s in comparison to these new comers.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Infology.Ru &#187; Blog Archive &#187; Сравнение/контраст между Vertica, ParAccel и Exasol</title>
		<link>http://www.dbms2.com/2008/08/12/vertica-paraccel-exasol/#comment-101290</link>
		<dc:creator>Infology.Ru &#187; Blog Archive &#187; Сравнение/контраст между Vertica, ParAccel и Exasol</dc:creator>
		<pubDate>Thu, 06 Nov 2008 04:50:19 +0000</pubDate>
		<guid isPermaLink="false">http://www.dbms2.com/?p=485#comment-101290</guid>
		<description>[...] Автор: Curt Monash  Дата публикации оригинала: 2008-08-12 Источник: Блог Курта Монаша [...]</description>
		<content:encoded><![CDATA[<p>[...] Автор: Curt Monash  Дата публикации оригинала: 2008-08-12 Источник: Блог Курта Монаша [...]</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: More mysteries regarding Oracle CDR load speed &#124; DBMS2 -- DataBase Management System Services</title>
		<link>http://www.dbms2.com/2008/08/12/vertica-paraccel-exasol/#comment-97506</link>
		<dc:creator>More mysteries regarding Oracle CDR load speed &#124; DBMS2 -- DataBase Management System Services</dc:creator>
		<pubDate>Wed, 17 Sep 2008 05:56:38 +0000</pubDate>
		<guid isPermaLink="false">http://www.dbms2.com/?p=485#comment-97506</guid>
		<description>[...] Last spring, DATAllegro user John Devolites of TEOCO told me of troubles his firm had had loading CDRs (Call Detail Records) into Oracle, and how those had been instrumental in his eventual adoption of DATAllegro.  That claim was contemptously challenged in a couple of comment threads. [...]</description>
		<content:encoded><![CDATA[<p>[...] Last spring, DATAllegro user John Devolites of TEOCO told me of troubles his firm had had loading CDRs (Call Detail Records) into Oracle, and how those had been instrumental in his eventual adoption of DATAllegro.  That claim was contemptously challenged in a couple of comment threads. [...]</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Balaji</title>
		<link>http://www.dbms2.com/2008/08/12/vertica-paraccel-exasol/#comment-94262</link>
		<dc:creator>Balaji</dc:creator>
		<pubDate>Fri, 15 Aug 2008 22:12:35 +0000</pubDate>
		<guid isPermaLink="false">http://www.dbms2.com/?p=485#comment-94262</guid>
		<description>Seth. Thanks very much for having a great website. I got to know that Clareos Crosscut is in fact ParAccel Analytic database and as usual googled around and got a technical architecture doc for Crosscut. Amazing.But correct me if i am wrong. Now since i know a lot more about vertica or C-Store, i can truly compare vertica~C-Ctore with Paraccel~Clareos Crosscut. Regarding your comment on pentaho, that was a very generic question i asked to Curt and i know very well that Pentaho does not belong to this Genre of columnar DB. Again a very generic question. I am fascinated by Pentaho because of all the material that their website provided as well as its partnership with both Vertica and Paraccel.
My 2cents -- Paraccel and Vertica are on a collision course because both are great products with superb engineering brains behind them. Now who will blink first? ;-)</description>
		<content:encoded><![CDATA[<p>Seth. Thanks very much for having a great website. I got to know that Clareos Crosscut is in fact ParAccel Analytic database and as usual googled around and got a technical architecture doc for Crosscut. Amazing.But correct me if i am wrong. Now since i know a lot more about vertica or C-Store, i can truly compare vertica~C-Ctore with Paraccel~Clareos Crosscut. Regarding your comment on pentaho, that was a very generic question i asked to Curt and i know very well that Pentaho does not belong to this Genre of columnar DB. Again a very generic question. I am fascinated by Pentaho because of all the material that their website provided as well as its partnership with both Vertica and Paraccel.<br />
My 2cents &#8212; Paraccel and Vertica are on a collision course because both are great products with superb engineering brains behind them. Now who will blink first? <img src='http://www.dbms2.com/wp-includes/images/smilies/icon_wink.gif' alt=';-)' class='wp-smiley' /> </p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Seth Grimes</title>
		<link>http://www.dbms2.com/2008/08/12/vertica-paraccel-exasol/#comment-94052</link>
		<dc:creator>Seth Grimes</dc:creator>
		<pubDate>Thu, 14 Aug 2008 15:18:53 +0000</pubDate>
		<guid isPermaLink="false">http://www.dbms2.com/?p=485#comment-94052</guid>
		<description>Pentaho sells open source and OS-based business intelligence tools.  They use the Mondrian ROLAP server, which relies on a back-end DBMS, but Pentaho does not itself provide database technology.

Therefore Pentaho literally not comparable to Exasol, ParAccel, Vertica.  Pentaho is a different genre of product.</description>
		<content:encoded><![CDATA[<p>Pentaho sells open source and OS-based business intelligence tools.  They use the Mondrian ROLAP server, which relies on a back-end DBMS, but Pentaho does not itself provide database technology.</p>
<p>Therefore Pentaho literally not comparable to Exasol, ParAccel, Vertica.  Pentaho is a different genre of product.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Curt Monash</title>
		<link>http://www.dbms2.com/2008/08/12/vertica-paraccel-exasol/#comment-94034</link>
		<dc:creator>Curt Monash</dc:creator>
		<pubDate>Thu, 14 Aug 2008 12:22:45 +0000</pubDate>
		<guid isPermaLink="false">http://www.dbms2.com/?p=485#comment-94034</guid>
		<description>Re in-memory vs. cache:

I&#039;m pretty sure that, say, Exasol, ParAccel, QlikView, and SAP BI Accelerator all do a lot better jobs than row-based DBMS&#039; caches do.  Compression lets you put more in RAM.  Convincing the cache to preload exactly what you want isn&#039;t always as straightforward as running the right query at the right time.  Etc.

CAM</description>
		<content:encoded><![CDATA[<p>Re in-memory vs. cache:</p>
<p>I&#8217;m pretty sure that, say, Exasol, ParAccel, QlikView, and SAP BI Accelerator all do a lot better jobs than row-based DBMS&#8217; caches do.  Compression lets you put more in RAM.  Convincing the cache to preload exactly what you want isn&#8217;t always as straightforward as running the right query at the right time.  Etc.</p>
<p>CAM</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Curt Monash</title>
		<link>http://www.dbms2.com/2008/08/12/vertica-paraccel-exasol/#comment-94033</link>
		<dc:creator>Curt Monash</dc:creator>
		<pubDate>Thu, 14 Aug 2008 12:17:50 +0000</pubDate>
		<guid isPermaLink="false">http://www.dbms2.com/?p=485#comment-94033</guid>
		<description>Re: EBKAC -- well, it was a phone call. :)

But yeah, I&#039;d say there was something quite confusing about how the statement was framed.  With the numbers that far out of whack, the task has to have been something very different from what we commonly think of as &quot;load&quot;.

CAM</description>
		<content:encoded><![CDATA[<p>Re: EBKAC &#8212; well, it was a phone call. <img src='http://www.dbms2.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' /> </p>
<p>But yeah, I&#8217;d say there was something quite confusing about how the statement was framed.  With the numbers that far out of whack, the task has to have been something very different from what we commonly think of as &#8220;load&#8221;.</p>
<p>CAM</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Dominika</title>
		<link>http://www.dbms2.com/2008/08/12/vertica-paraccel-exasol/#comment-94017</link>
		<dc:creator>Dominika</dc:creator>
		<pubDate>Thu, 14 Aug 2008 09:52:03 +0000</pubDate>
		<guid isPermaLink="false">http://www.dbms2.com/?p=485#comment-94017</guid>
		<description>&lt;b&gt;WRT [possible] great disk-centric performance for ParAccel:&lt;/b&gt;

I&#039;m quite certain they don&#039;t have it, at least not yet.  This is why they only used memory based solutions at the very bottom of the scale factors (100GB, 300GB and 1000GB).

It would seem that the memory based solutions (ParAccel, Exasol) are only effective if all of the required data is in memory.  For example take &lt;a href=&quot;http://www.exasol.com/insurance_customer.html&quot; rel=&quot;nofollow&quot;&gt;this customer benchmark&lt;/a&gt; from Exasol.  The first run of the queries took 20 minutes compared to 24 minutes on the customer&#039;s existing system.  The explanation from Exasol is that during the first run EXASolution &quot;completely reorganized the internal data and performed internal optimizations, e.g. it generated an index&quot;.  Of course, the subsequent runs took significantly less time, but let&#039;s be realistic, that is not a new trick.   Both DB2 and Oracle have features (Query Patroller and Results Cache) that can just return the result if a given query is run more than once.  IMO so called customer benchmarks where the same queries are executed more than once are quite unimpressive.

Now if my data warehouse or data mart follows most, nightly bulk load, then automated KPI reports (or similar) this is there any advantage with a product like this (ParAccel, Exasol)?  I don&#039;t know the answer but I am interested in knowing if you know.

&lt;b&gt;WRT to the validity of TPC-H or conclusions drawn from them:&lt;/b&gt;

Whether or not you think TPC-H is valid or not, there are audited and validated metrics that are in the full disclosure reports that would probably allow you cross-check some of the metrics that you report on.  For instance, in &lt;a href=&quot;http://www.dbms2.com/2008/05/23/data-warehouse-appliance-power-user-teoco/&quot; rel=&quot;nofollow&quot;&gt;your post on TEOCO&lt;/a&gt; you wrote: &lt;blockquote&gt;&quot;&lt;em&gt;Oracle couldn’t get the load time for 100 million call detail records (CDRs) below 24 hours&lt;/em&gt;&quot;&lt;/blockquote&gt; &lt;a href=&quot;http://tpc.org/results/FDR/tpch/hp_tpch_sd_30TB_fdr.pdf&quot; rel=&quot;nofollow&quot;&gt;This full disclosure report&lt;/a&gt; shows that an Oracle database was able to load the data for entire 30TB scale factor (almost &lt;strong&gt;260 billion&lt;/strong&gt; rows) in just over 16 hours.  Loading data is not rocket science, but it appears that with TEOCO there would appear that there was a bit of &lt;a href=&quot;http://www.urbandictionary.com/define.php?term=EBKAC&quot; rel=&quot;nofollow&quot;&gt;EBKAC&lt;/a&gt; going on. This would seem to be also confirmed from &lt;a href=&quot;http://www.dbms2.com/2008/05/23/data-warehouse-appliance-power-user-teoco/#comment-91573&quot; rel=&quot;nofollow&quot;&gt;Paul&#039;s comment&lt;/a&gt;.  Would you agree Curt?</description>
		<content:encoded><![CDATA[<p><b>WRT [possible] great disk-centric performance for ParAccel:</b></p>
<p>I&#8217;m quite certain they don&#8217;t have it, at least not yet.  This is why they only used memory based solutions at the very bottom of the scale factors (100GB, 300GB and 1000GB).</p>
<p>It would seem that the memory based solutions (ParAccel, Exasol) are only effective if all of the required data is in memory.  For example take <a href="http://www.exasol.com/insurance_customer.html" rel="nofollow">this customer benchmark</a> from Exasol.  The first run of the queries took 20 minutes compared to 24 minutes on the customer&#8217;s existing system.  The explanation from Exasol is that during the first run EXASolution &#8220;completely reorganized the internal data and performed internal optimizations, e.g. it generated an index&#8221;.  Of course, the subsequent runs took significantly less time, but let&#8217;s be realistic, that is not a new trick.   Both DB2 and Oracle have features (Query Patroller and Results Cache) that can just return the result if a given query is run more than once.  IMO so called customer benchmarks where the same queries are executed more than once are quite unimpressive.</p>
<p>Now if my data warehouse or data mart follows most, nightly bulk load, then automated KPI reports (or similar) this is there any advantage with a product like this (ParAccel, Exasol)?  I don&#8217;t know the answer but I am interested in knowing if you know.</p>
<p><b>WRT to the validity of TPC-H or conclusions drawn from them:</b></p>
<p>Whether or not you think TPC-H is valid or not, there are audited and validated metrics that are in the full disclosure reports that would probably allow you cross-check some of the metrics that you report on.  For instance, in <a href="http://www.dbms2.com/2008/05/23/data-warehouse-appliance-power-user-teoco/" rel="nofollow">your post on TEOCO</a> you wrote:<br />
<blockquote>&#8220;<em>Oracle couldn’t get the load time for 100 million call detail records (CDRs) below 24 hours</em>&#8220;</p></blockquote>
<p> <a href="http://tpc.org/results/FDR/tpch/hp_tpch_sd_30TB_fdr.pdf" rel="nofollow">This full disclosure report</a> shows that an Oracle database was able to load the data for entire 30TB scale factor (almost <strong>260 billion</strong> rows) in just over 16 hours.  Loading data is not rocket science, but it appears that with TEOCO there would appear that there was a bit of <a href="http://www.urbandictionary.com/define.php?term=EBKAC" rel="nofollow">EBKAC</a> going on. This would seem to be also confirmed from <a href="http://www.dbms2.com/2008/05/23/data-warehouse-appliance-power-user-teoco/#comment-91573" rel="nofollow">Paul&#8217;s comment</a>.  Would you agree Curt?</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Curt Monash</title>
		<link>http://www.dbms2.com/2008/08/12/vertica-paraccel-exasol/#comment-93936</link>
		<dc:creator>Curt Monash</dc:creator>
		<pubDate>Wed, 13 Aug 2008 22:50:52 +0000</pubDate>
		<guid isPermaLink="false">http://www.dbms2.com/?p=485#comment-93936</guid>
		<description>I haven&#039;t talked w/ Pentaho.  Both Lance Walter and I have been guilty at various times over the past year of being slow getting back to each other.  I&#039;m the guiltier of the two.

CAM</description>
		<content:encoded><![CDATA[<p>I haven&#8217;t talked w/ Pentaho.  Both Lance Walter and I have been guilty at various times over the past year of being slow getting back to each other.  I&#8217;m the guiltier of the two.</p>
<p>CAM</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Curt Monash</title>
		<link>http://www.dbms2.com/2008/08/12/vertica-paraccel-exasol/#comment-93935</link>
		<dc:creator>Curt Monash</dc:creator>
		<pubDate>Wed, 13 Aug 2008 22:50:04 +0000</pubDate>
		<guid isPermaLink="false">http://www.dbms2.com/?p=485#comment-93935</guid>
		<description>Doug,

Perhaps I should have said that most conclusions drawn from TPCs are jokes.  I wouldn&#039;t say that TPCs provide no evidence for any claim at any time.

But if you think about it, in the post I mainly was illustrating what TPCs did NOT show -- namely, great disk-centric performance for ParAccel.  They may have it, but the TPCs don&#039;t show that, because the TPCs weren&#039;t done on a disk-centric configuration.

CAM</description>
		<content:encoded><![CDATA[<p>Doug,</p>
<p>Perhaps I should have said that most conclusions drawn from TPCs are jokes.  I wouldn&#8217;t say that TPCs provide no evidence for any claim at any time.</p>
<p>But if you think about it, in the post I mainly was illustrating what TPCs did NOT show &#8212; namely, great disk-centric performance for ParAccel.  They may have it, but the TPCs don&#8217;t show that, because the TPCs weren&#8217;t done on a disk-centric configuration.</p>
<p>CAM</p>
]]></content:encoded>
	</item>
</channel>
</rss>

