<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>DBMS 2 : DataBase Management System Services &#187; ParAccel</title>
	<atom:link href="http://www.dbms2.com/category/products-and-vendors/paraccel/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.dbms2.com</link>
	<description>Choices in data management and analysis</description>
	<lastBuildDate>Thu, 02 Sep 2010 09:06:44 +0000</lastBuildDate>
	<generator>http://wordpress.org/?v=2.8.4</generator>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
			<item>
		<title>Links and observations</title>
		<link>http://www.dbms2.com/2010/08/09/links-and-observations/</link>
		<comments>http://www.dbms2.com/2010/08/09/links-and-observations/#comments</comments>
		<pubDate>Tue, 10 Aug 2010 02:37:51 +0000</pubDate>
		<dc:creator>Curt Monash</dc:creator>
				<category><![CDATA[Analytic technologies]]></category>
		<category><![CDATA[Aster Data]]></category>
		<category><![CDATA[Calpont]]></category>
		<category><![CDATA[Cassandra]]></category>
		<category><![CDATA[Data warehouse appliances]]></category>
		<category><![CDATA[Data warehousing]]></category>
		<category><![CDATA[EMC]]></category>
		<category><![CDATA[Exadata]]></category>
		<category><![CDATA[Facebook]]></category>
		<category><![CDATA[Greenplum]]></category>
		<category><![CDATA[HP and Neoview]]></category>
		<category><![CDATA[Kickfire]]></category>
		<category><![CDATA[NoSQL]]></category>
		<category><![CDATA[Northscale]]></category>
		<category><![CDATA[OLTP]]></category>
		<category><![CDATA[ParAccel]]></category>
		<category><![CDATA[Sybase]]></category>
		<category><![CDATA[XtremeData]]></category>

		<guid isPermaLink="false">http://www.dbms2.com/?p=2743</guid>
		<description><![CDATA[I&#8217;m back from a trip to the SF Bay area, with a lot of writing ahead of me. I&#8217;ll dive in with some quick comments here, then write at greater length about some of these points when I can. From my trip:  

Aster Data showed me a lot of customer names and deal sizes, across [...]]]></description>
			<content:encoded><![CDATA[<p>I&#8217;m back from a trip to the SF Bay area, with a lot of writing ahead of me. I&#8217;ll dive in with some quick comments here, then write at greater length about some of these points when I can. From my trip:  <span id="more-2743"></span></p>
<ul>
<li>Aster Data showed me a lot of customer names and deal sizes, across a bunch of industries (mainly enterprise rather than web). Yes, Aster&#8217;s market success is for real. (But almost all those details are NDA.)</li>
<li>Sybase&#8217;s product plans for IQ are pretty impressive. (But the most interesting parts are, you guessed it, NDA.)</li>
<li>I&#8217;ve kissed and made up* with ParAccel, now that they&#8217;ve replaced their CEO, replaced their marketing chief, and stopped the worst of the <a href="http://www.dbms2.com/2010/01/15/there-sure-seem-to-be-a-lot-of-inaccuracies-on-paraccels-website/" >marketing</a> <a href="http://www.dbms2.com/2009/06/22/the-tpc-h-benchmark-is-a-blight-upon-the-industry/" >nonsense</a> I used to complain about. ParAccel has some interesting plans for ParAccel 3.0 which are, naturally, NDA.</li>
<li>The Peoplesoft guys are doing it over again at Workday. Only this time, their platform isn&#8217;t a relational DBMS. Rather, it&#8217;s an in-memory, completely object-oriented data model, with disk used only on a &#8220;Just in case the power ever goes out&#8221; basis. (Thankfully, nothing at all about our conversation was NDA.)</li>
<li>I&#8217;m finally feeling good about <a href="# I spent considerable time  with my clients at both Greenplum and EMC (if we ignore the fact that  the deal has closed and they're now the same company). I also had more  of  a hardcore engineering discussion than I've had with Greenplum for  quite a while (I should have been pushier about that earlier). Takeaways  included:      * This is starting off as a honeymoon deal. Everything  Greenplum was planning to do is being continued. Additional resources  are being poured into Greenplum to do more.     * Some Greenplum execs  seem to envision staying long term, some seem to envision moving on to  their next startups. The ones who envision moving on are, however, going  to work hard first to make the merger a success.     * Greenplum has,  for quite a while, had more of an advanced analytics/embedded predictive  modeling story than I realized. Bad on them for not fleshing it out  more in marketing and product packaging alike.     * Greenplum both  denies the concurrency problems I previously noted and also has a very  credible story as to how it will eliminate them. :) Seriously, Greenplum  tells of one customer that routinely runs 150 simultaneously queries -  on what I think is not a terribly big system -- and a number of POCs  (Proofs of Concept) that simulated similar levels of concurrency.">Northscale&#8217;s  memcached-compatible persistent store Membase</a>. The main reason is  that they showed me a near-term path to interfaces that are richer than  key-value. Also, Todd Hoff reassured me that even pure persistent  memcached has a place.</li>
<li>Rumor says that even the one app for which Facebook was using Cassandra &#8212; in-box search &#8212; has been decommissioned. On the other hand, numerous other scale-0ut DBMS (SQL or otherwise) seem to have Facebook footholds. But details are &#8212; all together now! &#8212; NDA.</li>
</ul>
<p><em>*If you know ParAccel&#8217;s new marketing chief Michael Weir, you  surely guessed I mean that only in a figurative sense.</em></p>
<p>From elsewhere:</p>
<ul>
<li>Daniel Abadi offered <a href="http://dbmsmusings.blogspot.com/2010/08/thoughts-on-kickfires-apparent-demise.html" onclick="javascript:pageTracker._trackPageview('/dbmsmusings.blogspot.com');">his  analysis</a> of <a href="../2010/07/27/kickfire-unlikely-to-survive/">Kickfire&#8217;s  demise</a>. In general I agree, but Daniel neglected to mention one  hugely important factor &#8212; the chicken-egg negative effect of Kickfire&#8217;s  lack of market or marketing traction. Customers were extremely reluctant to buy from Kickfire  because they perceived, correctly, that Kickfire&#8217;s survivability was far  from assured.</li>
<li>While the <a href="http://infinidb.org/community/forums/11-general-infinidb/1000-strange-issue-with-drop-table" onclick="javascript:pageTracker._trackPageview('/infinidb.org');">InfiniDB forums</a> suggest that there are at least a couple of production users of Calpont&#8217;s free InfiniDB, Calpont seemingly has a long way to go to be even as successful as Kickfire. But Calpont does have a bit of money to spend on lead generation; maybe some day they&#8217;ll even have actual customers.</li>
<li>In a response to a question I messaged over, <a href="http://www.dbms2.com/2010/03/18/xtremedata-update/" >XtremeData</a> tells me they have actual customers now. Press releases to follow.</li>
<li>The <a href="http://news.cnet.com/8301-31021_3-20013111-260.html?part=rss&amp;subj=news&amp;tag=2547-1_3-0-20" onclick="javascript:pageTracker._trackPageview('/news.cnet.com');">admiration for the job Mark Hurd did at HP</a> is in my opinion overstated. Sure, the financial/operational management appeared to work, but HP did little on Hurd&#8217;s watch to strengthen its reputation or customers&#8217; loyalty. In particular:
<ul>
<li>HP&#8217;s analytics efforts have accomplished little.</li>
<li>HP&#8217;s data warehouse appliance efforts have failed pathetically.</li>
<li>From what I hear, HP&#8217;s execution in its Exadata partnership was not good.</li>
<li>HP&#8217;s server business in general is distinguished mainly by HP being a big company.</li>
<li>HP&#8217;s EDS acquisition has been rocky, not that EDS was sailing so smoothly on its own beforehand.</li>
<li>HP&#8217;s success in PCs amounts to &#8220;arguably, HP sucks a little less than the other guys&#8221;.</li>
<li>HP&#8217;s elite reputation is long gone (admittedly, for the most part that predates Hurd).</li>
</ul>
</li>
<li><a href="http://intelligent-enterprise.informationweek.com/blog/archives/2010/08/software_innova.html" onclick="javascript:pageTracker._trackPageview('/intelligent-enterprise.informationweek.com');">Doug Henschen</a> evidently favors really strong intellectual property protection for software, even forbidding plug-compatible reverse engineering. I agree with Doug up to the point that <a href="http://www.monashreport.com/2010/07/19/my-view-of-intellectual-property/" onclick="javascript:pageTracker._trackPageview('/www.monashreport.com');">it should be forbidden to copy proprietary software</a>, but I don&#8217;t see why he (or a court) would view such behavior as copying.</li>
</ul>
]]></content:encoded>
			<wfw:commentRss>http://www.dbms2.com/2010/08/09/links-and-observations/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>What kinds of data warehouse load latency are practical?</title>
		<link>http://www.dbms2.com/2010/06/21/data-warehouse-load-latency/</link>
		<comments>http://www.dbms2.com/2010/06/21/data-warehouse-load-latency/#comments</comments>
		<pubDate>Mon, 21 Jun 2010 12:15:17 +0000</pubDate>
		<dc:creator>Curt Monash</dc:creator>
				<category><![CDATA[Analytic technologies]]></category>
		<category><![CDATA[Aster Data]]></category>
		<category><![CDATA[Columnar database management]]></category>
		<category><![CDATA[Data warehousing]]></category>
		<category><![CDATA[IBM and DB2]]></category>
		<category><![CDATA[Netezza]]></category>
		<category><![CDATA[ParAccel]]></category>
		<category><![CDATA[Vertica Systems]]></category>

		<guid isPermaLink="false">http://www.dbms2.com/?p=2319</guid>
		<description><![CDATA[I took advantage of my recent conversations with Netezza and IBM to discuss what kinds of data warehouse load latency were practical. In both cases I got the impression:

Subsecond load latency is 	substantially impossible. Doing that amounts to OLTP.
5 seconds or so is doable with 	aggressive investment and tuning.
Several minute load latency is 	pretty easy.
10-15 [...]]]></description>
			<content:encoded><![CDATA[<p style="margin-bottom: 0in;">I took advantage of my recent conversations with <a href="http://www.dbms2.com/2010/06/21/netezza-database-software-technology-overview/" >Netezza</a> and <a href="http://www.dbms2.com/2010/06/21/netezza-ibm-db2-compression/" >IBM</a> to discuss what kinds of data warehouse load latency were practical. In both cases I got the impression:</p>
<ul>
<li>Subsecond load latency is 	substantially impossible. Doing that amounts to OLTP.</li>
<li>5 seconds or so is doable with 	aggressive investment and tuning.</li>
<li>Several minute load latency is 	pretty easy.</li>
<li>10-15 minute latency or longer is 	now very routine.</li>
</ul>
<p style="margin-bottom: 0in;">There&#8217;s generally a throughput/latency tradeoff, so if you want very low latency with good throughput, you may have to throw a lot of hardware at the problem.</p>
<p style="margin-bottom: 0in;">I&#8217;d expect to hear similar things from any other vendor with reasonably mature analytic DBMS technology. Low-latency load is a problem for columnar systems, but both <a href="http://www.dbms2.com/2008/08/12/vertica-paraccel-exasol/" >Vertica <span style="font-style: normal;">and</span> ParAccel</a> designed in workarounds from the getgo. Aster Data probably didn&#8217;t meet these criteria until <a href="http://www.dbms2.com/2009/10/30/aster-data-application-server-ncluster/" >Version 4.0</a>, its old “<a href="http://www.dbms2.com/2008/10/22/aster-data-systems-ncluster/" >frontline</a>” positioning notwithstanding, but I think it does now.</p>
<p style="margin-bottom: 0in;"><em><strong>Related link</strong></em></p>
<ul>
<li>
<p style="margin-bottom: 0in;"><a href="http://www.dbms2.com/2009/09/10/analytic-speed-latency/" >Just what is your need for speed</a> anyway?</p>
</li>
</ul>
]]></content:encoded>
			<wfw:commentRss>http://www.dbms2.com/2010/06/21/data-warehouse-load-latency/feed/</wfw:commentRss>
		<slash:comments>4</slash:comments>
		</item>
		<item>
		<title>Best practices for analytic DBMS POCs</title>
		<link>http://www.dbms2.com/2010/06/14/best-practices-analytic-database-poc/</link>
		<comments>http://www.dbms2.com/2010/06/14/best-practices-analytic-database-poc/#comments</comments>
		<pubDate>Mon, 14 Jun 2010 12:53:33 +0000</pubDate>
		<dc:creator>Curt Monash</dc:creator>
				<category><![CDATA[Benchmarks and POCs]]></category>
		<category><![CDATA[Data warehousing]]></category>
		<category><![CDATA[Exadata]]></category>
		<category><![CDATA[Netezza]]></category>
		<category><![CDATA[ParAccel]]></category>
		<category><![CDATA[Teradata]]></category>

		<guid isPermaLink="false">http://www.dbms2.com/?p=2297</guid>
		<description><![CDATA[When you are selecting an analytic DBMS or appliance, most of the evaluation boils down to two questions:

How quickly 	and cost-effectively does it execute SQL?
What 	analytic functionality, SQL or otherwise, does it do a good job of 	executing?

And so, in undertaking such a selection, you need to start by addressing three issues:

What 	does “speed” mean [...]]]></description>
			<content:encoded><![CDATA[<p style="margin-bottom: 0in;">When you are selecting an analytic DBMS or appliance, most of the evaluation boils down to two questions:</p>
<ul>
<li>How q<span style="font-style: normal;">uickly 	and cost-effectively does it execute SQL?</span></li>
<li><span style="font-style: normal;">What 	analytic functionality, SQL or otherwise, does it do a good job of 	executing?</span></li>
</ul>
<p style="margin-bottom: 0in;"><span style="font-style: normal;">And so, in undertaking such a selection, you need to start by addressing three issues:</span></p>
<ul>
<li><a href="../2009/09/10/analytic-speed-latency/">What 	does “speed” mean to you</a>?</li>
<li>What does “cost” mean to you?</li>
<li>What analytic functionality do you 	need anyway?</li>
</ul>
<p style="margin-bottom: 0in;"><span id="more-2297"></span>Key elements of cost* include:</p>
<ul>
<li>Software license and maintenance</li>
<li>Hardware purchase cost, 	maintenance, electric power, and computer room burden</li>
<li>Database and system administration</li>
<li>(For some uses cases) Programming</li>
</ul>
<p style="margin-bottom: 0in;"><em>*Assuming a classical in-house IT shop, where products are typically bought rather than leased/rented. With outsourced and/or monthly-fee structures, the details change but the principles remain the same.</em></p>
<p style="margin-bottom: 0in;"><em></em>Most of that can be evaluated pretty well via a spreadsheet, although things can get a bit tricky when you get to people costs, which are a large fraction of the whole. In particular, different analytic DBMS product suites have great, high-performance support for different (and often rapidly growing) sets of functionality – basic and advanced SQL, statistics, and more. Figuring out which ones will be best for your programmers, and how significant the differences are &#8212; well, that&#8217;s a lot like any other programming language evaluation, and those are rarely neat or clean-cut.</p>
<p style="margin-bottom: 0in; font-style: normal;">But when it comes to evaluating speed, <strong>there&#8217;s no substitute for a well-designed proof of concept (POC).</strong> Many analytic DBMS and appliance vendors are happy to let you do a POC, on your own premises (or remotely if you prefer), under your control, at no cost to you. And that&#8217;s great. <strong>It is crucial that a POC be run either by you, by a consultant* answerable to you,</strong><span style="font-weight: normal;"> or – if you decide the vendor must run it for you – at least </span><strong>with you watching every step of the way</strong><span style="font-weight: normal;"> and knowing exactly what is being done. Applianc</span>e vendors do find it cheaper to run POCs on their own premises, so a certain reluctance to ship you a box is understandable. But <strong>make no compromises about the transparency of a POC, or about your control of exactly what it is that gets tested.</strong></p>
<p style="margin-bottom: 0in;"><em>*Since I sell <a href="http://www.monash.com/adviseusers.html" onclick="javascript:pageTracker._trackPageview('/www.monash.com');">consulting services</a> for users evaluating analytic DBMS, I naturally am biased to think that consultants can be very useful in the process. <img src='http://www.dbms2.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  But whether you should use them a little (sanity check), a medium amount (work with you through the process), or heavily (actually drive the process for you and/or execute the POCs) is very dependent upon your specific situation.</em></p>
<p style="margin-bottom: 0in; font-style: normal;">So far as I&#8217;ve been able to tell:</p>
<ul>
<li><span style="font-style: normal;">Netezza 	loves to ship boxes to prospects for POCs, and have them set up the 	boxes and do POCs themselves. That&#8217;s a big reason why <a href="../2009/02/18/the-netezza-guys-propose-a-poc-checklist/">Netezza 	wants to call attention to this subject</a>.</span></li>
<li><span style="font-style: normal;">Oracle 	has generally been pretty <a href="../2009/02/01/oracle-says-they-do-onsite-exadata-pocs-after-all/">reluctant 	to ship Exadata boxes out for POCs</a>. That&#8217;s the other reason 	Netezza wants to call attention to the issue. <img src='http://www.dbms2.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' /> </span></li>
<li><span style="font-style: normal;">Open 	source vendors make it easy for you to download and test at least 	their community editions.</span></li>
<li><span style="font-style: normal;">Vertica 	makes it pretty easy for you to test its software too (download or 	cloud).</span></li>
<li><span style="font-style: normal;">ParAccel 	has generally insisted on running POCs itself, although it will do 	so on your premises if you insist.</span></li>
<li><span style="font-style: normal;">Teradata 	naturally tries to do POCs on its own premises, but doesn&#8217;t insist 	too hard.<em> (Edit: Randy Lea of Teradata says that Teradata is now doing over half its POCs onsite.)</em><br />
</span></li>
</ul>
<p style="margin-bottom: 0in;"><span style="font-style: normal;">Most of the criticisms I&#8217;ve heard of vendors&#8217; POC practices have been directed at Oracle or ParAccel.</span></p>
<p style="margin-bottom: 0in;"><span style="font-style: normal;">For most POCs, it&#8217;s a good conceptual template to </span><span style="font-style: normal;"><strong>form and then test a hypothesis</strong></span><span style="font-style: normal;"> to the effect of:</span></p>
<ul>
<li><span style="font-style: normal;">For 	a given technology product assemblage (brand of DBMS, number of 	nodes, etc.), and</span></li>
<li><span style="font-style: normal;">For 	a given level of human effort (e.g., administrative effort), you can</span></li>
<li><span style="font-style: normal;">Run 	a given a workload, with</span></li>
<li><span style="font-style: normal;">Satisfactory 	and satisfactorily consistent response times</span></li>
</ul>
<p style="margin-bottom: 0in;"><span style="font-style: normal;">Sometimes absolute throughput and price/performance are important </span><em>secondary</em><span style="font-style: normal;"> considerations; sometimes they&#8217;re less germane. But either way, it&#8217;s almost always right to focus </span><em>primarily</em><span style="font-style: normal;"> on the questions of </span><span style="font-style: normal;"><strong>“What do I want this system to do?”</strong></span><span style="font-style: normal;"> and </span><span style="font-style: normal;"><strong>“What do I think we&#8217;re going to have to invest in it?</strong></span><span style="font-style: normal;">” By way of contrast, it&#8217;s often misleading to focus too much on questions like “<a href="../2008/11/19/data-warehouse-proof-of-concept-pocs/">What&#8217;s the one number that best describes the performance of this system?</a>” &#8212; even if you customize that calculation for your environment – or, even worse, “How much speed-up can I get on my single worst <a href="../2008/11/15/query-from-hell/">Query from Hell</a>?” </span></p>
<p style="margin-bottom: 0in;"><span style="font-style: normal;">The fundamental rule of POC construction is: </span><span style="font-style: normal;"><strong>Model your entire use case as best you can.</strong></span><span style="font-style: normal;"> That means you need to consider, at a minimum:</span></p>
<ul>
<li><span style="font-style: normal;">Your 	whole concurrent query, other analytic, and low-latency update 	workload (peak).</span></li>
<li><span style="font-style: normal;">Your 	whole query, analytic, load, backup, and maintenance workload 	(ongoing).</span></li>
<li><span style="font-style: normal;"><a href="../2008/12/14/the-%E2%80%9Cbaseball-bat%E2%80%9D-test-for-analytic-dbms-and-data-warehouse-appliances/">Partial-failure 	scenarios</a>.</span></li>
<li><span style="font-style: normal;">Your 	core SLAs (Service-Level Agreements).</span></li>
</ul>
<p style="margin-bottom: 0in;"><span style="font-style: normal;">Of course, that&#8217;s not as easy as it sounds. Presumably, the main reason you&#8217;re getting a new analytic DBMS is that you want to do new kinds of analysis. By the very nature of analytics, you won&#8217;t know what analytic operations are most useful until you try them out and see what their results are. On the other hand – if you haven&#8217;t done considerable thinking about how you&#8217;re going to use your new analytic database, how did you ever get funding for the project in the first place? <img src='http://www.dbms2.com/wp-includes/images/smilies/icon_wink.gif' alt=';)' class='wp-smiley' /> </span></p>
<p style="margin-bottom: 0in;"><span style="font-style: normal;">Seriously, I could write multiple posts, each as long as this one (but more application-oriented), about how to upgrade your analytic capabilities (and which fool&#8217;s gold to avoid). But this has gotten pretty long already, so for now I&#8217;ll just stop here.</span></p>
<p style="margin-bottom: 0in;"><em>Note: My clients at Netezza asked me to write something short about POCs they could use as a kind of foreword to some collateral, where by &#8220;short&#8221; they meant single-paragraph or something like that. They&#8217;re great clients, so I said yes, under the condition I could also use it as a blog post. Except … this post didn&#8217;t turn out to be nearly as short as they envisioned. Oops. <img src='http://www.dbms2.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' /> </em></p>
<p style="margin-bottom: 0in;"><em><strong>Related links</strong></em></p>
<ul>
<li>
<p style="margin-bottom: 0in;"><span style="font-style: normal;">My 	February, 2009 <a href="../2009/02/25/even-more-final-version-of-my-tdwi-slide-deck/">slide 	deck on how to select an analytic DBMS</a> is in many parts still 	pretty current</span></p>
</li>
</ul>
]]></content:encoded>
			<wfw:commentRss>http://www.dbms2.com/2010/06/14/best-practices-analytic-database-poc/feed/</wfw:commentRss>
		<slash:comments>5</slash:comments>
		</item>
		<item>
		<title>Quick reactions to SAP acquiring Sybase</title>
		<link>http://www.dbms2.com/2010/05/12/sap-acquire-sybase/</link>
		<comments>http://www.dbms2.com/2010/05/12/sap-acquire-sybase/#comments</comments>
		<pubDate>Wed, 12 May 2010 23:48:37 +0000</pubDate>
		<dc:creator>Curt Monash</dc:creator>
				<category><![CDATA[ANTs Software]]></category>
		<category><![CDATA[Analytic technologies]]></category>
		<category><![CDATA[Business Objects]]></category>
		<category><![CDATA[Business intelligence]]></category>
		<category><![CDATA[Columnar database management]]></category>
		<category><![CDATA[Data warehousing]]></category>
		<category><![CDATA[In-memory DBMS]]></category>
		<category><![CDATA[Memory-centric data management]]></category>
		<category><![CDATA[OLTP]]></category>
		<category><![CDATA[ParAccel]]></category>
		<category><![CDATA[SAP AG]]></category>
		<category><![CDATA[Sybase]]></category>
		<category><![CDATA[Theory and architecture]]></category>
		<category><![CDATA[Vertica Systems]]></category>

		<guid isPermaLink="false">http://www.dbms2.com/?p=2105</guid>
		<description><![CDATA[SAP is acquiring Sybase. On the conference call SAP said Sybase would be run as a separate division of SAP (no surprise). Most of the focus was on Sybase&#8217;s mobile technology, which is forecast at &#62;$400 million in 2010 revenues (which would be 30%ish of the total). My quick reactions include: 

Sybase&#8217;s main businesses are:

Classic [...]]]></description>
			<content:encoded><![CDATA[<p>SAP is acquiring Sybase. On the conference call SAP said Sybase would be run as a separate division of SAP (no surprise). Most of the focus was on Sybase&#8217;s mobile technology, which is forecast at &gt;$400 million in 2010 revenues (which would be 30%ish of the total). My quick reactions include: <span id="more-2105"></span></p>
<ul>
<li>Sybase&#8217;s main businesses are:
<ul>
<li><strong>Classic OLTP DBMS</strong> (Sybase ASE, for Adapative Server Enterprise, unless I&#8217;ve missed yet another name change).</li>
<li><strong>Analytic technology</strong> &#8212; mainly <strong>Sybase IQ,</strong> but more generally <a href="http://www.dbms2.com/2010/02/05/sybase-aleri-rap/" >Sybase RAP</a>.</li>
<li><strong>Mobile technology. </strong>(The frequently renamed small DBMS SQL Anywhere was the foundational product of and still is included in the mobile division.)</li>
</ul>
</li>
<li><a href="http://www.dbms2.com/2009/07/07/hasso-plattner-calls-for-in-memory-oltp-column-stores/" >SAP&#8217;s thoughts on in-memory database management</a> are interesting. However, I think SAP&#8217;s oft-repeated claim that it has a lot of important in-memory database technology to bring to Sybase (or for that matter SAP customers) is mainly smoke and mirrors. <strong>Cool data access methods, good niche database products, and broadly applicable multi-domain DBMS innovations are three different things.</strong> Granting that SAP probably has the first and thinks it has the second is not the same as giving it much credence for having the third.</li>
<li>SAP claims that, 15 years after its refusal to support Sybase turned Sybase into a DBMS also-ran, it by now is &#8220;relatively simple&#8221; to port SAP&#8217;s apps to Sybase ASE, and that they will make that happen. I actually believe that <strong>SAP&#8217;s apps will soon run on Sybase ASE,</strong> where by &#8220;soon&#8221; I mean &#8220;in a couple of years for no-apologies general availability.&#8221; (Certifying a DBMS for SAP is a long process.) The main missing features &#8212; e.g., row-level locking &#8212; were already put into Sybase back in the last millenium. Nor could there be fundamental architectural problems that keep SAP from supporting Sybase ASE, or else SAP couldn&#8217;t have supported Microsoft SQL Server (which, long ago, was a Sybase fork).</li>
<li><strong>I don&#8217;t see any market or competitive dynamics that would lead the SAP acquisition to hurt Sybase&#8217;s ASE or mobile businesses. </strong>General merger management mishegas is, of course, always a possibility.</li>
<li>SAP Business Objects partners with Sybase IQ&#8217;s competitors. That could be a problem. However, <strong>coopetition is pretty strong in the business intelligence market</strong>. I don&#8217;t think any of SAP Business Objects, IBM Cognos, or Oracle Business Intelligence are much held back from partnering by competitive dislike of their parent companies.</li>
<li><strong>The rest of SAP might be able to drum up some extra business for Sybase IQ.</strong></li>
<li><strong>It would be natural for IBM/Cognos to now buy a columnar DBMS of its own.</strong> Vertica is an obvious first choice. ParAccel would surely come much cheaper. Since ParAccel has little chance of surviving as an independent company &#8212; <a href="http://www.dbms2.com/2010/04/16/story-of-an-analytic-dbms-evaluation/" >too immature</a> and too little differentiation to overcome that &#8212; I&#8217;d expect ParAccel&#8217;s board to jump at the chance to sell out.</li>
<li>It would be interesting if SAP Business Objects would revive the <a href="http://www.dbms2.com/2009/03/25/aleri-update/" >CEP-based BI</a> idea.</li>
<li>I gather Sybase&#8217;s AnswersAnywhere concept network/object model-based natural language/speech recognition technology never went anywhere. Unsurprising (it seemed like it needed too much hand-building to scale semantically), but regrettable even so.</li>
<li>I don&#8217;t see anything in this acquisition that would revive PowerBuilder (Sybase&#8217;s Visual Basic competitor), Sybase&#8217;s CASE (Computer-Aided Software Engineering) tools, and so on.</li>
<li>And on the personal side &#8212; I&#8217;ll probably lose Sybase as a customer due to this merger, but it could have been worse. A lot of vendors smaller than Sybase are bigger customers for Monash Research.</li>
</ul>
<p><em>Edit: Right after I posted this, I saw email from Sybase clarifying that Sybase&#8217;s in-memory technology, while slightly influenced by some ANTs IP Sybase bought non-exclusive rights to, is essentially home-grown. That&#8217;s what I thought, but the call sounded like it was saying something different.</em></p>
<p><strong><em>Further coverage of SAP/Sybase:</em></strong></p>
<ul>
<li><a href="http://www.dbms2.com/2010/05/13/sap-database-proliferation/" >SAP believes in database proliferation</a></li>
<li><a href="http://www.dbms2.com/2010/05/13/sap-sybase-reactions/" >More quick reactions to SAP/Sybase</a></li>
<li><a href="http://www.dbms2.com/2010/05/17/technical-basics-of-sybase-iq/" >Technical basics of Sybase IQ</a><strong><em><br />
</em></strong></li>
</ul>
]]></content:encoded>
			<wfw:commentRss>http://www.dbms2.com/2010/05/12/sap-acquire-sybase/feed/</wfw:commentRss>
		<slash:comments>11</slash:comments>
		</item>
		<item>
		<title>Story of an analytic DBMS evaluation</title>
		<link>http://www.dbms2.com/2010/04/16/story-of-an-analytic-dbms-evaluation/</link>
		<comments>http://www.dbms2.com/2010/04/16/story-of-an-analytic-dbms-evaluation/#comments</comments>
		<pubDate>Sat, 17 Apr 2010 02:56:56 +0000</pubDate>
		<dc:creator>Curt Monash</dc:creator>
				<category><![CDATA[Analytic technologies]]></category>
		<category><![CDATA[Benchmarks and POCs]]></category>
		<category><![CDATA[Buying processes]]></category>
		<category><![CDATA[Data warehousing]]></category>
		<category><![CDATA[Greenplum]]></category>
		<category><![CDATA[Netezza]]></category>
		<category><![CDATA[Oracle]]></category>
		<category><![CDATA[ParAccel]]></category>
		<category><![CDATA[Vertica Systems]]></category>

		<guid isPermaLink="false">http://www.dbms2.com/?p=1900</guid>
		<description><![CDATA[One of our readers was kind enough to walk me through his analytic DBMS evaluation process. The story is:

The X Company (XCo) has a &#60;1 TB 	database.
100s of XCo&#8217;s customers log in at 	once to run reports. 50-200 concurrent queries is a good target 	number.
XCo had been “suffering” with 	Oracle and wanted to upgrade.
XCo didn&#8217;t [...]]]></description>
			<content:encoded><![CDATA[<p style="margin-bottom: 0in;">One of our readers was kind enough to walk me through his analytic DBMS evaluation process. The story is:</p>
<ul>
<li>The X Company (XCo) has a &lt;1 TB 	database.</li>
<li>100s of XCo&#8217;s customers log in at 	once to run reports. 50-200 concurrent queries is a good target 	number.</li>
<li>XCo had been “suffering” with 	Oracle and wanted to upgrade.</li>
<li>XCo didn&#8217;t have a lot of money to 	spend. <strong>Netezza</strong> pulled out of the sales cycle early due to 	budget (and this was recently enough that Netezza <a href="http://www.dbms2.com/2010/01/25/netezza-skimmer/" >Skimmer</a> could have been bid).</li>
<li><strong>Greenplum</strong> didn&#8217;t offer any 	references that approached the desired number of concurrent users.</li>
<li>Ultimately the evaluation came 	down to <strong>Vertica</strong> and <strong>ParAccel.</strong></li>
<li><strong>Vertica won.</strong></li>
</ul>
<p style="margin-bottom: 0in; font-weight: normal;">Notes on the Vertica vs. ParAccel selection include:<span id="more-1900"></span></p>
<ul>
<li>ParAccel sent an engineer on-site 	to do a proof-of-concept (POC), and generally competed very hard for 	the deal.</li>
<li>Vertica dropped by for a sales 	call once, and let XCo do the Vertica POC itself.</li>
<li>Not surprisingly, XCo got the 	impression that Vertica was easier to set up and administer than 	ParAccel.</li>
<li>Also, when ParAccel emphasized 	architectural features such as custom “backplane” and compiled 	queries, XCo got the impression – right or wrong – that 	ParAccel&#8217;s performance was more brittle or situational than 	Vertica&#8217;s.</li>
<li>ParAccel was modestly faster than 	Vertica in the POC. (I think &#8212; Vertica&#8217;s numbers were described as being &#8220;very competitive.&#8221;)</li>
<li>In multiple ways, Vertica gave the 	impression of greater product and vendor maturity than ParAccel.</li>
</ul>
<p style="margin-bottom: 0in;">My contact continues to be interested in all things Greenplum, and has recommended <a href="http://www.dbms2.com/2009/10/19/greenplum-free-single-node-edition/" >Greenplum Single-Node Edition</a> to his analyst colleagues.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.dbms2.com/2010/04/16/story-of-an-analytic-dbms-evaluation/feed/</wfw:commentRss>
		<slash:comments>7</slash:comments>
		</item>
		<item>
		<title>Vertica update</title>
		<link>http://www.dbms2.com/2010/03/19/vertica-update-4/</link>
		<comments>http://www.dbms2.com/2010/03/19/vertica-update-4/#comments</comments>
		<pubDate>Fri, 19 Mar 2010 13:42:06 +0000</pubDate>
		<dc:creator>Curt Monash</dc:creator>
				<category><![CDATA[Analytic technologies]]></category>
		<category><![CDATA[Data warehousing]]></category>
		<category><![CDATA[Investment research and trading]]></category>
		<category><![CDATA[Market share]]></category>
		<category><![CDATA[ParAccel]]></category>
		<category><![CDATA[Petabyte-scale data management]]></category>
		<category><![CDATA[Sybase]]></category>
		<category><![CDATA[Vertica Systems]]></category>

		<guid isPermaLink="false">http://www.dbms2.com/?p=1738</guid>
		<description><![CDATA[I caught up with Jerry Held (Chairman) and Dave Menninger (VP Marketing) of Vertica for a chat yesterday. The immediate reason for the call was that a competitor had tipped me off to the departure of Vertica CEO Ralph Breslauer, which of course raises a host of questions.  Highlights of the call included:

Vertica had [...]]]></description>
			<content:encoded><![CDATA[<p style="margin-bottom: 0in;">I caught up with Jerry Held (Chairman) and Dave Menninger (VP Marketing) of Vertica for a chat yesterday. The immediate reason for the call was that a competitor had tipped me off to the departure of Vertica CEO Ralph Breslauer, which of course raises a host of questions.  Highlights of the call included:</p>
<ul>
<li>Vertica had a “killer” Q4 and 	is doing very well in Q1 again.</li>
<li>Vertica burned hardly any cash 	last year; i.e., it was close to cash-flow neutral in 2009.</li>
<li>Vertica is hiring aggressively, 	e.g., in sales.</li>
<li>Vertica is well down the path with 	several CEO candidates who Jerry regards as outstanding. He is 	hopeful there will be a new CEO in April. (But I bet that would be 	late April, given what Jerry mentioned about his own travel plans.)</li>
<li>Absent a full-time CEO, Jerry and 	Andy Palmer are spending a lot more time with Vertica.</li>
<li>One Vertica customer is 	approaching a petabyte of user data. The last time Vertica had 	checked, that customer had been more in the ¼ petabyte range.</li>
<li>Other multi-hundred terabyte 	Vertica databases were mentioned, including one where Vertica claims 	to have beaten Teradata and perhaps other competitors in a 	head-to-head competition (it sounds like that one&#8217;s too recent to be 	deployed yet).</li>
<li>Vertica se<span style="font-style: normal;">es 	Aster and Greenplum competitively more often than it sees ParAccel.</span></li>
<li>Vertica sees 	Sybase IQ competitively a lot in financial services (in new-name 	accounts for Sybase as well as where some kind of Sybase DBMS is an 	incumbent), and more occasionally in other sectors.</li>
</ul>
<p style="margin-bottom: 0in; font-style: normal;">NDA parts of the conversation also gave me the impression that Vertica is moving forward just as eagerly as its peers. I.e., I didn&#8217;t uncover any reason to think that Ralph&#8217;s departure is a sign of trouble, of the company being shopped, etc.<span id="more-1738"></span></p>
<p style="margin-bottom: 0in; font-style: normal;">On the other hand, I didn&#8217;t uncover any other clear reason for Ralph&#8217;s departure either. The party line is that Ralph left for “personal reasons”.  It was his decision to leave. He did a great job while at Vertica.  Basically, Vertica is saying that, even though all was going swimmingly, Ralph just up and quit, leaving some very valuable unvested stock options on the table at what had been his first CEO gig ever.</p>
<p style="margin-bottom: 0in; font-style: normal;">Contacting Ralph didn&#8217;t add any immediate insight. He responded quickly via a Twitter DM, but was in the airport for a trip to his home country of South Africa, and deferred discussion until after his return.</p>
<p style="margin-bottom: 0in; font-style: normal;">Incidentally, <a href="http://www.dbms2.com/2010/03/19/infobright-blog-update/" >Infobright</a> and ParAccel have both also had recent CEO turnover. Stated reasons in each case were of the “Right person to lead the next stage of the company&#8217;s growth” variety.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.dbms2.com/2010/03/19/vertica-update-4/feed/</wfw:commentRss>
		<slash:comments>6</slash:comments>
		</item>
		<item>
		<title>There sure seem to be a lot of inaccuracies on ParAccel&#8217;s website</title>
		<link>http://www.dbms2.com/2010/01/15/there-sure-seem-to-be-a-lot-of-inaccuracies-on-paraccels-website/</link>
		<comments>http://www.dbms2.com/2010/01/15/there-sure-seem-to-be-a-lot-of-inaccuracies-on-paraccels-website/#comments</comments>
		<pubDate>Fri, 15 Jan 2010 04:47:00 +0000</pubDate>
		<dc:creator>Curt Monash</dc:creator>
				<category><![CDATA[Columnar database management]]></category>
		<category><![CDATA[Data warehousing]]></category>
		<category><![CDATA[Database compression]]></category>
		<category><![CDATA[Market share]]></category>
		<category><![CDATA[ParAccel]]></category>
		<category><![CDATA[Telecommunications]]></category>

		<guid isPermaLink="false">http://www.dbms2.com/?p=1393</guid>
		<description><![CDATA[In what is actually an interesting post on database compression, ParAccel CTO Barry Zane threw in
Anyone who has met with us knows ParAccel shies away from hype.
But like many things ParAccel says, that is not true.
The latest whoppers came in the form of several customers ParAccel listed on its website who hadn&#8217;t actually bought ParAccel&#8217;s [...]]]></description>
			<content:encoded><![CDATA[<p>In what is actually an <a href="http://paraccel.com/data_warehouse_blog/?p=192" onclick="javascript:pageTracker._trackPageview('/paraccel.com');">interesting post on database compression</a>, ParAccel CTO Barry Zane threw in</p>
<blockquote><p>Anyone who has met with us knows ParAccel shies away from hype.</p></blockquote>
<p>But like many things ParAccel says, that is not true.</p>
<p>The latest whoppers came in the form of several customers ParAccel listed on its website who hadn&#8217;t actually bought ParAccel&#8217;s DBMS, nor even decided to do so. It is fairly common to to claim a customer win, then retract the claim due to lack of permission to disclose. But that&#8217;s not what happened in these cases. Based on emails helpfully shared by a ParAccel competitor competing in some of those accounts, it seems clear that <strong>ParAccel actually posted fabricated claims of customer wins.</strong> <span id="more-1393"></span></p>
<p>Another thing that was both technically and substantively false was ParAccel&#8217;s claim to be <a href="http://www.dbms2.com/2009/09/30/facts-and-rumors/" >CERTIFIED price-performance leader</a>. Obviously, this was meant to give the impression that ParAccel had been &#8220;certified&#8221; as the leader in price/performance, when the closest thing to that that was remotely true was that ParAccel had a leading position in the category of &#8220;price/performance measurements that happen to have a certification process.&#8221; At least, that was true for a short time; then ParAccel&#8217;s certification was found to have been erroneous, and got revoked, which did not however inspire ParAccel to immediately take the claim off the front page of its website.</p>
<p>ParAccel&#8217;s website also reflects a lot of praise from flagship customer LatiNode. What it perhaps understandably neglects to mention is that LatiNode is in a <a href="http://www.pepperlaw.com/publications_update.aspx?ArticleKey=1651" onclick="javascript:pageTracker._trackPageview('/www.pepperlaw.com');">dormant state</a>, placed there by acquirer Elandia due to LatiNode&#8217;s criminally corrupt customer acquisition practices.</p>
<p>I also don&#8217;t believe ParAccel&#8217;s endlessly-repeated claim that is has never lost a benchmark on performance. However, I must in fairness note that while I&#8217;ve been given names of customers who are supposed counterexamples to this claim by somebody I trust, I&#8217;ve never been able to actually verify those supposed ParAccel losses.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.dbms2.com/2010/01/15/there-sure-seem-to-be-a-lot-of-inaccuracies-on-paraccels-website/feed/</wfw:commentRss>
		<slash:comments>23</slash:comments>
		</item>
		<item>
		<title>Facts and rumors</title>
		<link>http://www.dbms2.com/2009/09/30/facts-and-rumors/</link>
		<comments>http://www.dbms2.com/2009/09/30/facts-and-rumors/#comments</comments>
		<pubDate>Wed, 30 Sep 2009 06:21:44 +0000</pubDate>
		<dc:creator>Curt Monash</dc:creator>
				<category><![CDATA[Benchmarks and POCs]]></category>
		<category><![CDATA[DATAllegro]]></category>
		<category><![CDATA[Data warehouse appliances]]></category>
		<category><![CDATA[Data warehousing]]></category>
		<category><![CDATA[Exadata]]></category>
		<category><![CDATA[Market share]]></category>
		<category><![CDATA[Microsoft and SQL*Server]]></category>
		<category><![CDATA[Netezza]]></category>
		<category><![CDATA[Oracle]]></category>
		<category><![CDATA[ParAccel]]></category>
		<category><![CDATA[Petabyte-scale data management]]></category>
		<category><![CDATA[Specific users]]></category>
		<category><![CDATA[Telecommunications]]></category>

		<guid isPermaLink="false">http://www.dbms2.com/?p=963</guid>
		<description><![CDATA[
Vertica is putting out a press 	release today touting its 100th customer, and talking of triple 	digit growth last year.
Multiple sources have told me that 	the DATAllegro system is being thrown out of Dell, so evidently Dell is telling this to one and all. If that goes 	through, this would presumably leave TEOCO as DATAllegro&#8217;s [...]]]></description>
			<content:encoded><![CDATA[<ul>
<li>Vertica is putting out a press 	release today touting its 100th customer, and talking of triple 	digit growth last year.</li>
<li>Multiple sources have told me that 	the DATAllegro system is being thrown out of <a href="http://www.dbms2.com/2009/03/02/closing-the-book-on-the-datallegro-customer-base/" >Dell</a>, so evidently Dell is telling this to one and all. If that goes 	through, this would presumably leave <a href="http://www.dbms2.com/2008/05/23/data-warehouse-appliance-power-user-teoco/" >TEOCO</a> as DATAllegro&#8217;s single happy 	customer. (I haven&#8217;t checked with Microsoft for its view.)</li>
<li>A rumor has it that Infiniband 	technology vendor Voltaire, Ltd. privately claims triple-digit sales 	of switches for Exadata 1 (I think that one would be one switch per Exadata installation, not per rack). Based just on a quick glance, this is far from confirmed by 	Voltaire&#8217;s earnings <a href="http://seekingalpha.com/article/135775-voltaire-ltd-q1-2009-earnings-call-transcript" onclick="javascript:pageTracker._trackPageview('/seekingalpha.com');">conference 	call</a> <a href="http://seekingalpha.com/article/152278-voltaire-q2-2009-earnings-transcript" onclick="javascript:pageTracker._trackPageview('/seekingalpha.com');">transcripts</a> or <a href="http://sec.gov/cgi-bin/browse-edgar?action=getcompany&amp;CIK=0001401678&amp;owner=exclude&amp;count=40" onclick="javascript:pageTracker._trackPageview('/sec.gov');">SEC 	filings</a>. However, the most recent transcript does seem to 	indicate Voltaire got multiple Exadata deals in the 	telecommunications sector, and suggests some Exadata penetration in 	other sectors as well.</li>
<li>I was told of a 	classified-agency user that has &gt;1 petabyte of data on Exadata 1 	and 600 terabytes or so on Netezza. My not-obviously-biased source says 	the agency is distinctly happier with Netezza than Exadata.</li>
<li>Like <a href="http://paraccel.com/data_warehouse_blog/?p=104" onclick="javascript:pageTracker._trackPageview('/paraccel.com');">ParAccel</a>, 	<a href="http://www.theregister.co.uk/2009/09/29/tpc_slaps_oracle/" onclick="javascript:pageTracker._trackPageview('/www.theregister.co.uk');">Oracle 	just got dinged for TPC-related misbehavior</a>.</li>
<li>Rumor has it that Sun has no 	intention of helping ParAccel rerun its withdrawn TPC-H benchmark.</li>
<li>ParAccel has withdrawn the claim 	from its home page to be the &#8220;CERTIFIED&#8221; price-performance 	leader. This seems to confirm that the claim was a reference to the 	TPC-H. In my opinion, that was a gross misrepresentation of what the 	TPC-H shows.</li>
</ul>
<p style="margin-bottom: 0in;">
]]></content:encoded>
			<wfw:commentRss>http://www.dbms2.com/2009/09/30/facts-and-rumors/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>Progress in figuring out what ParAccel is doing</title>
		<link>http://www.dbms2.com/2009/07/08/progress-in-figuring-out-what-paraccel-is-doing/</link>
		<comments>http://www.dbms2.com/2009/07/08/progress-in-figuring-out-what-paraccel-is-doing/#comments</comments>
		<pubDate>Wed, 08 Jul 2009 23:46:05 +0000</pubDate>
		<dc:creator>Curt Monash</dc:creator>
				<category><![CDATA[Columnar database management]]></category>
		<category><![CDATA[Data warehousing]]></category>
		<category><![CDATA[ParAccel]]></category>

		<guid isPermaLink="false">http://www.dbms2.com/?p=835</guid>
		<description><![CDATA[(Oops: Thought I&#8217;d posted this before I went out for the afternoon &#8230;)
Barry Zane of ParAccel has &#8212; finally! &#8212; started a blog.  Barrry&#8217;s first post, probably in connection with ParAccel&#8217;s recent TPC-H submission and subsequent brouhaha, consisted mainly of metaphor + very elementary and well-known arguments for column stores. Barry&#8217;s second post, however, was [...]]]></description>
			<content:encoded><![CDATA[<p><em>(Oops: Thought I&#8217;d posted this before I went out for the afternoon &#8230;)</em></p>
<p>Barry Zane of ParAccel has &#8212; finally! &#8212; started a blog.  Barrry&#8217;s <a href="http://paraccel.com/data_warehouse_blog/?p=34" onclick="javascript:pageTracker._trackPageview('/paraccel.com');">first post</a>, probably in connection with ParAccel&#8217;s recent TPC-H submission and subsequent brouhaha, consisted mainly of metaphor + very elementary and well-known arguments for column stores. Barry&#8217;s <a href="http://paraccel.com/data_warehouse_blog/?p=57" onclick="javascript:pageTracker._trackPageview('/paraccel.com');">second post</a>, however, was in direct response to <a href="http://www.dbms2.com/2009/07/07/daniel-abadi-has-a-theory-about-paraccel/" >Daniel Abadi&#8217;s speculation about ParAccel&#8217;s architecture</a>.  That post also promises a follow-up addressing the TPC-H in a more substantive way.</p>
<p>Barry&#8217;s points include:</p>
<ul>
<li><strong>ParAccel never used the row-oriented Postgres execution engine.</strong> This is contrary to Daniel&#8217;s speculation.</li>
<li><strong>ParAccel previously used an adaption of the Postgres cost-based optimizer, but now has written a new one from scratch. </strong></li>
<li><strong>ParAccel has designed its optimizer to handle lots and lots of joins.</strong> One reason Barry offers is that ParAccel wants to run customers&#8217; old schemas unaltered, whether or not those are really optimal for the ParAccel DBMS.  That approach is somewhat in contrast to Vertica, which originally focused entirely on star schemas.   And it goes well with ParAccel&#8217;s interest in appealing to customers who at least think they want to run ParAccel in Oracle or SQL Server emulation mode.</li>
</ul>
<p>Also in the post, Barry:</p>
<ul>
<li>Makes an extremely silly marketing exaggeration by referring to &#8221; the only other vendor that was <em>able</em> to run the 30TB TPC-H&#8221; (emphasis mine).</li>
<li>Makes the more excusable marketing exaggeration &#8220;Publishing the benchmark with unmatched performance is simply one way to demonstrate robustness and flexibility.  Nothing more, nothing less.&#8221;</li>
<li>Makes the very clear marketing claim &#8220;For customers, the real test will be their own bake-offs, where our performance has <em>never</em> been beaten.&#8221; (Emphasis mine.) That last one directly contradicts what I&#8217;ve been told by at least two ParAccel competitors, so I&#8217;ll be curious to see what they come up with to substantiate their version of the story.</li>
</ul>
<p>Anyhow, it&#8217;s great to see ParAccel retreating from its obsessive secrecy, which in my opinion has been even worse than <a href="http://www.dbms2.com/2006/09/20/dealing-with-netezza-has-not-been-easy/" >Netezza&#8217;s</a> used to be.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.dbms2.com/2009/07/08/progress-in-figuring-out-what-paraccel-is-doing/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>Daniel Abadi has a theory about ParAccel</title>
		<link>http://www.dbms2.com/2009/07/07/daniel-abadi-has-a-theory-about-paraccel/</link>
		<comments>http://www.dbms2.com/2009/07/07/daniel-abadi-has-a-theory-about-paraccel/#comments</comments>
		<pubDate>Tue, 07 Jul 2009 22:46:19 +0000</pubDate>
		<dc:creator>Curt Monash</dc:creator>
				<category><![CDATA[Benchmarks and POCs]]></category>
		<category><![CDATA[Columnar database management]]></category>
		<category><![CDATA[Data warehousing]]></category>
		<category><![CDATA[ParAccel]]></category>
		<category><![CDATA[Theory and architecture]]></category>

		<guid isPermaLink="false">http://www.dbms2.com/?p=833</guid>
		<description><![CDATA[When I was at SIGMOD last week, ParAccel and its SIGMOD talk were mentioned several times, always in puzzled and at least slightly unflattering terms.  (Typical comment: &#8220;Why did they present a paper about that? We were doing the same thing in our company years ago.&#8221;) That doesn&#8217;t prove much per se, since most of [...]]]></description>
			<content:encoded><![CDATA[<p>When I was at SIGMOD last week, ParAccel and its SIGMOD talk were mentioned several times, always in puzzled and at least slightly unflattering terms.  (Typical comment: &#8220;Why did they present a paper about that? We were doing the same thing in our company years ago.&#8221;) That doesn&#8217;t prove much <em>per se,</em> since most of the mentions were by competitors and/or Vertica-affiliated academics, and since my own <a href="http://www.dbms2.com/2009/06/22/the-tpc-h-benchmark-is-a-blight-upon-the-industry/" >unflattering ParAccel-related comments</a> were rather fresh at the time.</p>
<p>But now Daniel Abadi has done <a href="http://dbmsmusings.blogspot.com/2009/07/paraccel-and-their-puzzling-tpc-h.html" onclick="javascript:pageTracker._trackPageview('/dbmsmusings.blogspot.com');">a brilliant, detailed, speculative analysis of ParAccel&#8217;s publications</a>.  Here&#8217;s the meat, emphasis mine:<span id="more-833"></span></p>
<blockquote><p>(1) Why did they configure their TPC-H application with such a high amount of disk I/O throughput capabilty when they are a column-store? (Stonebraker&#8217;s question)<br />
(2) <strong>Why did queries spend seemingly 6X more time doing I/O than a column-store should have to do?</strong><br />
(3) Why are they worried about queries with thousands of joins?<br />
(4) <strong>Why do they think TPC-H/TPC-DS queries have 42 joins?</strong><br />
<strong><br />
And then a theory that answers all four questions at the same time came to me.</strong> Perhaps ParAccel directly followed my advice (see option 1) on &#8220;<a href="http://cs-www.cs.yale.edu/homes/dna/talks/abadi-nedbday.pdf" onclick="javascript:pageTracker._trackPageview('/cs-www.cs.yale.edu');">How to create a new column-store DBMS product in a week</a>&#8220;. <strong>They&#8217;re not a column-store. They&#8217;re a vertically partitioned row-store</strong> (this is how column-stores were built back in the 70s before we knew any better). Each column is stored in its own separate table inside the row-store (PostgreSQL in ParAccel&#8217;s case). Queries over the original schema are then automatically rewritten into queries over the vertically partitioned schema and the row-store&#8217;s regular query execution engine can be used unmodified. But now, <strong>every attribute accessed by the query now adds an additional join to the query plan</strong> (since the vertical partitions for each column in a table have to be joined together).</p>
<p>This immediately explains why they are worried about queries with hundreds to thousands of joins (questions 3 and 4). But it also explains why they seem to be doing much more I/O than a native column-store. <strong>Since each vertical partition is its own table, then each tuple in a vertical partition (which contains just one value) is preceded by the row-store&#8217;s tuple header.</strong> In PostgreSQL this tuple header is on the order of 27 bytes. So <strong>if the column width is 4 bytes, then there is a factor of 7 extra space used up for the tuple header relative to actual user data.</strong> And if the implementation is super naive, they also will need an additional 4 bytes to store a tuple identifier for joining vertical partitions from the same original table with each other. This answers questions 1 and 2, as <strong>the factor of 6 worse I/O efficiency is now obvious.</strong></p></blockquote>
<p>It will be interesting to see whether ParAccel comments, but even it does, I wouldn&#8217;t necessarily take ParAccel&#8217;s statements as dispositive.  For example &#8212; and illustrative of my view of ParAccel&#8217;s trustworthiness &#8212; I believe ParAccel&#8217;s competition who tell me that ParAccel&#8217;s claim to have won or at least tied all POCs on performance is flat-out untrue.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.dbms2.com/2009/07/07/daniel-abadi-has-a-theory-about-paraccel/feed/</wfw:commentRss>
		<slash:comments>28</slash:comments>
		</item>
	</channel>
</rss>
