<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>DBMS 2 : DataBase Management System Services &#187; Greenplum</title>
	<atom:link href="http://www.dbms2.com/category/products-and-vendors/greenplum/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.dbms2.com</link>
	<description>Choices in data management and analysis</description>
	<lastBuildDate>Thu, 02 Sep 2010 09:06:44 +0000</lastBuildDate>
	<generator>http://wordpress.org/?v=2.8.4</generator>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
			<item>
		<title>Links and observations</title>
		<link>http://www.dbms2.com/2010/08/09/links-and-observations/</link>
		<comments>http://www.dbms2.com/2010/08/09/links-and-observations/#comments</comments>
		<pubDate>Tue, 10 Aug 2010 02:37:51 +0000</pubDate>
		<dc:creator>Curt Monash</dc:creator>
				<category><![CDATA[Analytic technologies]]></category>
		<category><![CDATA[Aster Data]]></category>
		<category><![CDATA[Calpont]]></category>
		<category><![CDATA[Cassandra]]></category>
		<category><![CDATA[Data warehouse appliances]]></category>
		<category><![CDATA[Data warehousing]]></category>
		<category><![CDATA[EMC]]></category>
		<category><![CDATA[Exadata]]></category>
		<category><![CDATA[Facebook]]></category>
		<category><![CDATA[Greenplum]]></category>
		<category><![CDATA[HP and Neoview]]></category>
		<category><![CDATA[Kickfire]]></category>
		<category><![CDATA[NoSQL]]></category>
		<category><![CDATA[Northscale]]></category>
		<category><![CDATA[OLTP]]></category>
		<category><![CDATA[ParAccel]]></category>
		<category><![CDATA[Sybase]]></category>
		<category><![CDATA[XtremeData]]></category>

		<guid isPermaLink="false">http://www.dbms2.com/?p=2743</guid>
		<description><![CDATA[I&#8217;m back from a trip to the SF Bay area, with a lot of writing ahead of me. I&#8217;ll dive in with some quick comments here, then write at greater length about some of these points when I can. From my trip:  

Aster Data showed me a lot of customer names and deal sizes, across [...]]]></description>
			<content:encoded><![CDATA[<p>I&#8217;m back from a trip to the SF Bay area, with a lot of writing ahead of me. I&#8217;ll dive in with some quick comments here, then write at greater length about some of these points when I can. From my trip:  <span id="more-2743"></span></p>
<ul>
<li>Aster Data showed me a lot of customer names and deal sizes, across a bunch of industries (mainly enterprise rather than web). Yes, Aster&#8217;s market success is for real. (But almost all those details are NDA.)</li>
<li>Sybase&#8217;s product plans for IQ are pretty impressive. (But the most interesting parts are, you guessed it, NDA.)</li>
<li>I&#8217;ve kissed and made up* with ParAccel, now that they&#8217;ve replaced their CEO, replaced their marketing chief, and stopped the worst of the <a href="http://www.dbms2.com/2010/01/15/there-sure-seem-to-be-a-lot-of-inaccuracies-on-paraccels-website/" >marketing</a> <a href="http://www.dbms2.com/2009/06/22/the-tpc-h-benchmark-is-a-blight-upon-the-industry/" >nonsense</a> I used to complain about. ParAccel has some interesting plans for ParAccel 3.0 which are, naturally, NDA.</li>
<li>The Peoplesoft guys are doing it over again at Workday. Only this time, their platform isn&#8217;t a relational DBMS. Rather, it&#8217;s an in-memory, completely object-oriented data model, with disk used only on a &#8220;Just in case the power ever goes out&#8221; basis. (Thankfully, nothing at all about our conversation was NDA.)</li>
<li>I&#8217;m finally feeling good about <a href="# I spent considerable time  with my clients at both Greenplum and EMC (if we ignore the fact that  the deal has closed and they're now the same company). I also had more  of  a hardcore engineering discussion than I've had with Greenplum for  quite a while (I should have been pushier about that earlier). Takeaways  included:      * This is starting off as a honeymoon deal. Everything  Greenplum was planning to do is being continued. Additional resources  are being poured into Greenplum to do more.     * Some Greenplum execs  seem to envision staying long term, some seem to envision moving on to  their next startups. The ones who envision moving on are, however, going  to work hard first to make the merger a success.     * Greenplum has,  for quite a while, had more of an advanced analytics/embedded predictive  modeling story than I realized. Bad on them for not fleshing it out  more in marketing and product packaging alike.     * Greenplum both  denies the concurrency problems I previously noted and also has a very  credible story as to how it will eliminate them. :) Seriously, Greenplum  tells of one customer that routinely runs 150 simultaneously queries -  on what I think is not a terribly big system -- and a number of POCs  (Proofs of Concept) that simulated similar levels of concurrency.">Northscale&#8217;s  memcached-compatible persistent store Membase</a>. The main reason is  that they showed me a near-term path to interfaces that are richer than  key-value. Also, Todd Hoff reassured me that even pure persistent  memcached has a place.</li>
<li>Rumor says that even the one app for which Facebook was using Cassandra &#8212; in-box search &#8212; has been decommissioned. On the other hand, numerous other scale-0ut DBMS (SQL or otherwise) seem to have Facebook footholds. But details are &#8212; all together now! &#8212; NDA.</li>
</ul>
<p><em>*If you know ParAccel&#8217;s new marketing chief Michael Weir, you  surely guessed I mean that only in a figurative sense.</em></p>
<p>From elsewhere:</p>
<ul>
<li>Daniel Abadi offered <a href="http://dbmsmusings.blogspot.com/2010/08/thoughts-on-kickfires-apparent-demise.html" onclick="javascript:pageTracker._trackPageview('/dbmsmusings.blogspot.com');">his  analysis</a> of <a href="../2010/07/27/kickfire-unlikely-to-survive/">Kickfire&#8217;s  demise</a>. In general I agree, but Daniel neglected to mention one  hugely important factor &#8212; the chicken-egg negative effect of Kickfire&#8217;s  lack of market or marketing traction. Customers were extremely reluctant to buy from Kickfire  because they perceived, correctly, that Kickfire&#8217;s survivability was far  from assured.</li>
<li>While the <a href="http://infinidb.org/community/forums/11-general-infinidb/1000-strange-issue-with-drop-table" onclick="javascript:pageTracker._trackPageview('/infinidb.org');">InfiniDB forums</a> suggest that there are at least a couple of production users of Calpont&#8217;s free InfiniDB, Calpont seemingly has a long way to go to be even as successful as Kickfire. But Calpont does have a bit of money to spend on lead generation; maybe some day they&#8217;ll even have actual customers.</li>
<li>In a response to a question I messaged over, <a href="http://www.dbms2.com/2010/03/18/xtremedata-update/" >XtremeData</a> tells me they have actual customers now. Press releases to follow.</li>
<li>The <a href="http://news.cnet.com/8301-31021_3-20013111-260.html?part=rss&amp;subj=news&amp;tag=2547-1_3-0-20" onclick="javascript:pageTracker._trackPageview('/news.cnet.com');">admiration for the job Mark Hurd did at HP</a> is in my opinion overstated. Sure, the financial/operational management appeared to work, but HP did little on Hurd&#8217;s watch to strengthen its reputation or customers&#8217; loyalty. In particular:
<ul>
<li>HP&#8217;s analytics efforts have accomplished little.</li>
<li>HP&#8217;s data warehouse appliance efforts have failed pathetically.</li>
<li>From what I hear, HP&#8217;s execution in its Exadata partnership was not good.</li>
<li>HP&#8217;s server business in general is distinguished mainly by HP being a big company.</li>
<li>HP&#8217;s EDS acquisition has been rocky, not that EDS was sailing so smoothly on its own beforehand.</li>
<li>HP&#8217;s success in PCs amounts to &#8220;arguably, HP sucks a little less than the other guys&#8221;.</li>
<li>HP&#8217;s elite reputation is long gone (admittedly, for the most part that predates Hurd).</li>
</ul>
</li>
<li><a href="http://intelligent-enterprise.informationweek.com/blog/archives/2010/08/software_innova.html" onclick="javascript:pageTracker._trackPageview('/intelligent-enterprise.informationweek.com');">Doug Henschen</a> evidently favors really strong intellectual property protection for software, even forbidding plug-compatible reverse engineering. I agree with Doug up to the point that <a href="http://www.monashreport.com/2010/07/19/my-view-of-intellectual-property/" onclick="javascript:pageTracker._trackPageview('/www.monashreport.com');">it should be forbidden to copy proprietary software</a>, but I don&#8217;t see why he (or a court) would view such behavior as copying.</li>
</ul>
]]></content:encoded>
			<wfw:commentRss>http://www.dbms2.com/2010/08/09/links-and-observations/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Notes on EMC&#8217;s Greenplum subsidiary</title>
		<link>http://www.dbms2.com/2010/08/09/emc-greenplum/</link>
		<comments>http://www.dbms2.com/2010/08/09/emc-greenplum/#comments</comments>
		<pubDate>Tue, 10 Aug 2010 00:02:17 +0000</pubDate>
		<dc:creator>Curt Monash</dc:creator>
				<category><![CDATA[Analytic technologies]]></category>
		<category><![CDATA[Data warehousing]]></category>
		<category><![CDATA[EMC]]></category>
		<category><![CDATA[Greenplum]]></category>

		<guid isPermaLink="false">http://www.dbms2.com/?p=2744</guid>
		<description><![CDATA[I spent considerable time last week with my clients at both Greenplum and EMC  (if we ignore the fact that the deal has closed and they&#8217;re now the same  company). I also had more of  a hardcore engineering discussion than  I&#8217;ve had with Greenplum for quite a while (I should have been [...]]]></description>
			<content:encoded><![CDATA[<p>I spent considerable time last week with my clients at both Greenplum and EMC  (if we ignore the fact that the deal has closed and they&#8217;re now the same  company). I also had more of  a hardcore engineering discussion than  I&#8217;ve had with Greenplum for quite a while (I should have been pushier  about that earlier). Takeaways included:</p>
<ul>
<li>This is starting off as a honeymoon deal. Everything Greenplum was  planning to do is being continued. Additional resources are being  poured into Greenplum to do more.</li>
<li>Some Greenplum execs seem to envision staying long term, some seem  to envision moving on to their next startups. The ones who envision  moving on are, however, going to work hard first to make the merger a  success.</li>
<li>Greenplum has, for quite a while, had more of an advanced  analytics/embedded predictive modeling story than I realized. Bad on  them for not fleshing it out more in marketing and product packaging  alike.</li>
<li>Greenplum both denies the <a href="http://www.dbms2.com/2010/07/06/emc-is-buying-greenplum/" >concurrency  problems</a> I previously noted and also has a very credible story as  to how it will eliminate them. <img src='http://www.dbms2.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  Seriously, Greenplum tells of one  customer that routinely runs 150 simultaneous queries &#8211; on what I  think is not a terribly big system &#8212; and a number of POCs (Proofs of  Concept) that simulated similar levels of concurrency.</li>
</ul>
]]></content:encoded>
			<wfw:commentRss>http://www.dbms2.com/2010/08/09/emc-greenplum/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>More on Greenplum and EMC</title>
		<link>http://www.dbms2.com/2010/07/07/more-on-greenplum-and-emc/</link>
		<comments>http://www.dbms2.com/2010/07/07/more-on-greenplum-and-emc/#comments</comments>
		<pubDate>Wed, 07 Jul 2010 19:15:21 +0000</pubDate>
		<dc:creator>Curt Monash</dc:creator>
				<category><![CDATA[Data warehouse appliances]]></category>
		<category><![CDATA[Data warehousing]]></category>
		<category><![CDATA[EMC]]></category>
		<category><![CDATA[Greenplum]]></category>
		<category><![CDATA[Market share]]></category>

		<guid isPermaLink="false">http://www.dbms2.com/?p=2534</guid>
		<description><![CDATA[I talked with Ben Werther of Greenplum for about 40 minutes, which was my first post-merger Greenplum/EMC briefing. &#8220;Historical&#8221; highlights include:

Ben says Greenplum wasn&#8217;t being shopped, by which he means Greenplum was out raising more capital and the fund-raising was going well.  Note: Half or so of Greenplum&#8217;s deals were subscription-priced, so it had weaker [...]]]></description>
			<content:encoded><![CDATA[<p>I talked with Ben Werther of Greenplum for about 40 minutes, which was my first post-merger Greenplum/EMC briefing. &#8220;Historical&#8221; highlights include:</p>
<ul>
<li>Ben says Greenplum wasn&#8217;t being shopped, by which he means Greenplum was out raising more capital and the fund-raising was going well.  <em>Note: <a href="http://www.dbms2.com/2009/10/18/greenplum-customer-notes/" >Half or so of Greenplum&#8217;s deals were subscription-priced</a>, so it had weaker cash flow than it would have if it were doing equally well selling perpetual licenses.</em></li>
<li>However, joint engineering was also going well with, e.g., Greenplum CTO Luke Lonergan spending time at EMC facilities in Cork, Ireland. And one thing led to another &#8230;</li>
<li>Greenplum has ~ 140 customers, vs. <a href="http://www.dbms2.com/2009/06/05/greenplum-update-release-3-3/" >~65 five quarters ago</a>, 100+ at year-end, and an acquisition rate of 12-15/quarter last fall.</li>
<li>A typical &#8220;small&#8221; paying customer for Greenplum starts with 10-20 TB of data.</li>
<li><a href="http://www.dbms2.com/2010/04/12/greenplumchorus/" >Greenplum Chorus</a> isn&#8217;t generally available yet, with rollout energy being focused on Greenplum 4.0. <em>Note: As important as it is for overall industry direction, Greenplum Chorus is a product which won&#8217;t be a terribly big deal in Release 1 anyway.</em></li>
</ul>
<p>Highlights looking forward include:  <span id="more-2534"></span></p>
<ul>
<li>When I challenged him, Ben sounded quite optimistic that Pat Gelsinger will immunize Greenplum against and generally counteract some of EMC&#8217;s traditionally stifling bureaucracy. (My words, of course, not his.)</li>
<li>The initial Greenplum/EMC product vision appears truly centered around &#8220;private cloud,&#8221; specifically including Greenplum, VMware, and EMC storage arrays.</li>
<li>Some other areas of potential Greenplum/EMC technical synergy I think are cool obviously haven&#8217;t been seriously addressed yet.</li>
<li>Based on what I heard from Ben about the aura around the deal and also on what I know of the individual executives at Greenplum, I think each of them is a good bet to stick around EMC for a while. (That&#8217;s on average. Of course, it would be surprising if 100% of them stayed around very long.) Basically, there&#8217;s at least a chance EMC/Greenplum will do some pretty cool stuff, and most of the guys will probably stick around to see if that actually starts to happen.*</li>
</ul>
<p><em>*Also, when they do eventually leave, they&#8217;ll surely say things to the effect &#8220;The cool stuff is well underway; my work here is done.&#8221; That party line is almost guaranteed, no matter how things unfold in reality. <img src='http://www.dbms2.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' /> </em></p>
]]></content:encoded>
			<wfw:commentRss>http://www.dbms2.com/2010/07/07/more-on-greenplum-and-emc/feed/</wfw:commentRss>
		<slash:comments>6</slash:comments>
		</item>
		<item>
		<title>EMC is buying Greenplum</title>
		<link>http://www.dbms2.com/2010/07/06/emc-is-buying-greenplum/</link>
		<comments>http://www.dbms2.com/2010/07/06/emc-is-buying-greenplum/#comments</comments>
		<pubDate>Tue, 06 Jul 2010 22:53:52 +0000</pubDate>
		<dc:creator>Curt Monash</dc:creator>
				<category><![CDATA[Data warehouse appliances]]></category>
		<category><![CDATA[EMC]]></category>
		<category><![CDATA[Greenplum]]></category>
		<category><![CDATA[Storage]]></category>

		<guid isPermaLink="false">http://www.dbms2.com/?p=2498</guid>
		<description><![CDATA[EMC is buying Greenplum. Most of the press release is a general recapitulation of Greenplum&#8217;s marketing messages, the main exceptions being (emphasis mine):
The acquisition of Greenplum will be an all-cash transaction and is expected to be completed in the third quarter of 2010, subject to customary closing conditions and regulatory approvals. The acquisition is not [...]]]></description>
			<content:encoded><![CDATA[<p>EMC is buying Greenplum. Most of the <a href="http://www.emc.com/about/news/press/2010/20100706-01.htm" onclick="javascript:pageTracker._trackPageview('/www.emc.com');">press release</a> is a general recapitulation of Greenplum&#8217;s marketing messages, the main exceptions being (emphasis mine):</p>
<blockquote><p>The acquisition of Greenplum will be an all-cash transaction and is <strong>expected to be completed in the third quarter of 2010,</strong> subject to customary closing conditions and regulatory approvals. The acquisition is not expected to have a material impact to EMC GAAP and non-GAAP EPS for the full 2010 fiscal year. Upon close, Bill Cook will lead the new data computing product division and report to Pat Gelsinger. <strong>EMC will continue to offer Greenplum&#8217;s full product portfolio to customers and plans to deliver new EMC Proven reference architectures as well as an integrated hardware and software offering</strong> designed to improve performance and drive down implementation costs.</p></blockquote>
<p>Greenplum is one of my biggest vendor clients, and EMC is just becoming one, but of course neither side gave me a heads-up before the deal happened, nor have I yet been briefed subsequently. With those disclaimers out of the way, some of my early thoughts include:</p>
<ul>
<li>I wish my clients would never buy each other, but it&#8217;s inevitable.</li>
<li>I don&#8217;t think anybody evaluating Greenplum should be much influenced by this deal one way or the other. (Whether they will be is of course a different matter.)
<ul>
<li>EMC tends to run its bigger software acquisitions in a fairly hands-off manner. There&#8217;s no particular FUD (Fear/Uncertainty/Doubt) reason why this deal should stop anybody from buying Greenplum software.</li>
<li>I also don&#8217;t think adding a rich parent adds much of a reason to buy from Greenplum. But if you&#8217;re the type who&#8217;s nervous about smaller vendors &#8212; well, Greenplum now isn&#8217;t so small.</li>
</ul>
</li>
<li><a href="http://www.dbms2.com/2010/04/12/greenplumchorus/" >Greenplum Chorus</a> could, in principle, work with non-Greenplum DBMS. That possibility suddenly looks a lot more realistic.</li>
<li>The list of analytic DBMS vendors with an appliance orientation is pretty impressive, including:
<ul>
<li>Oracle, with <a href="http://www.dbms2.com/2010/01/22/oracle-database-hardware-strategy/" >Exadata</a></li>
<li>Microsoft, partially</li>
<li>Teradata</li>
<li>Netezza</li>
<li>Now EMC/Greenplum, at least partially</li>
<li>Weaker players such as:
<ul>
<li>The <a href="http://www.dbms2.com/2010/06/11/kickfire-update-2/" >ailing Kickfire</a>, which a client (not Kickfire itself) tells me is being shopped around</li>
<li>The <a href="http://www.dbms2.com/2010/03/19/some-business-trends-in-the-data-warehouse-market/" >reeling HP Neoview</a></li>
<li>XtremeData, but I&#8217;m still waiting to hear of<a href="http://www.dbms2.com/2010/03/18/xtremedata-update/" > XtremeData&#8217;s first real sale</a></li>
</ul>
</li>
</ul>
</li>
<li>Greenplum is something of a specialist in <a href="http://www.dbms2.com/2009/04/30/ebays-two-enormous-data-warehouses/" >large databases</a>. EMC has to love that.</li>
<li>Greenplum&#8217;s weakness is concurrency.</li>
<li><a href="http://www.dbms2.com/2009/10/14/greenplum-hybrid-columnar/" >Greenplum&#8217;s &#8220;polymorphic storage&#8221;</a> is a good fit for a storage vendor with appliance-y ideas.</li>
<li>And finally &#8212; I think that even software-only analytic DBMS vendors should design their systems in an increasingly storage-aware manner, and have been advising my vendor clients of same. I&#8217;ll blog that line of reasoning separately when I get a chance, and edit in a link here after I do.</li>
</ul>
<p><em><strong>Related links (edit)</strong></em></p>
<ul>
<li>Here&#8217;s the promised post as to <a href="http://www.dbms2.com/2010/07/07/analytic-database-storage-aware/" >why analytic DBMS need to be ever more storage-aware</a>.</li>
<li><a href="http://www.kellblog.com/2010/07/06/emc-acquires-data-warehouse-vendor-greenplum-as-cornerstone-of-new-data-computing-product-division/" onclick="javascript:pageTracker._trackPageview('/www.kellblog.com');">Dave Kellogg crunched the EMC/Greenplum numbers</a>, coming up with an estimated valuation range of $3-400 million, the high end of which is rumored to be correct.</li>
<li>Merv Adrian suggests <a href="http://mervadrian.wordpress.com/2010/07/06/emc-buys-greenplum-big-data-realignment-continues/#more-2890" onclick="javascript:pageTracker._trackPageview('/mervadrian.wordpress.com');">the big EMC/Greenplum loser is ParAccel</a>, a viewpoint which presumably presupposes that the EMC/ParAccel partnership was significant in the first place.</li>
<li>I talked with Ben Werther and posted <a href="http://www.dbms2.com/2010/07/07/more-on-greenplum-and-emc/" >more about Greenplum and EMC</a>.</li>
</ul>
]]></content:encoded>
			<wfw:commentRss>http://www.dbms2.com/2010/07/06/emc-is-buying-greenplum/feed/</wfw:commentRss>
		<slash:comments>11</slash:comments>
		</item>
		<item>
		<title>Greenplum et alia&#8217;s BigDataNews.com site</title>
		<link>http://www.dbms2.com/2010/04/18/greenplum-et-alias-bigdatanews-com-site/</link>
		<comments>http://www.dbms2.com/2010/04/18/greenplum-et-alias-bigdatanews-com-site/#comments</comments>
		<pubDate>Sun, 18 Apr 2010 21:15:12 +0000</pubDate>
		<dc:creator>Curt Monash</dc:creator>
				<category><![CDATA[Analytic technologies]]></category>
		<category><![CDATA[Data warehousing]]></category>
		<category><![CDATA[Greenplum]]></category>
		<category><![CDATA[NoSQL]]></category>

		<guid isPermaLink="false">http://www.dbms2.com/?p=1927</guid>
		<description><![CDATA[Greenplum recently started a website BigDataNews.com, and quickly signed up Aster Data as a co-sponsor. (Edit: As per a comment below, the decision to sign up additional sponsors was made by the site&#8217;s independent publisher.) It&#8217;s actually being run by Brett Sheppard, a former Gartner/DataQuest analyst who now gets involved in this kind of thing. [...]]]></description>
			<content:encoded><![CDATA[<p>Greenplum recently started a website <a href="http://www.bigdatanews.com/" onclick="javascript:pageTracker._trackPageview('/www.bigdatanews.com');">BigDataNews.com</a>, and quickly signed up Aster Data as a co-sponsor. <em>(Edit: As per a comment below, the decision to sign up additional sponsors was made by the site&#8217;s independent publisher.)</em> It&#8217;s actually being run by Brett Sheppard, a former Gartner/DataQuest analyst who now gets involved in this kind of thing. (Brett and I may be working on another project soon, with Greenplum funding.)</p>
<p>The heart of the site is feeds* from a variety of high-profile blogs (<em>DBMS2,</em> Daniel Abadi&#8217;s, Joe Hellerstein&#8217;s, James Kobelius&#8217;, et al.), plus some additional posts written by Brett (primarily) or Greenplum folks. Highlights of Brett&#8217;s posts include:</p>
<ul>
<li>What I am told was an unauthorized revelation that <a href="http://bigdatanews.com/content/greenplum-chorus-combining-self-service-provisioning-analytics-collaboration" onclick="javascript:pageTracker._trackPageview('/bigdatanews.com');">Greenplum Chorus is built on CouchDB and Erlang</a>.</li>
<li>An impassioned defense of <a href="http://bigdatanews.com/content/gartner-reference-or-rent" onclick="javascript:pageTracker._trackPageview('/bigdatanews.com');">the integrity of Gartner&#8217;s analysis</a>.</li>
</ul>
<p><em>*At least in my case, that&#8217;s just a post title or snippet, plus a link back to the main post. The same goes for <a href="http://www.dbms2.com/2010/04/18/aster-mapreduce-or/" >mapreduce.org</a>, actually.</em></p>
]]></content:encoded>
			<wfw:commentRss>http://www.dbms2.com/2010/04/18/greenplum-et-alias-bigdatanews-com-site/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>Story of an analytic DBMS evaluation</title>
		<link>http://www.dbms2.com/2010/04/16/story-of-an-analytic-dbms-evaluation/</link>
		<comments>http://www.dbms2.com/2010/04/16/story-of-an-analytic-dbms-evaluation/#comments</comments>
		<pubDate>Sat, 17 Apr 2010 02:56:56 +0000</pubDate>
		<dc:creator>Curt Monash</dc:creator>
				<category><![CDATA[Analytic technologies]]></category>
		<category><![CDATA[Benchmarks and POCs]]></category>
		<category><![CDATA[Buying processes]]></category>
		<category><![CDATA[Data warehousing]]></category>
		<category><![CDATA[Greenplum]]></category>
		<category><![CDATA[Netezza]]></category>
		<category><![CDATA[Oracle]]></category>
		<category><![CDATA[ParAccel]]></category>
		<category><![CDATA[Vertica Systems]]></category>

		<guid isPermaLink="false">http://www.dbms2.com/?p=1900</guid>
		<description><![CDATA[One of our readers was kind enough to walk me through his analytic DBMS evaluation process. The story is:

The X Company (XCo) has a &#60;1 TB 	database.
100s of XCo&#8217;s customers log in at 	once to run reports. 50-200 concurrent queries is a good target 	number.
XCo had been “suffering” with 	Oracle and wanted to upgrade.
XCo didn&#8217;t [...]]]></description>
			<content:encoded><![CDATA[<p style="margin-bottom: 0in;">One of our readers was kind enough to walk me through his analytic DBMS evaluation process. The story is:</p>
<ul>
<li>The X Company (XCo) has a &lt;1 TB 	database.</li>
<li>100s of XCo&#8217;s customers log in at 	once to run reports. 50-200 concurrent queries is a good target 	number.</li>
<li>XCo had been “suffering” with 	Oracle and wanted to upgrade.</li>
<li>XCo didn&#8217;t have a lot of money to 	spend. <strong>Netezza</strong> pulled out of the sales cycle early due to 	budget (and this was recently enough that Netezza <a href="http://www.dbms2.com/2010/01/25/netezza-skimmer/" >Skimmer</a> could have been bid).</li>
<li><strong>Greenplum</strong> didn&#8217;t offer any 	references that approached the desired number of concurrent users.</li>
<li>Ultimately the evaluation came 	down to <strong>Vertica</strong> and <strong>ParAccel.</strong></li>
<li><strong>Vertica won.</strong></li>
</ul>
<p style="margin-bottom: 0in; font-weight: normal;">Notes on the Vertica vs. ParAccel selection include:<span id="more-1900"></span></p>
<ul>
<li>ParAccel sent an engineer on-site 	to do a proof-of-concept (POC), and generally competed very hard for 	the deal.</li>
<li>Vertica dropped by for a sales 	call once, and let XCo do the Vertica POC itself.</li>
<li>Not surprisingly, XCo got the 	impression that Vertica was easier to set up and administer than 	ParAccel.</li>
<li>Also, when ParAccel emphasized 	architectural features such as custom “backplane” and compiled 	queries, XCo got the impression – right or wrong – that 	ParAccel&#8217;s performance was more brittle or situational than 	Vertica&#8217;s.</li>
<li>ParAccel was modestly faster than 	Vertica in the POC. (I think &#8212; Vertica&#8217;s numbers were described as being &#8220;very competitive.&#8221;)</li>
<li>In multiple ways, Vertica gave the 	impression of greater product and vendor maturity than ParAccel.</li>
</ul>
<p style="margin-bottom: 0in;">My contact continues to be interested in all things Greenplum, and has recommended <a href="http://www.dbms2.com/2009/10/19/greenplum-free-single-node-edition/" >Greenplum Single-Node Edition</a> to his analyst colleagues.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.dbms2.com/2010/04/16/story-of-an-analytic-dbms-evaluation/feed/</wfw:commentRss>
		<slash:comments>7</slash:comments>
		</item>
		<item>
		<title>Greenplum Chorus and Greenplum 4.0</title>
		<link>http://www.dbms2.com/2010/04/12/greenplumchorus/</link>
		<comments>http://www.dbms2.com/2010/04/12/greenplumchorus/#comments</comments>
		<pubDate>Mon, 12 Apr 2010 11:54:39 +0000</pubDate>
		<dc:creator>Curt Monash</dc:creator>
				<category><![CDATA[Analytic technologies]]></category>
		<category><![CDATA[Benchmarks and POCs]]></category>
		<category><![CDATA[Data integration and middleware]]></category>
		<category><![CDATA[Data warehousing]]></category>
		<category><![CDATA[EAI, EII, ETL, ELT, ETLT]]></category>
		<category><![CDATA[Greenplum]]></category>
		<category><![CDATA[Market share]]></category>
		<category><![CDATA[Petabyte-scale data management]]></category>
		<category><![CDATA[Specific users]]></category>
		<category><![CDATA[Telecommunications]]></category>
		<category><![CDATA[Theory and architecture]]></category>

		<guid isPermaLink="false">http://www.dbms2.com/?p=1887</guid>
		<description><![CDATA[Greenplum is making two product announcements this morning. Greenplum 4.0 is a revision of the core Greenplum database technology. In addition, Greenplum is announcing Greenplum Chorus, which is the first product release instantiating last year&#8217;s EDC (Enterprise Data Cloud) vision statement and marketing campaign.
Greenplum 4.0 highlights and related observations include:

For the most part, Greenplum 	4.0 [...]]]></description>
			<content:encoded><![CDATA[<p style="margin-bottom: 0in;">Greenplum is making two product announcements this morning. Greenplum 4.0 is a revision of the core Greenplum database technology. In addition, Greenplum is announcing Greenplum Chorus, which is the first product release instantiating last year&#8217;s <a href="http://www.dbms2.com/2009/06/08/the-future-of-data-marts/" >EDC (Enterprise Data Cloud) vision statement and marketing campaign</a>.</p>
<p style="margin-bottom: 0in;">Greenplum 4.0 highlights and related observations include:<span id="more-1887"></span></p>
<ul>
<li>For the most part, <strong>Greenplum 	4.0 is focused on general robustness catch-up and </strong><a href="http://www.dbms2.com/2009/08/21/bottleneck-whack-a-mole/" ><strong>Bottleneck Whack-A-Mole</strong></a><strong>,</strong><span style="font-weight: normal;"> much 	like the latest rel</span>eases from fellow analytic DBMS vendors 	<a href="http://www.dbms2.com/2010/02/22/data-warehouse-dbms-news-roundup/" >Vertica and Aster Data</a>.</li>
<li>Greenplum has switched its 	replication approach from logical (execute transactions against two 	different disks) to block-level (just ship over the blocks that were 	changed by the original transaction). This seems to increase a 	Greenplum database&#8217;s robustness/performance/uptime in the face of 	disk/node failure. It also provides Greenplum with an ongoing 	performance advantage in that data only has to be compressed once in 	total for both disk writes.</li>
<li>The Greenplum DBMS now has 	something called “tablespaces,” which sounds as if it extends 	<a href="http://www.dbms2.com/2009/10/14/greenplum-hybrid-columnar/" >Greenplum&#8217;s “polymorphic storage”</a> to accommodate different kinds 	of storage device. Everybody has to do and for the most part is 	doing this, e.g. <a href="http://www.dbms2.com/2008/10/14/teradata-virtual-storage/" >Teradata</a><span style="font-style: normal;"> and </span><a href="http://www.dbms2.com/2009/08/25/sybase-iq-technical-highlights/" >Sybase</a>. At least for now, you need to have the 	same mix of storage technology at every Greenplum node. That said, 	while Greenplum&#8217;s customers will surely want solid-state storage in 	the future, that&#8217;s not quite yet a major current issue.</li>
<li>The timetable on Greenplum 4.0 is 	a salami-thin-slicer&#8217;s delight:
<ul>
<li>Greenplum 4.0 has been used in 	POCs (Proofs of Concept) for a while.</li>
<li>Greenplum 4.0 has been in early 	access for a few weeks.</li>
<li>Greenplum 4.0 controlled 	availability is planned for the end of April.</li>
<li>Greenplum 4.0 general availability 	is planned around the end of May or early June.</li>
<li>(Note: Everything in Greenplum 4.0 	has been built, and is undergoing QA).</li>
</ul>
</li>
<li>Greenplum has put together a nice 	list of big-name customers, including <a href="http://www.dbms2.com/2009/03/05/fox-interactive-medias-multi-hundred-terabyte-database-running-on-greenplum/" >Fox/MySpace</a>, <a href="http://www.dbms2.com/2009/04/30/ebays-two-enormous-data-warehouses/" >eBay</a>, Sears, and T-Mobile. While Fox/MySpace never got to the <a href="http://www.dbms2.com/2008/08/25/greenplum-is-in-the-big-leagues/" >predicted</a> 1-petabyte level of user data, T-Mobile is loosely projected to 	indeed get there. The same 1-petabyte projection is made more 	confidently about another Greenplum telecom customer (unnamed), 	which seems to be in the process of acquiring a 300-node Greenplum 	system.</li>
</ul>
<p style="margin-bottom: 0in;">The really interesting part of this announcement, however, is Greenplum Chorus. Greenplum agrees with my assertion that <strong>Greenplum Chorus is a new kind of data integration/ETL technology.</strong> In particular, Greenplum Chorus is designed around a stance I agree with, namely <a href="http://www.dbms2.com/2010/04/12/enterprise-data-warehouse-edw-myt/" >it&#8217;s unrealistic to put everything into a single enterprise data warehouse (EDW)</a>; you need to manage data marts as well, preferably in a coordinated way. Mainstream data integration/ETL (Extract/Integration/Load) vendors such as Informatica<span style="font-style: normal;"> or </span><a href="http://www.dbms2.com/category/products-and-vendors/ab-initio-software-corporation/" >Ab Initio</a><span style="font-style: normal;"> would surely say “That&#8217;s often quite true, and our technology can handle such scenarios just as it handles single-EDW-data-sink environments.” But Greenplum Chorus offers three capabilities not generally found in traditional data integration products (and offers only those three capabilities), namely:</span></p>
<ul>
<li><span style="font-style: normal;">Spin 	out data marts, whether by recopying the data or by creating a 	virtual data mart inside another data warehouse/mart.</span></li>
<li><span style="font-style: normal;">Find/discover 	data in databases across your enterprise.</span></li>
<li><span style="font-style: normal;">Do 	social networking around databases/data marts.</span></li>
</ul>
<p style="margin-bottom: 0in;"><span style="font-style: normal;">Greenplum Chorus is heading into early access soon, with general availability slated around midyear. Also in the mix is a Greenplum “Hypervisor” that can somehow relate to an almost unlimited number of nodes or databases; however, I didn&#8217;t get a lot of details on the Greenplum Hypervisor technology or on the target dates for delivering and integrating the Hypervisor with other parts of Greenplum&#8217;s technology.</span></p>
<p style="margin-bottom: 0in;"><span style="font-style: normal;">When Greenplum first talked about about the enterprise data cloud (EDC) idea, it emphasized <a href="../2009/06/08/the-future-of-data-marts/">the spinning out of physical data marts in an easy way</a></span>, as opposed to the virtual d<span style="font-style: normal;">ata marts pushed by <a href="../2009/10/27/teradatas-nebulous-cloud-strategy/">Oliver Ratzesberger and Teradata</a>. Greenplum Chorus, however, supports both kinds (as, at least directionally, does Teradata), specifically letting you choose between:</span></p>
<ul>
<li>“<span style="font-style: normal;">Independent 	sandboxes” – physical copies of the data, in a separate 	Greenplum database instance.</span></li>
<li>“<span style="font-style: normal;">Satellite 	sandboxes” – virtual data marts, of course managed by the same 	Greenplum database instance.</span></li>
</ul>
<p style="margin-bottom: 0in;"><span style="font-style: normal;">Actually, if you want to recopy data in the same Greenplum database instance, you can do that too, via something called “data sets,” but that&#8217;s not the main focus. Either option, I presume, can be configured to provide either or both of the two main benefits of spun-out data marts, namely:</span></p>
<ul>
<li><span style="font-style: normal;">Control 	over the performance and SLAs (Service-Level Agreements) of your 	analytic workload</span></li>
<li><span style="font-style: normal;">Ability 	to mix in new raw data and/or new aggregations</span></li>
</ul>
<p style="margin-bottom: 0in;"><span style="font-style: normal;">in either case without messing up the performance, SLAs, security, or “one truth-ness” of the existing database.</span></p>
<p style="margin-bottom: 0in;"><span style="font-style: normal;">To provide those capabilities in an analytic DBMS, you need sufficiently robust parallel data movement (for the physical sandboxes) and workload management (for the virtual ones). Greenplum obviously believes it has both. Teradata makes the same claim. Other vendors would make similar assertions, and presumably will offer similar capabilities soon. You also want some kind of ability to ingest data from foreign databases, but that can be pretty routine stuff; e.g., in Release 1 of Chorus, Greenplum is content to offer ODBC access to Oracle, SQL Server, et al.</span></p>
<p style="margin-bottom: 0in;"><span style="font-style: normal;">The “data discovery” and “social networking” aspects of Greenplum Chorus seem to be quite Release 1 as well. Basically, Greenplum lets people post discussion threads about databases and data marts, discussing what value can be derived from them. I guess somebody could include links to web-technology reports based on those databases, but otherwise there&#8217;s no integration with business intelligence tools and their collaboration capabilities. Even so, Greenplum reports that business executives liked this capability in early access testing.</span></p>
<p style="margin-bottom: 0in;"><span style="font-style: normal;">Greenplum Chorus is ETL without a lot of T, and without a lot of performance optimizations either. That may not be much of a problem in its paradigmatic use case, spinning out a data mart quickly for some analysis to see if valuable conclusions can be drawn. Presumably, in the most successful cases, business and technical processes would emerge after the fact to pipe up-to-date versions of that analysis into operational systems, mooting any ETL deficiencies in the initial exploration moot. In a world where “data exploration” is becoming an increasingly important concept, something like Greenplum Chorus may suffice to provide significant customer value. But whether Greenplum Chorus&#8217;s capabilities are eventually co-opted by more fully-featured data integration suites remains an open question for the future.</span></p>
]]></content:encoded>
			<wfw:commentRss>http://www.dbms2.com/2010/04/12/greenplumchorus/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Intelligent Enterprise’s Editors’/Editor’s Choice list for 2010</title>
		<link>http://www.dbms2.com/2010/02/11/intelligent-enterprise-editors-choice-201/</link>
		<comments>http://www.dbms2.com/2010/02/11/intelligent-enterprise-editors-choice-201/#comments</comments>
		<pubDate>Thu, 11 Feb 2010 23:13:42 +0000</pubDate>
		<dc:creator>Curt Monash</dc:creator>
				<category><![CDATA[Analytic technologies]]></category>
		<category><![CDATA[Aster Data]]></category>
		<category><![CDATA[Business intelligence]]></category>
		<category><![CDATA[Cloudera]]></category>
		<category><![CDATA[Data warehousing]]></category>
		<category><![CDATA[Greenplum]]></category>
		<category><![CDATA[HP and Neoview]]></category>
		<category><![CDATA[IBM and DB2]]></category>
		<category><![CDATA[Infobright]]></category>
		<category><![CDATA[Ingres]]></category>
		<category><![CDATA[Intersystems and Cache']]></category>
		<category><![CDATA[Jaspersoft]]></category>
		<category><![CDATA[Kalido]]></category>
		<category><![CDATA[MarkLogic]]></category>
		<category><![CDATA[Microsoft and SQL*Server]]></category>
		<category><![CDATA[Netezza]]></category>
		<category><![CDATA[Open source]]></category>
		<category><![CDATA[Oracle]]></category>
		<category><![CDATA[Pentaho]]></category>
		<category><![CDATA[QlikTech and QlikView]]></category>
		<category><![CDATA[SAP AG]]></category>
		<category><![CDATA[Tableau Software]]></category>
		<category><![CDATA[Talend]]></category>
		<category><![CDATA[Teradata]]></category>
		<category><![CDATA[Vertica Systems]]></category>

		<guid isPermaLink="false">http://www.dbms2.com/?p=1578</guid>
		<description><![CDATA[As he has before, Intelligent Enterprise Editor Doug Henschen

Personally selected annual lists of 12 &#8220;Most influential&#8221; companies and 36 &#8220;Companies to watch&#8221; in analytics- and database-related sectors.
Made it clear that these are his personal selections.
Nonetheless has called it an Editors&#8217; Choice list, rather than Editor&#8217;s Choice.  

(Actually, he&#8217;s really called it an &#8220;award.&#8221;)
People advising [...]]]></description>
			<content:encoded><![CDATA[<p>As he has <a href="http://www.dbms2.com/2009/01/12/intelligent-enterprises-editorseditors-choice-list/" >before</a>, <em>Intelligent Enterprise</em> Editor Doug Henschen</p>
<ul>
<li>Personally selected <a href="http://intelligent-enterprise.informationweek.com/showArticle.jhtml;jsessionid=IANLOXCT2244BQE1GHPCKH4ATMY32JVN?articleID=222900034&amp;pgno=1" onclick="javascript:pageTracker._trackPageview('/intelligent-enterprise.informationweek.com');">annual lists</a> of 12 &#8220;Most influential&#8221; companies and 36 &#8220;Companies to watch&#8221; in analytics- and database-related sectors.</li>
<li>Made it clear that these are his personal selections.</li>
<li>Nonetheless has called it an Editors&#8217; Choice list, rather than Editor&#8217;s Choice. <img src='http://www.dbms2.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' /> </li>
</ul>
<p>(Actually, he&#8217;s really called it an &#8220;award.&#8221;)</p>
<p><span id="more-1578"></span>People advising Doug &#8212; who come to think of it actually are Contributing Editors to <em>Intelligent Enterprise</em> or something like that &#8212; included Cindi Howson, Seth Grimes, three others, and me.</p>
<p>And if past is prologue, I will now get a flood of PR emails calling my attention to this award that I already have both participated in and blogged about. <img src='http://www.dbms2.com/wp-includes/images/smilies/icon_wink.gif' alt=';)' class='wp-smiley' /> </p>
<p>As usual, the sense:nonsense ratio on these lists was pleasingly high. Analytic DBMS vendors cited included IBM, Microsoft, Netezza, Oracle, Sybase, and Teradata in the &#8220;Most influential&#8221; group, with Aster, Greenplum, HP, Infobright, and Vertica among the &#8220;To watch&#8221; crowd. It&#8217;s tough to argue with those selections, whose most questionable element is probably the not-ridiculous supposition that HP could do something interesting over the coming year. Cloudera and Intersystems also made the list, deservedly.</p>
<p>All three of QlikTech, Tableau, and TIBCO made the list, which is appropriate given the potential for and interest in interactive data exploration technology.  The BI majors, independent or otherwise, were all on as well. In text mining, Doug included Attensity and Clarabridge, which I think is exactly right. (Plus OpenCalais.)  Upon reflection, I probably should have nominated Mark Logic, even though most of its business is non-enterprise; but hey, nobody&#8217;s perfect, and the same goes for lists. Open source was well represented, with Apache, Actuate, Jaspersoft, Eclipse, Infobright, Nuxeo and R all being cited (but not Ingres or Pentaho). Kalido made the list, with my endorsement, their silly I-CASE like marketing messaging notwithstanding.</p>
<p>Speaking of imperfections &#8212; there only are a few category names, and so category assignments can be pretty bizarre. (In an ideal world, middleware wouldn&#8217;t be included under &#8220;enterprise applications&#8221;.) Greenplum hasn&#8217;t really &#8220;extended&#8221; its DBMS with a &#8220;cloud&#8221; option. As much as I&#8217;d like Netezza to be more influential than SAP, that&#8217;s probably not the best way to rank them. And there are a number of &#8220;This company is on a roll!&#8221; kinds of comments that I wouldn&#8217;t necessarily endorse.</p>
<p>But those are all nitpicks. On the whole, it&#8217;s another nice job.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.dbms2.com/2010/02/11/intelligent-enterprise-editors-choice-201/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>Comments on the Gartner 2009/2010 Data Warehouse Database Management System Magic Quadrant</title>
		<link>http://www.dbms2.com/2010/02/10/gartner-magic-quadrant-data-warehouse-2009-2010/</link>
		<comments>http://www.dbms2.com/2010/02/10/gartner-magic-quadrant-data-warehouse-2009-2010/#comments</comments>
		<pubDate>Wed, 10 Feb 2010 23:28:39 +0000</pubDate>
		<dc:creator>Curt Monash</dc:creator>
				<category><![CDATA[Analytic technologies]]></category>
		<category><![CDATA[Aster Data]]></category>
		<category><![CDATA[Data warehouse appliances]]></category>
		<category><![CDATA[Data warehousing]]></category>
		<category><![CDATA[Exadata]]></category>
		<category><![CDATA[Greenplum]]></category>
		<category><![CDATA[HP and Neoview]]></category>
		<category><![CDATA[IBM and DB2]]></category>
		<category><![CDATA[Infobright]]></category>
		<category><![CDATA[Ingres]]></category>
		<category><![CDATA[Market share]]></category>
		<category><![CDATA[Netezza]]></category>
		<category><![CDATA[Open source]]></category>
		<category><![CDATA[Oracle]]></category>
		<category><![CDATA[Pricing]]></category>
		<category><![CDATA[Sybase]]></category>
		<category><![CDATA[Teradata]]></category>
		<category><![CDATA[illuminate Solutions]]></category>

		<guid isPermaLink="false">http://www.dbms2.com/?p=1553</guid>
		<description><![CDATA[At intervals of little over a year, Gartner Group publishes a Data Warehouse Database Management System Magic Quadrant. Gartner&#8217;s 2009 data warehouse DBMS Magic Quadrant &#8212; actually, January 2010 &#8212; is now out.* For many reasons, including those I noted in my comments on Gartner&#8217;s 2008 Data Warehouse DBMS Magic Quadrant, the Gartner quadrant pictures [...]]]></description>
			<content:encoded><![CDATA[<p>At intervals of little over a year, Gartner Group publishes a Data Warehouse Database Management System Magic Quadrant. <a href="http://www.gartner.com/technology/media-products/reprints/greenplum/173535.html" onclick="javascript:pageTracker._trackPageview('/www.gartner.com');">Gartner&#8217;s 2009 data warehouse DBMS Magic Quadrant</a> &#8212; actually, January 2010 &#8212; is now out.* For many reasons, including those I noted in <a href="http://www.dbms2.com/2009/01/12/gartners-2008-data-warehouse-database-management-system-magic-quadrant-is-out/" >my comments on Gartner&#8217;s 2008 Data Warehouse DBMS Magic Quadrant</a>, the Gartner quadrant pictures are a bad use of good research. Rather than rehash that this year, I&#8217;ll merely call out some points in the surrounding commentary that I find interesting or just plain strange.<span id="more-1553"></span></p>
<p><em>*Links to Gartner Magic Quadrants commonly break, but that one worked at the time of this posting.</em></p>
<ul>
<li>Gartner thinks that data warehouse appliances are on the rise, due to their simplicity.</li>
<li>Gartner correctly says that <a href="http://www.softwarememories.com/2008/09/15/database-machines/" onclick="javascript:pageTracker._trackPageview('/www.softwarememories.com');">Teradata has been a data warehouse appliance vendor from the getgo</a>.</li>
<li>Gartner characterizes IBM as being an appliance vendor as well.</li>
<li>Gartner suggests that HP is having trouble living up to its technical promises for Neoview.</li>
<li>Gartner further suggests &#8212; no surprise here &#8212; that HP Neoview has had very few new customers past its initial wave.</li>
<li>Gartner notes IBM&#8217;s difficulties in selling data warehouse installations of DB2, despite what on paper is great-sounding technology.</li>
<li>Gartner says &#8212; also no surprise &#8212; that illuminate &#8220;has seen little success in North America since opening its first office in the U.S. over two years ago.&#8221;</li>
<li>Ingres has evidently gotten a few BI-centric &#8220;appliance&#8221; deals, e.g. with Jaspersoft. But basically Ingres isn&#8217;t doing well in data warehousing.</li>
<li>Gartner does say Ingres has &#8220;the strongest open-source DBMS offering for data warehousing.&#8221; Being very literal about &#8220;open source,&#8221; that&#8217;s a defensible claim &#8212; but it&#8217;s pretty irrelevant in a world where <a href="http://www.dbms2.com/2009/10/19/greenplum-free-single-node-edition/" >Greenplum Single-Node Edition</a> can be had for free. It also waves away all the data mart use cases in which Infobright Community Edition shines.</li>
<li>Gartner says that Netezza is working out as a &#8220;complex workload&#8221; enterprise data warehouse provider, according to reference checks, in addition to its established success in data mart scenarios.</li>
<li>Gartner says Oracle&#8217;s offering has finally become &#8220;accepted&#8221; in the market for databases &gt;50 TB. I guess I can live with that fairly weak claim, but <a href="http://www.dbms2.com/2009/09/19/oracle-database-siz/" >I wouldn&#8217;t go much further than that</a>.</li>
<li>Gartner asserts that, unlike software-only Oracle, Oracle Exadata isn&#8217;t significantly harder to administer than &#8220;other mixed OLTP/OLAP DBMS vendors,&#8221; because Exadata is fast enough you don&#8217;t need to jump through all those hoops any more to get tolerable performance. The money quote is &#8220;one reference reported reducing the number of indexes by a factor of 100 to fewer than five.&#8221; Note, however, that Gartner does not seem to assert that Exadata&#8217;s ease of use rivals that of the newer analytic DBMS specialists.</li>
<li>Gartner confirms <a href="http://www.dbms2.com/2009/02/01/oracle-says-they-do-onsite-exadata-pocs-after-all/" >Oracle&#8217;s reluctance to do onsite Exadata POCs</a>, but says it is not absolute. This is roughly compatible with what I&#8217;m hearing elsewhere, and indeed with Oracle own claims to be ramping up availability of Exadata POC hardware.</li>
<li>Gartner&#8217;s criteria for inclusion include at least 10 different organizations having a product &#8220;in production.&#8221; Thus, the big surprise was ParAccel being included. The money quote there is &#8220;With approximately 20 customers in the pharmaceutical, retail, financial and media/advertising analytics sectors, ParAccel has a good reference base.&#8221; That assessment is difficult to reconcile with other information, but I&#8217;ve been told Gartner is sticking to its guns. That assessment would be even harder to believe if those 20 references were all alleged to be true production customers.</li>
<li>Gartner notes that you basically can&#8217;t run a 1 TB+ MySQL data warehouse without sharding. (Of course, Infobright has an alternative, and up to a small number of terabytes so does Kickfire.)</li>
<li>Gartner reports that at least some customers are pleased with Sybase IQ&#8217;s mixed workload/enterprise data warehouse capabilities.</li>
<li>Gartner correctly notes that <a href="http://www.dbms2.com/2009/10/05/oracle-exadata-2-capacity-pricing/" >Oracle Exadata is a price-competition challenge for Teradata</a>.</li>
<li>Gartner notes that 20% of Vertica&#8217;s customers are outside the US. While not shocking, that&#8217;s more than I realized.</li>
<li>Gartner notes something I don&#8217;t think I&#8217;ve posted yet, which is that Vertica has a customer with 300 TB of data. (The identity is a deep dark secret, but if I told you you probably wouldn&#8217;t recognize the name anyway.)</li>
</ul>
<p>As does any such piece, the Gartner Data Warehouse DBMS Magic Quadrant also has outright errors.  For example:</p>
<ul>
<li>Aster Data isn&#8217;t really &#8220;the newest entrant to the DBMS data warehouse world.&#8221;</li>
<li>Aster&#8217;s SQL/MapReduce was not new in Release 4.0.</li>
<li>Greenplum isn&#8217;t yet pushing down code to the storage tier.</li>
<li>I&#8217;m not sure what kind of database-tier parallelism Gartner is claiming is new in Oracle in 11g Release 2 &#8212; but I doubt it&#8217;s really new. Rather, what Oracle has done recently is <a href="http://www.dbms2.com/2010/01/22/oracle-database-hardware-strategy/" >make parallelism less administratively cumbersome</a>.</li>
<li>Vertica wasn&#8217;t really the first DBMS in the cloud. At most it was the first pure-play analytic DBMS to get there.</li>
</ul>
]]></content:encoded>
			<wfw:commentRss>http://www.dbms2.com/2010/02/10/gartner-magic-quadrant-data-warehouse-2009-2010/feed/</wfw:commentRss>
		<slash:comments>5</slash:comments>
		</item>
		<item>
		<title>Greenplum Single-Node Edition &#8212; sometimes free is a real cool price</title>
		<link>http://www.dbms2.com/2009/10/19/greenplum-free-single-node-edition/</link>
		<comments>http://www.dbms2.com/2009/10/19/greenplum-free-single-node-edition/#comments</comments>
		<pubDate>Mon, 19 Oct 2009 13:25:41 +0000</pubDate>
		<dc:creator>Curt Monash</dc:creator>
				<category><![CDATA[Analytic technologies]]></category>
		<category><![CDATA[Data warehousing]]></category>
		<category><![CDATA[EnterpriseDB and Postgres Plus]]></category>
		<category><![CDATA[Greenplum]]></category>
		<category><![CDATA[Infobright]]></category>
		<category><![CDATA[Open source]]></category>
		<category><![CDATA[PostgreSQL]]></category>
		<category><![CDATA[Pricing]]></category>
		<category><![CDATA[Scientific research]]></category>

		<guid isPermaLink="false">http://www.dbms2.com/?p=1158</guid>
		<description><![CDATA[Greenplum is announcing today that you can run Greenplum software on a single 8-core commodity server, free.  First and foremost, that&#8217;s a strong statement that Greenplum wants enterprises to pay it for Greenplum&#8217;s parallelization/”private cloud” capabilities. Second, it may be an attractive gift to a variety of folks who want to extract insight from [...]]]></description>
			<content:encoded><![CDATA[<p style="margin-bottom: 0in;">Greenplum is announcing today that you can run Greenplum software on a single 8-core commodity server, free.  First and foremost, that&#8217;s a strong statement that Greenplum wants enterprises to pay it for Greenplum&#8217;s parallelization/”<a href="../2009/06/08/the-future-of-data-marts/">private cloud</a>” capabilities. Second, it may be an attractive gift to a variety of folks who want to extract insight from terabyte-scale databases of various kinds.</p>
<p style="margin-bottom: 0in;">Greenplum Single-Node Edition:</p>
<ul>
<li>Is free of charge, although you 	can buy support.</li>
<li>Has no restrictions on use, 	production or otherwise.</li>
<li>Has no restrictions on database 	size.</li>
<li>Is closed-source.</li>
</ul>
<p style="margin-bottom: 0in;">For those who want free, terabyte-scale data warehousing software, Greenplum Single-Node Edition may be quite appealing, considering that the main available alternatives are:</p>
<ul>
<li>General-purpose open-source DBMS, 	such as PostgreSQL and MySQL (lacking analytic DBMS performance and 	features)</li>
<li>Infobright Community Edition (the 	other best choice – <a href="../2009/10/14/infobright-notes/">Infobright&#8217;s 	commercial sales success</a> indicates the solidity of Infobright&#8217;s 	technology)</li>
<li>Rough research-project code and 	other other questionable open source offerings</li>
<li>Crippleware from other commercial 	analytic DBMS vendors (e.g., <a href="../2009/10/19/teradata-partners-2009/">Teradata</a>)</li>
</ul>
<p style="margin-bottom: 0in;">For example, comparing PostgreSQL-based Greenplum with PostgreSQL itself, Greenplum offers:</p>
<ul>
<li>The ability to scale out queries 	across all cores in your box (and no, pgpool is not a serious 	alternative)</li>
<li>Storage alternatives such as 	columnar (I am told that EnterpriseDB recently stopped funding a 	project for a PostgreSQL columnar option)</li>
</ul>
<p style="margin-bottom: 0in;"><span id="more-1158"></span>Greenplum would surely also argue that its software is superior to PostgreSQL in parallel load, compression, MapReduce integration, and general fit-and-finish. I imagine that in some (perhaps not all) cases it would be right. PostgreSQL&#8217;s main technical advantages over Greenplum would probably lie in the area of datatype extensibility.</p>
<p style="margin-bottom: 0in;">The main target users for Greenplum&#8217;s Single-Node Edition are obviously <strong>individual enterprise power users or very small analytic teams.</strong> I.e., it&#8217;s people with a data mart need that a central data warehouse isn&#8217;t meeting. Potential benefits to Greenplum include:</p>
<ul>
<li>Adding value to its <a href="../2009/06/08/the-future-of-data-marts/">Enterprise 	Data Cloud</a> story</li>
<li>Seeding the market for future 	enterprise sales</li>
<li>Depriving competitors of revenue, 	perhaps at enterprises too small to ever be paying Greenplum 	customers</li>
</ul>
<p style="margin-bottom: 0in;">In addition, I see free Greenplum as a charity offering that could be appealing to <a href="http://" onclick="javascript:pageTracker._trackPageview('/');">scientists</a> who face PostgreSQL performance limitations.</p>
<p style="margin-bottom: 0in;"><em><strong>Related links</strong></em></p>
<ul>
<li><a href="http://www.greenplum.com/news/252/388/Greenplum-Introduces-Free-Greenplum-Database-Edition-for-Data-Analysts/d,press-releases/" onclick="javascript:pageTracker._trackPageview('/www.greenplum.com');">Greenplum 	Free Single-Node Edition press release</a> (I&#8217;m quoted)</li>
<li><a href="http://www.mysqlperformanceblog.com/2009/10/02/analyzing-air-traffic-performance-with-infobright-and-monetdb/" onclick="javascript:pageTracker._trackPageview('/www.mysqlperformanceblog.com');">MySQL 	Performance blog on MonetDB and Infobright community edition</a></li>
<li><a href="http://archives.postgresql.org/pgsql-general/2009-03/msg01227.php" onclick="javascript:pageTracker._trackPageview('/archives.postgresql.org');">PostgreSQL&#8217;s 	restriction to one core per query</a></li>
<li><a href="http://www.infobright.org/Forums/viewthread/1141/" onclick="javascript:pageTracker._trackPageview('/www.infobright.org');">Infobright&#8217;s 	restriction to one core per query</a></li>
</ul>
]]></content:encoded>
			<wfw:commentRss>http://www.dbms2.com/2009/10/19/greenplum-free-single-node-edition/feed/</wfw:commentRss>
		<slash:comments>11</slash:comments>
		</item>
	</channel>
</rss>
