<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>DBMS 2 : DataBase Management System Services &#187; Data warehouse appliances</title>
	<atom:link href="http://www.dbms2.com/category/database-management-system/data-warehouse-appliances/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.dbms2.com</link>
	<description>Choices in data management and analysis</description>
	<lastBuildDate>Thu, 02 Sep 2010 09:06:44 +0000</lastBuildDate>
	<generator>http://wordpress.org/?v=2.8.4</generator>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
			<item>
		<title>Teradata&#8217;s future product strategy</title>
		<link>http://www.dbms2.com/2010/08/12/teradata-future-product-strategy/</link>
		<comments>http://www.dbms2.com/2010/08/12/teradata-future-product-strategy/#comments</comments>
		<pubDate>Thu, 12 Aug 2010 10:37:14 +0000</pubDate>
		<dc:creator>Curt Monash</dc:creator>
				<category><![CDATA[Business intelligence]]></category>
		<category><![CDATA[Data warehouse appliances]]></category>
		<category><![CDATA[Data warehousing]]></category>
		<category><![CDATA[Kickfire]]></category>
		<category><![CDATA[Microstrategy]]></category>
		<category><![CDATA[Solid-state memory]]></category>
		<category><![CDATA[Storage]]></category>
		<category><![CDATA[Teradata]]></category>

		<guid isPermaLink="false">http://www.dbms2.com/?p=2769</guid>
		<description><![CDATA[I think Teradata&#8217;s future product strategy is coming into focus. I&#8217;ll start by outlining some particular aspects, and then show how I think it all ties together.

The immediate hook here is that I had a short conversation with Scott Gnau of Teradata yesterday, triggered by Teradata&#8217;s acquisition of Kickfire&#8217;s assets. Takeaways from that part included:

The [...]]]></description>
			<content:encoded><![CDATA[<p>I think Teradata&#8217;s future product strategy is coming into focus. I&#8217;ll start by outlining some particular aspects, and then show how I think it all ties together.<br />
<span id="more-2769"></span></p>
<p style="margin-bottom: 0in;">The immediate hook here is that I had a short conversation with Scott Gnau of Teradata yesterday, triggered by <a href="../2010/07/27/kickfire-unlikely-to-survive/">Teradata&#8217;s acquisition of Kickfire&#8217;s assets</a>. Takeaways from that part included:</p>
<ul>
<li>The acquisition is all about 	Kickfire&#8217;s <a href="../2009/08/21/kickfires-fpga-based-technical-strategy/">data 	pipelining</a> technology.</li>
<li>Scott (in my opinion rightly) 	thinks that isn&#8217;t particularly tied to Kickfire&#8217;s choice of 	particular DBMS architecture (fairly vanilla columnar).</li>
<li>No decision has been made about 	whether the right vehicle for this technology is an FPGA (Field 	Programmable Gate Array), conventional Intel CPU, RAM, etc.</li>
</ul>
<p style="margin-bottom: 0in;"><em>If you want to handicap Teradata&#8217;s future data pipelining strategy, you might note that:</em></p>
<ul>
<li><em>Kickfire&#8217;s own choice – and 	hence its existing implementation – is an FPGA.</em></li>
<li><em><a href="../2009/08/04/vectorwise-ingres-and-monetdb/">VectorWise&#8217;s 	approach to pipelining is Intel-based,</a> apparently at the cost of 	being closely tied to specific generations of Intel CPUs.</em></li>
<li><em><a href="../2009/07/27/xtremedata-announces-its-dbx-data-warehouse-appliance/">XtremeData&#8217;s 	approach to pipelining</a> is FPGA-based.</em></li>
<li><em>Teradata has a lot more 	development resources than any of those other companies, as well as 	important existing products, and hence has both means and motive to 	shoehorn new technology into older system designs.</em></li>
</ul>
<p style="margin-bottom: 0in;">While I had Scott on the phone, I brought up a few other subjects too. Highlights included:</p>
<ul>
<li>Teradata&#8217;s Flash-based appliance 	is doing just fine in beta test and customer POCs (Proofs of 	Concept).</li>
<li>Other kinds of Teradata appliance 	are not inconceivable.</li>
<li>Scott thinks <a href="http://www.dbms2.com/2010/07/31/teradata-xkoto-gridscale-rip-and-active-active-clustering/" >Michael McIntire&#8217;s 	condemnation of Active-Active architectures</a> is overstated. That 	said,
<ul>
<li>Scott does acknowledge a need for 	greater Active-Active scalability, and suggests that the reason 	Xkoto&#8217;s current products are being discontinued is their lack of 	scaling.</li>
<li>Scott seems quietly confident the 	scaling will get done.</li>
</ul>
</li>
<li>Scott is emphatic that Teradata is 	not going to go to <a href="../2009/04/20/calpont-update-you-read-it-here-first/">a 	two-tier architecture</a>. In particular, the point of splitting 	storage/lightweight database processing and heavyweight database 	processing on separate tiers is generally to save bandwidth, and 	Teradata&#8217;s BYNET is typically less than 10% loaded.</li>
<li>Scott didn&#8217;t dispute my claim that 	this all suggests <a href="../2008/10/14/teradata-virtual-storage/">Teradata 	Virtual Storage</a> is the future, at the expense of a rigid 	delineation among <a href="../2008/10/23/teradata-appliance-product-lines/">specific 	use-case-focused product lines</a>.</li>
<li>Unlike <a href="http://www.dbms2.com/2010/02/22/netezza-twinfin/" >Netezza</a> or <a href="http://www.dbms2.com/2010/02/22/aster-data-ncluster-4-5/" >Aster</a>, Teradata doesn&#8217;t seem to plan analytic capability that works outside 	the UDF (User Defined Function) framework. However, Scott noted that 	Teradata has long had the capability that Aster and Netezza now also 	have of letting you run analytic code either in “protected mode” 	(if the process fails the whole database doesn&#8217;t crash) or in the 	database kernel (best performance, if you&#8217;re sufficiently confident 	in the code&#8217;s stability to take the risk). Scott also spoke of the 	release later this quarter of Teradata FastPath, which will offer 	yet better performance (however, there&#8217;s a gotcha to Teradata 	FastPath that&#8217;s still NDA).</li>
</ul>
<p style="margin-bottom: 0in;">Putting all that together with the rest of what we know about Teradata, I&#8217;m going to call out<strong> three pillars of Teradata&#8217;s long-term product strategy:</strong></p>
<ul>
<li><strong>Same fundamentals as always.</strong> Teradata&#8217;s core product strategy is:
<ul>
<li>Single DBMS, capable of meeting 	all analytic needs while running in a single instance, usually 	running on &#8230;</li>
<li>… proprietary hardware …</li>
<li>… built from 	conservatively-chosen parts.</li>
</ul>
</li>
<li><strong>Selective vertical application 	stack.</strong> No matter how horizontally-oriented they are, many 	companies that have been in the analytic technology business for a 	while wind up with some vertical applications. It sort of just 	happens. Teradata is no exception. Teradata also likes to sell 	services to its product customers, and some of those are quite 	vertical-aware.</li>
<li><strong>Mutable, modular platform.</strong> This is what I highlighted above. Note that it&#8217;s philosophically 	attuned with the one-system-does-everything approach Teradata 	prefers. More subtly, please also note that it goes well with 	customer-by-customer price customization, which is almost a must for 	Teradata given the Innovator&#8217;s Dilemma kind of pricing box it finds 	itself in.</li>
</ul>
<p style="margin-bottom: 0in;">So far, that&#8217;s not too exciting, except in the details of how Teradata&#8217;s engineers make that all work. But there&#8217;s a <strong>fourth pillar to Teradata&#8217;s technical strategy</strong> as well, and it&#8217;s a wild card: t<strong>ight partnerships.</strong> Every time I talk with Teradata hardware chief Carson Schmidt, he seems excited about some particular version of a part or other – sometimes from a reasonably established vendor (once it was LSI Logic), sometimes from a tiny one (notably <a href="../2009/10/25/teradata-hardware-strategy-and-tactics/">the “stealth” start-up on which Teradata bet its first solid-state product</a>.) In the future, I expect tight business intelligence partnerships as well. Cognos BI will be increasingly integrated with IBM&#8217;s DBMS and hardware; Business Objects&#8217; BI will increasingly be integrated with SAP&#8217;s applications; and Oracle&#8217;s BI will eventually be integrated with everything. How do you compete with that if you<span style="font-style: normal;">&#8216;re Microstrategy? </span>Well, you try to have superior product, of course – but you also partner as closely with DBMS vendors as you can, an approach Microstrategy has already started. Predictive analytics stalwart <a href="http://www.dbms2.com/2010/05/15/further-clarifying-in-database-mpp-sas/" >SAS</a>, of course, is on a partnership binge as well.</p>
<p style="margin-bottom: 0in;">Teradata has a larger installed base than almost all its competitors, and enjoys richer third-party software and service support as a result. But I suspect that going forward,  for Teradata to remain a leading competitor at price points it is willing to accept, Teradata&#8217;s “ecosystem” advantages will need to ratchet up one or several notches.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.dbms2.com/2010/08/12/teradata-future-product-strategy/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>Links and observations</title>
		<link>http://www.dbms2.com/2010/08/09/links-and-observations/</link>
		<comments>http://www.dbms2.com/2010/08/09/links-and-observations/#comments</comments>
		<pubDate>Tue, 10 Aug 2010 02:37:51 +0000</pubDate>
		<dc:creator>Curt Monash</dc:creator>
				<category><![CDATA[Analytic technologies]]></category>
		<category><![CDATA[Aster Data]]></category>
		<category><![CDATA[Calpont]]></category>
		<category><![CDATA[Cassandra]]></category>
		<category><![CDATA[Data warehouse appliances]]></category>
		<category><![CDATA[Data warehousing]]></category>
		<category><![CDATA[EMC]]></category>
		<category><![CDATA[Exadata]]></category>
		<category><![CDATA[Facebook]]></category>
		<category><![CDATA[Greenplum]]></category>
		<category><![CDATA[HP and Neoview]]></category>
		<category><![CDATA[Kickfire]]></category>
		<category><![CDATA[NoSQL]]></category>
		<category><![CDATA[Northscale]]></category>
		<category><![CDATA[OLTP]]></category>
		<category><![CDATA[ParAccel]]></category>
		<category><![CDATA[Sybase]]></category>
		<category><![CDATA[XtremeData]]></category>

		<guid isPermaLink="false">http://www.dbms2.com/?p=2743</guid>
		<description><![CDATA[I&#8217;m back from a trip to the SF Bay area, with a lot of writing ahead of me. I&#8217;ll dive in with some quick comments here, then write at greater length about some of these points when I can. From my trip:  

Aster Data showed me a lot of customer names and deal sizes, across [...]]]></description>
			<content:encoded><![CDATA[<p>I&#8217;m back from a trip to the SF Bay area, with a lot of writing ahead of me. I&#8217;ll dive in with some quick comments here, then write at greater length about some of these points when I can. From my trip:  <span id="more-2743"></span></p>
<ul>
<li>Aster Data showed me a lot of customer names and deal sizes, across a bunch of industries (mainly enterprise rather than web). Yes, Aster&#8217;s market success is for real. (But almost all those details are NDA.)</li>
<li>Sybase&#8217;s product plans for IQ are pretty impressive. (But the most interesting parts are, you guessed it, NDA.)</li>
<li>I&#8217;ve kissed and made up* with ParAccel, now that they&#8217;ve replaced their CEO, replaced their marketing chief, and stopped the worst of the <a href="http://www.dbms2.com/2010/01/15/there-sure-seem-to-be-a-lot-of-inaccuracies-on-paraccels-website/" >marketing</a> <a href="http://www.dbms2.com/2009/06/22/the-tpc-h-benchmark-is-a-blight-upon-the-industry/" >nonsense</a> I used to complain about. ParAccel has some interesting plans for ParAccel 3.0 which are, naturally, NDA.</li>
<li>The Peoplesoft guys are doing it over again at Workday. Only this time, their platform isn&#8217;t a relational DBMS. Rather, it&#8217;s an in-memory, completely object-oriented data model, with disk used only on a &#8220;Just in case the power ever goes out&#8221; basis. (Thankfully, nothing at all about our conversation was NDA.)</li>
<li>I&#8217;m finally feeling good about <a href="# I spent considerable time  with my clients at both Greenplum and EMC (if we ignore the fact that  the deal has closed and they're now the same company). I also had more  of  a hardcore engineering discussion than I've had with Greenplum for  quite a while (I should have been pushier about that earlier). Takeaways  included:      * This is starting off as a honeymoon deal. Everything  Greenplum was planning to do is being continued. Additional resources  are being poured into Greenplum to do more.     * Some Greenplum execs  seem to envision staying long term, some seem to envision moving on to  their next startups. The ones who envision moving on are, however, going  to work hard first to make the merger a success.     * Greenplum has,  for quite a while, had more of an advanced analytics/embedded predictive  modeling story than I realized. Bad on them for not fleshing it out  more in marketing and product packaging alike.     * Greenplum both  denies the concurrency problems I previously noted and also has a very  credible story as to how it will eliminate them. :) Seriously, Greenplum  tells of one customer that routinely runs 150 simultaneously queries -  on what I think is not a terribly big system -- and a number of POCs  (Proofs of Concept) that simulated similar levels of concurrency.">Northscale&#8217;s  memcached-compatible persistent store Membase</a>. The main reason is  that they showed me a near-term path to interfaces that are richer than  key-value. Also, Todd Hoff reassured me that even pure persistent  memcached has a place.</li>
<li>Rumor says that even the one app for which Facebook was using Cassandra &#8212; in-box search &#8212; has been decommissioned. On the other hand, numerous other scale-0ut DBMS (SQL or otherwise) seem to have Facebook footholds. But details are &#8212; all together now! &#8212; NDA.</li>
</ul>
<p><em>*If you know ParAccel&#8217;s new marketing chief Michael Weir, you  surely guessed I mean that only in a figurative sense.</em></p>
<p>From elsewhere:</p>
<ul>
<li>Daniel Abadi offered <a href="http://dbmsmusings.blogspot.com/2010/08/thoughts-on-kickfires-apparent-demise.html" onclick="javascript:pageTracker._trackPageview('/dbmsmusings.blogspot.com');">his  analysis</a> of <a href="../2010/07/27/kickfire-unlikely-to-survive/">Kickfire&#8217;s  demise</a>. In general I agree, but Daniel neglected to mention one  hugely important factor &#8212; the chicken-egg negative effect of Kickfire&#8217;s  lack of market or marketing traction. Customers were extremely reluctant to buy from Kickfire  because they perceived, correctly, that Kickfire&#8217;s survivability was far  from assured.</li>
<li>While the <a href="http://infinidb.org/community/forums/11-general-infinidb/1000-strange-issue-with-drop-table" onclick="javascript:pageTracker._trackPageview('/infinidb.org');">InfiniDB forums</a> suggest that there are at least a couple of production users of Calpont&#8217;s free InfiniDB, Calpont seemingly has a long way to go to be even as successful as Kickfire. But Calpont does have a bit of money to spend on lead generation; maybe some day they&#8217;ll even have actual customers.</li>
<li>In a response to a question I messaged over, <a href="http://www.dbms2.com/2010/03/18/xtremedata-update/" >XtremeData</a> tells me they have actual customers now. Press releases to follow.</li>
<li>The <a href="http://news.cnet.com/8301-31021_3-20013111-260.html?part=rss&amp;subj=news&amp;tag=2547-1_3-0-20" onclick="javascript:pageTracker._trackPageview('/news.cnet.com');">admiration for the job Mark Hurd did at HP</a> is in my opinion overstated. Sure, the financial/operational management appeared to work, but HP did little on Hurd&#8217;s watch to strengthen its reputation or customers&#8217; loyalty. In particular:
<ul>
<li>HP&#8217;s analytics efforts have accomplished little.</li>
<li>HP&#8217;s data warehouse appliance efforts have failed pathetically.</li>
<li>From what I hear, HP&#8217;s execution in its Exadata partnership was not good.</li>
<li>HP&#8217;s server business in general is distinguished mainly by HP being a big company.</li>
<li>HP&#8217;s EDS acquisition has been rocky, not that EDS was sailing so smoothly on its own beforehand.</li>
<li>HP&#8217;s success in PCs amounts to &#8220;arguably, HP sucks a little less than the other guys&#8221;.</li>
<li>HP&#8217;s elite reputation is long gone (admittedly, for the most part that predates Hurd).</li>
</ul>
</li>
<li><a href="http://intelligent-enterprise.informationweek.com/blog/archives/2010/08/software_innova.html" onclick="javascript:pageTracker._trackPageview('/intelligent-enterprise.informationweek.com');">Doug Henschen</a> evidently favors really strong intellectual property protection for software, even forbidding plug-compatible reverse engineering. I agree with Doug up to the point that <a href="http://www.monashreport.com/2010/07/19/my-view-of-intellectual-property/" onclick="javascript:pageTracker._trackPageview('/www.monashreport.com');">it should be forbidden to copy proprietary software</a>, but I don&#8217;t see why he (or a court) would view such behavior as copying.</li>
</ul>
]]></content:encoded>
			<wfw:commentRss>http://www.dbms2.com/2010/08/09/links-and-observations/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Advice for some non-clients</title>
		<link>http://www.dbms2.com/2010/07/30/advice-for-some-non-clients/</link>
		<comments>http://www.dbms2.com/2010/07/30/advice-for-some-non-clients/#comments</comments>
		<pubDate>Fri, 30 Jul 2010 14:35:52 +0000</pubDate>
		<dc:creator>Curt Monash</dc:creator>
				<category><![CDATA[Business intelligence]]></category>
		<category><![CDATA[Data warehouse appliances]]></category>
		<category><![CDATA[Data warehousing]]></category>
		<category><![CDATA[Exadata]]></category>
		<category><![CDATA[HP and Neoview]]></category>
		<category><![CDATA[Information Builders]]></category>
		<category><![CDATA[Ingres]]></category>
		<category><![CDATA[Kalido]]></category>
		<category><![CDATA[MarkLogic]]></category>
		<category><![CDATA[NoSQL]]></category>
		<category><![CDATA[Objectivity and Infinite Graph]]></category>
		<category><![CDATA[Oracle]]></category>
		<category><![CDATA[SenSage]]></category>
		<category><![CDATA[Tableau Software]]></category>

		<guid isPermaLink="false">http://www.dbms2.com/?p=2699</guid>
		<description><![CDATA[Edit: Any further anonymous comments to this post will be deleted. Signed comments are permitted as always.

Most of what I get paid for is in some form or other consulting. (The same would be true for many other analysts.) And so I can be a bit stingy with my advice toward non-clients. But my non-clients [...]]]></description>
			<content:encoded><![CDATA[<p><em>Edit: Any further anonymous comments to this post will be deleted. Signed comments are permitted as always.<br />
</em></p>
<p>Most of what I get paid for is in some form or other consulting. (<a href="http://www.strategicmessaging.com/blurring-analyst-consultant-line/2010/07/28/" onclick="javascript:pageTracker._trackPageview('/www.strategicmessaging.com');">The same would be true for many other analysts</a>.) And so I can be a bit stingy with my advice toward non-clients. But my non-clients are a distinguished and powerful group, including in their number Oracle, IBM, Microsoft, and most of the BI vendors. So here&#8217;s a bit of advice for them too.</p>
<p><strong>Oracle. </strong>On the plus side, you guys have been making progress against your reputation for untruthfulness. Oh, I&#8217;ve dinged you for some <a href="http://www.dbms2.com/2008/09/30/oracle-crosses-the-line-on-integrity/" >past</a> <a href="http://www.dbms2.com/2008/06/28/response-to-rita-sallam-of-oracle/" >slip-ups</a>, but on the whole they&#8217;ve been no worse than other vendors.&#8217; But recently you pulled a doozy. The <a href="http://www.oracle.com/us/corporate/analystreports/infrastructure/index.html" onclick="javascript:pageTracker._trackPageview('/www.oracle.com');">analyst reports</a> section of your website fails to distinguish between unsponsored and sponsored work.* That is a horrible ethical stumble. Fix it fast. Then put processes in place to ensure nothing that dishonest happens again for a good long time.</p>
<p><em>*Merv Adrian&#8217;s &#8220;report&#8221; listed high on that page is actually a sponsored white paper. That Merv himself screwed up by not labeling it clearly as such in no way exonerates Oracle. Besides, I&#8217;m sure Merv won&#8217;t soon repeat the error &#8212; but for Oracle, this represents a whole pattern of behavior.</em></p>
<p><strong>Oracle.</strong> And while I&#8217;m at it, outright dishonesty isn&#8217;t your only unnecessary credibility problem. <a href="http://www.strategicmessaging.com/so-what-is-an-analyst-anyway/2010/07/25/" onclick="javascript:pageTracker._trackPageview('/www.strategicmessaging.com');">You&#8217;re also playing too many games in analyst relations</a>.</p>
<p><strong>HP.</strong> Neoview will never succeed. Admit it to yourselves. Go buy something that can.  <span id="more-2699"></span></p>
<p><strong>Smaller BI vendors.</strong> Analytic DBMS evaluations commonly include BI strategy and tool selection as well. If an analytic DBMS expert tells you he needs to learn more about your product line, don&#8217;t blow him off. In fact, you should be particularly embracing anybody who&#8217;s shown a fondness for small DBMS vendors; maybe he or his clients will like small BI vendors as well. That means (among others) you, <strong>Jaspersoft, Endeca, </strong>and <strong>Tableau.</strong></p>
<p><strong>Information Builders. </strong>Is there anything about your BI products that is in any way technologically differentiated? If so, you might want to mention some examples to somebody some time.</p>
<p><strong>Kalido.</strong> I&#8217;ve said this to you before, but it bears repeating &#8212; your positioning translates to &#8220;I-CASE for analytics,&#8221; and that&#8217;s not a good thing. If your product is not as cumbersome and entrapping as that sounds, you need to do a much better job of explaining why not.</p>
<p><strong>SenSage.</strong> You are what you are. Sell out while the selling is good. You don&#8217;t have the corporate personality to make it into the analytic DBMS mainstream on your own.</p>
<p><strong>Ingres. </strong>You need to be more engaged with analysts than you are. <a href="http://www.softwarememories.com/2010/07/25/ingres-history/" onclick="javascript:pageTracker._trackPageview('/www.softwarememories.com');">Ingres navel-gazed too much 25 years ago</a>, and evidently you haven&#8217;t outgrown it yet.</p>
<p><strong>TIBCO.</strong> You probably have a lot of cool analytic technology, but I don&#8217;t know of an influencer who has much relationship with or trust in you. Rethink how you&#8217;re approaching influencer relations top to bottom.</p>
<p><strong>Tableau.</strong> You had a lot of mindshare, but it&#8217;s fading. Do something.</p>
<p><strong>MarkLogic, graph DBMS vendors, etc.</strong> You&#8217;re clinging too hard to the NoSQL label. Nobody is out there deciding among Cassandra, neo4j, and MarkLogic. They might be deciding between MongoDB and MarkLogic, I guess, but if you admit to yourself that&#8217;s all it is you&#8217;ll probably change your messaging somewhat.</p>
<p><strong>Objectivity.</strong> Get real about marketing. Infinite Graph is a cool opportunity. But I didn&#8217;t even ping you for a meeting when I&#8217;m in your area next week, because I wouldn&#8217;t have known who to reach out to.</p>
<p><strong>Everybody (especially Objectivity).</strong> &#8220;First X deployed in the cloud&#8221; is almost surely an inaccurate claim. Don&#8217;t make it. And by the way, even if it were true, it probably wouldn&#8217;t be interesting.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.dbms2.com/2010/07/30/advice-for-some-non-clients/feed/</wfw:commentRss>
		<slash:comments>43</slash:comments>
		</item>
		<item>
		<title>Kickfire unlikely to survive</title>
		<link>http://www.dbms2.com/2010/07/27/kickfire-unlikely-to-survive/</link>
		<comments>http://www.dbms2.com/2010/07/27/kickfire-unlikely-to-survive/#comments</comments>
		<pubDate>Tue, 27 Jul 2010 18:56:48 +0000</pubDate>
		<dc:creator>Curt Monash</dc:creator>
				<category><![CDATA[Data warehouse appliances]]></category>
		<category><![CDATA[Kickfire]]></category>
		<category><![CDATA[Teradata]]></category>

		<guid isPermaLink="false">http://www.dbms2.com/?p=2658</guid>
		<description><![CDATA[Following up on a previous report of Kickfire&#8217;s troubles &#8212; a Kickfire customer tipped me off that Kickfire told him they&#8217;re selling their IP and engineers, and the Kickfire products will be discontinued.
At this time, I have no idea who the lucky buyer is.
Edit: We now know it&#8217;s Teradata.
]]></description>
			<content:encoded><![CDATA[<p>Following up on a previous report of <a href="http://www.dbms2.com/2010/06/11/kickfire-update-2/" >Kickfire&#8217;s troubles</a> &#8212; a Kickfire customer tipped me off that Kickfire told him they&#8217;re selling their IP and engineers, and the Kickfire products will be discontinued.</p>
<p>At this time, I have no idea who the lucky buyer is.</p>
<p><em>Edit: We now know it&#8217;s Teradata.</em></p>
]]></content:encoded>
			<wfw:commentRss>http://www.dbms2.com/2010/07/27/kickfire-unlikely-to-survive/feed/</wfw:commentRss>
		<slash:comments>11</slash:comments>
		</item>
		<item>
		<title>More on Greenplum and EMC</title>
		<link>http://www.dbms2.com/2010/07/07/more-on-greenplum-and-emc/</link>
		<comments>http://www.dbms2.com/2010/07/07/more-on-greenplum-and-emc/#comments</comments>
		<pubDate>Wed, 07 Jul 2010 19:15:21 +0000</pubDate>
		<dc:creator>Curt Monash</dc:creator>
				<category><![CDATA[Data warehouse appliances]]></category>
		<category><![CDATA[Data warehousing]]></category>
		<category><![CDATA[EMC]]></category>
		<category><![CDATA[Greenplum]]></category>
		<category><![CDATA[Market share]]></category>

		<guid isPermaLink="false">http://www.dbms2.com/?p=2534</guid>
		<description><![CDATA[I talked with Ben Werther of Greenplum for about 40 minutes, which was my first post-merger Greenplum/EMC briefing. &#8220;Historical&#8221; highlights include:

Ben says Greenplum wasn&#8217;t being shopped, by which he means Greenplum was out raising more capital and the fund-raising was going well.  Note: Half or so of Greenplum&#8217;s deals were subscription-priced, so it had weaker [...]]]></description>
			<content:encoded><![CDATA[<p>I talked with Ben Werther of Greenplum for about 40 minutes, which was my first post-merger Greenplum/EMC briefing. &#8220;Historical&#8221; highlights include:</p>
<ul>
<li>Ben says Greenplum wasn&#8217;t being shopped, by which he means Greenplum was out raising more capital and the fund-raising was going well.  <em>Note: <a href="http://www.dbms2.com/2009/10/18/greenplum-customer-notes/" >Half or so of Greenplum&#8217;s deals were subscription-priced</a>, so it had weaker cash flow than it would have if it were doing equally well selling perpetual licenses.</em></li>
<li>However, joint engineering was also going well with, e.g., Greenplum CTO Luke Lonergan spending time at EMC facilities in Cork, Ireland. And one thing led to another &#8230;</li>
<li>Greenplum has ~ 140 customers, vs. <a href="http://www.dbms2.com/2009/06/05/greenplum-update-release-3-3/" >~65 five quarters ago</a>, 100+ at year-end, and an acquisition rate of 12-15/quarter last fall.</li>
<li>A typical &#8220;small&#8221; paying customer for Greenplum starts with 10-20 TB of data.</li>
<li><a href="http://www.dbms2.com/2010/04/12/greenplumchorus/" >Greenplum Chorus</a> isn&#8217;t generally available yet, with rollout energy being focused on Greenplum 4.0. <em>Note: As important as it is for overall industry direction, Greenplum Chorus is a product which won&#8217;t be a terribly big deal in Release 1 anyway.</em></li>
</ul>
<p>Highlights looking forward include:  <span id="more-2534"></span></p>
<ul>
<li>When I challenged him, Ben sounded quite optimistic that Pat Gelsinger will immunize Greenplum against and generally counteract some of EMC&#8217;s traditionally stifling bureaucracy. (My words, of course, not his.)</li>
<li>The initial Greenplum/EMC product vision appears truly centered around &#8220;private cloud,&#8221; specifically including Greenplum, VMware, and EMC storage arrays.</li>
<li>Some other areas of potential Greenplum/EMC technical synergy I think are cool obviously haven&#8217;t been seriously addressed yet.</li>
<li>Based on what I heard from Ben about the aura around the deal and also on what I know of the individual executives at Greenplum, I think each of them is a good bet to stick around EMC for a while. (That&#8217;s on average. Of course, it would be surprising if 100% of them stayed around very long.) Basically, there&#8217;s at least a chance EMC/Greenplum will do some pretty cool stuff, and most of the guys will probably stick around to see if that actually starts to happen.*</li>
</ul>
<p><em>*Also, when they do eventually leave, they&#8217;ll surely say things to the effect &#8220;The cool stuff is well underway; my work here is done.&#8221; That party line is almost guaranteed, no matter how things unfold in reality. <img src='http://www.dbms2.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' /> </em></p>
]]></content:encoded>
			<wfw:commentRss>http://www.dbms2.com/2010/07/07/more-on-greenplum-and-emc/feed/</wfw:commentRss>
		<slash:comments>6</slash:comments>
		</item>
		<item>
		<title>Why analytic DBMS increasingly need to be storage-aware</title>
		<link>http://www.dbms2.com/2010/07/07/analytic-database-storage-aware/</link>
		<comments>http://www.dbms2.com/2010/07/07/analytic-database-storage-aware/#comments</comments>
		<pubDate>Wed, 07 Jul 2010 06:30:14 +0000</pubDate>
		<dc:creator>Curt Monash</dc:creator>
				<category><![CDATA[Data warehouse appliances]]></category>
		<category><![CDATA[Data warehousing]]></category>
		<category><![CDATA[Solid-state memory]]></category>
		<category><![CDATA[Storage]]></category>
		<category><![CDATA[Theory and architecture]]></category>

		<guid isPermaLink="false">http://www.dbms2.com/?p=2515</guid>
		<description><![CDATA[In my quick reactions to the EMC/Greenplum announcement, I opined

I think that even software-only analytic DBMS vendors should design their systems in an increasingly storage-aware manner

promising to explain what I meant later on. So here goes.  
There always have been good technical reasons to tailor hardware to analytic database software. Data moves through disk controller, [...]]]></description>
			<content:encoded><![CDATA[<p style="margin-bottom: 0in;"><span style="font-style: normal;">In <a href="../2010/07/06/emc-is-buying-greenplum/">my quick reactions to the EMC/Greenplum announcement</a>, I opined</span></p>
<blockquote>
<p style="margin-bottom: 0in;">I think that even software-only analytic DBMS vendors should design their systems in an increasingly storage-aware manner</p>
</blockquote>
<p style="margin-bottom: 0in;">promising to explain what I meant later on. So here goes.  <span id="more-2515"></span></p>
<p style="margin-bottom: 0in;">There always have been good technical reasons to tailor hardware to analytic database software. Data moves through disk controller, network, RAM, CPU and more, each with its own data rate. Getting different kinds of parts into the right bal<span style="font-style: normal;">ance doesn&#8217;t completely eliminate bottlenecks – the <a href="../2010/07/06/the-one-hoss-shay/">Wonderful One-Hoss Shay</a> is poetic fiction </span>– but it certainly can help. As a result, every analytic DBMS vendor of any size offers at least one of:</p>
<ul>
<li><a href="../2007/01/27/data-warehouse-appliance-hardware-strategies/">A 	Type 0 appliance</a></li>
<li>A Type 1 appliance</li>
<li>A “recommended hardware 	configuration”</li>
</ul>
<p style="margin-bottom: 0in;">And beyond performance, appliances and pre-specified hardware configurations offer at least the possibility of easing installation, administration, and support.</p>
<p style="margin-bottom: 0in;">There also are marketing reasons to offer an appliance or something appliance-like.</p>
<ul>
<li>To various extents, Oracle, 	Teradata, Microsoft, IBM, Netezza, and EMC are all telling the world 	that your hardware should be optimized for your analytic DBMS.</li>
<li>Smaller vendors such as Vertica 	and Aster Data also tend to cobble together some sort of appliance, 	in part so they don&#8217;t have to say they disagree.</li>
<li>Thus, a “We don&#8217;t see any point 	in special hardware assembly at all” story would leave an analytic 	DBMS vendor pretty far out on a limb.</li>
</ul>
<p style="margin-bottom: 0in;">Finally, there are three overlapping technical trends that increase the need for storage-awareness in analytic DBMS. First and foremost is the rise of <strong>solid-state memory.</strong> For starters, I believe:</p>
<ul>
<li><a href="../2010/06/25/flash-is-coming-well/">Flash 	will be important for analytic DBMS soon</a>.</li>
<li>There are good technical reasons 	for this.</li>
<li><a href="../2010/01/22/oracle-database-hardware-strategy/">Oracle&#8217;s 	marketing will make a big deal out of the Flash aspects of Exadata</a>, 	so other analytic DBMS vendors will need a response. And of course, 	if Netezza or Teradata preemptively make a big deal of their 	Flash-based offerings, that just adds to the pressure for Flash 	adoption on everybody else.</li>
<li>But it&#8217;s not just Flash – <a href="../2010/01/31/flash-pcmsolid-state-memory-disk/">Flash, 	other solid-state memory, and disk</a> will be combined in various 	ways.</li>
</ul>
<p style="margin-bottom: 0in;">But this move to Flash will require analytic DBMS vendors to be increasingly storage-aware for at least three reasons:</p>
<ul>
<li>It just adds another level of 	<strong>complexity</strong> to their hardware-balancing challenges.</li>
<li><strong>Flash overturns some of the 	fundamental assumptions of modern analytic DBMS,</strong> in particular:
<ul>
<li><a href="../2006/09/19/is-data-warehousing-now-all-about-sequential-access/">Sequential 	reads are hugely better than random</a></li>
<li>The worst bottleneck is at the 	point where data comes out of storage.</li>
</ul>
</li>
<li><strong>The Flash technology stack is 	still immature,</strong> and you have to pick your poison in how to deal 	with it. Vendors are making very different choices in this regard – 	and they do have to choose.</li>
</ul>
<p style="margin-bottom: 0in;">Another trend that could naturally lead analytic DBMS vendors to be more storage-aware is their incorporation of what could be viewed as hierarchical storage/ILM technologies.  Different data is stored in different ways and/or on different kinds of storage hardware. (Vendors pursuing – you guessed it – different approache<span style="font-style: normal;">s to this include <a href="../2009/08/04/2008/10/14/teradata-virtual-storage/">Teradata</a>, <a href="../2009/10/14/greenplum-hybrid-columnar/">Greenplum</a>, <a href="../2009/08/04/flexstore-and-the-rest-of-vertica-35/">Vertica</a>, and <a href="../2009/08/25/sybase-iq-technical-highlights/">Sybase</a>.) The m</span>ore automatic that process is, the more storage-aware the DBMS will need to be.</p>
<p style="margin-bottom: 0in;">Finally, there are reasons to th<span style="font-style: normal;">ink that <a href="../2008/09/06/sans-vs-das-in-mpp-data-warehousing/">DBMS should be split between conventional servers and smart storage</a>. This is, of course, the E</span>xadata strategy. <a href="../2010/06/21/netezza-silicon-balance/">Netezza&#8217;s two-processor approach</a>, while rather different, also somewhat validates the idea.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.dbms2.com/2010/07/07/analytic-database-storage-aware/feed/</wfw:commentRss>
		<slash:comments>6</slash:comments>
		</item>
		<item>
		<title>EMC is buying Greenplum</title>
		<link>http://www.dbms2.com/2010/07/06/emc-is-buying-greenplum/</link>
		<comments>http://www.dbms2.com/2010/07/06/emc-is-buying-greenplum/#comments</comments>
		<pubDate>Tue, 06 Jul 2010 22:53:52 +0000</pubDate>
		<dc:creator>Curt Monash</dc:creator>
				<category><![CDATA[Data warehouse appliances]]></category>
		<category><![CDATA[EMC]]></category>
		<category><![CDATA[Greenplum]]></category>
		<category><![CDATA[Storage]]></category>

		<guid isPermaLink="false">http://www.dbms2.com/?p=2498</guid>
		<description><![CDATA[EMC is buying Greenplum. Most of the press release is a general recapitulation of Greenplum&#8217;s marketing messages, the main exceptions being (emphasis mine):
The acquisition of Greenplum will be an all-cash transaction and is expected to be completed in the third quarter of 2010, subject to customary closing conditions and regulatory approvals. The acquisition is not [...]]]></description>
			<content:encoded><![CDATA[<p>EMC is buying Greenplum. Most of the <a href="http://www.emc.com/about/news/press/2010/20100706-01.htm" onclick="javascript:pageTracker._trackPageview('/www.emc.com');">press release</a> is a general recapitulation of Greenplum&#8217;s marketing messages, the main exceptions being (emphasis mine):</p>
<blockquote><p>The acquisition of Greenplum will be an all-cash transaction and is <strong>expected to be completed in the third quarter of 2010,</strong> subject to customary closing conditions and regulatory approvals. The acquisition is not expected to have a material impact to EMC GAAP and non-GAAP EPS for the full 2010 fiscal year. Upon close, Bill Cook will lead the new data computing product division and report to Pat Gelsinger. <strong>EMC will continue to offer Greenplum&#8217;s full product portfolio to customers and plans to deliver new EMC Proven reference architectures as well as an integrated hardware and software offering</strong> designed to improve performance and drive down implementation costs.</p></blockquote>
<p>Greenplum is one of my biggest vendor clients, and EMC is just becoming one, but of course neither side gave me a heads-up before the deal happened, nor have I yet been briefed subsequently. With those disclaimers out of the way, some of my early thoughts include:</p>
<ul>
<li>I wish my clients would never buy each other, but it&#8217;s inevitable.</li>
<li>I don&#8217;t think anybody evaluating Greenplum should be much influenced by this deal one way or the other. (Whether they will be is of course a different matter.)
<ul>
<li>EMC tends to run its bigger software acquisitions in a fairly hands-off manner. There&#8217;s no particular FUD (Fear/Uncertainty/Doubt) reason why this deal should stop anybody from buying Greenplum software.</li>
<li>I also don&#8217;t think adding a rich parent adds much of a reason to buy from Greenplum. But if you&#8217;re the type who&#8217;s nervous about smaller vendors &#8212; well, Greenplum now isn&#8217;t so small.</li>
</ul>
</li>
<li><a href="http://www.dbms2.com/2010/04/12/greenplumchorus/" >Greenplum Chorus</a> could, in principle, work with non-Greenplum DBMS. That possibility suddenly looks a lot more realistic.</li>
<li>The list of analytic DBMS vendors with an appliance orientation is pretty impressive, including:
<ul>
<li>Oracle, with <a href="http://www.dbms2.com/2010/01/22/oracle-database-hardware-strategy/" >Exadata</a></li>
<li>Microsoft, partially</li>
<li>Teradata</li>
<li>Netezza</li>
<li>Now EMC/Greenplum, at least partially</li>
<li>Weaker players such as:
<ul>
<li>The <a href="http://www.dbms2.com/2010/06/11/kickfire-update-2/" >ailing Kickfire</a>, which a client (not Kickfire itself) tells me is being shopped around</li>
<li>The <a href="http://www.dbms2.com/2010/03/19/some-business-trends-in-the-data-warehouse-market/" >reeling HP Neoview</a></li>
<li>XtremeData, but I&#8217;m still waiting to hear of<a href="http://www.dbms2.com/2010/03/18/xtremedata-update/" > XtremeData&#8217;s first real sale</a></li>
</ul>
</li>
</ul>
</li>
<li>Greenplum is something of a specialist in <a href="http://www.dbms2.com/2009/04/30/ebays-two-enormous-data-warehouses/" >large databases</a>. EMC has to love that.</li>
<li>Greenplum&#8217;s weakness is concurrency.</li>
<li><a href="http://www.dbms2.com/2009/10/14/greenplum-hybrid-columnar/" >Greenplum&#8217;s &#8220;polymorphic storage&#8221;</a> is a good fit for a storage vendor with appliance-y ideas.</li>
<li>And finally &#8212; I think that even software-only analytic DBMS vendors should design their systems in an increasingly storage-aware manner, and have been advising my vendor clients of same. I&#8217;ll blog that line of reasoning separately when I get a chance, and edit in a link here after I do.</li>
</ul>
<p><em><strong>Related links (edit)</strong></em></p>
<ul>
<li>Here&#8217;s the promised post as to <a href="http://www.dbms2.com/2010/07/07/analytic-database-storage-aware/" >why analytic DBMS need to be ever more storage-aware</a>.</li>
<li><a href="http://www.kellblog.com/2010/07/06/emc-acquires-data-warehouse-vendor-greenplum-as-cornerstone-of-new-data-computing-product-division/" onclick="javascript:pageTracker._trackPageview('/www.kellblog.com');">Dave Kellogg crunched the EMC/Greenplum numbers</a>, coming up with an estimated valuation range of $3-400 million, the high end of which is rumored to be correct.</li>
<li>Merv Adrian suggests <a href="http://mervadrian.wordpress.com/2010/07/06/emc-buys-greenplum-big-data-realignment-continues/#more-2890" onclick="javascript:pageTracker._trackPageview('/mervadrian.wordpress.com');">the big EMC/Greenplum loser is ParAccel</a>, a viewpoint which presumably presupposes that the EMC/ParAccel partnership was significant in the first place.</li>
<li>I talked with Ben Werther and posted <a href="http://www.dbms2.com/2010/07/07/more-on-greenplum-and-emc/" >more about Greenplum and EMC</a>.</li>
</ul>
]]></content:encoded>
			<wfw:commentRss>http://www.dbms2.com/2010/07/06/emc-is-buying-greenplum/feed/</wfw:commentRss>
		<slash:comments>11</slash:comments>
		</item>
		<item>
		<title>Netezza&#8217;s silicon balance</title>
		<link>http://www.dbms2.com/2010/06/21/netezza-silicon-balance/</link>
		<comments>http://www.dbms2.com/2010/06/21/netezza-silicon-balance/#comments</comments>
		<pubDate>Mon, 21 Jun 2010 12:00:12 +0000</pubDate>
		<dc:creator>Curt Monash</dc:creator>
				<category><![CDATA[Data warehouse appliances]]></category>
		<category><![CDATA[Data warehousing]]></category>
		<category><![CDATA[Database compression]]></category>
		<category><![CDATA[Netezza]]></category>
		<category><![CDATA[Theory and architecture]]></category>

		<guid isPermaLink="false">http://www.dbms2.com/?p=2321</guid>
		<description><![CDATA[As I&#8217;ve mentioned in a couple of other posts, Netezza is stressing that the most recent wave of its technology is software-only, with no hardware upgrades made or needed. In other words, Netezza boxes already have all the silicon they need. But of course, there are really at least three major aspects to the Netezza [...]]]></description>
			<content:encoded><![CDATA[<p style="margin-bottom: 0in;">As I&#8217;ve mentioned in a couple of other posts, Netezza is stressing that the most recent <a href="http://www.dbms2.com/2010/06/21/netezza-database-software-technology-overview/" >wave</a> of its technology is software-only, with no hardware upgrades made or needed. In other words, Netezza boxes already have all the silicon they need. But of course, there are really at least three major aspects to the Netezza silicon story – FPGA (Field-Programmable Gate Array), CPU, and RAM.</p>
<ul>
<li>Netezza planned to be “generous” 	in its original TwinFin FPGA capacity, anticipating software 	upgrades like the ones it&#8217;s introducing now. It is satisfied that 	this strategy worked. More on this below.</li>
<li>The same surely applies to CPU.</li>
<li>What&#8217;s more, I get the sense that 	the CPU turned out in practice to be even more over-provisioned than 	they anticipated …</li>
<li>… at least when one just 	considers Netezza&#8217;s base NPS software.</li>
<li>However, I suspect that if the 	advanced analytics capability takes off, Netezza will determine that 	more CPU is always better.</li>
<li>And by the way, NEC is making 	versions of Netezza appliances with more advanced chips than Netezza 	is. So if anybody should really, really need more CPU in their 	Netezza boxes, there&#8217;s a very straightforward way to make that 	happen. (And if there were nontrivial demand for that, appropriate 	support plans could surely be structured.)</li>
<li>Everybody needs to be careful 	about RAM. Netezza is surely no exception.</li>
</ul>
<p style="margin-left: 0.49in; text-indent: -0.25in; margin-bottom: 0in;">
<p style="margin-bottom: 0in;">The major parts of Netezza&#8217;s FPGA software are:</p>
<ul>
<li><strong>Compress Engine 2.</strong> This is 	Netezza&#8217;s new way of doing compression.</li>
<li><strong>Compress Engine 1.</strong> This is 	Netezza&#8217;s old way of doing compression. It is being kept around so 	that existing Netezza tables don&#8217;t suddenly have to be changed or 	reloaded.</li>
<li><strong>Project Engine.</strong> Guess what 	this does.</li>
<li><strong>Restrict Engine.</strong> Ditto.</li>
<li><strong>Visibility Engine.</strong> This 	<a href="http://www.dbms2.com/2006/09/27/logless-lockless-netezza-more-carefully-explained/" >enforces ACID</a> and handles row-level security. It is “sort 	of a corner of” the Restrict Engine (Actually, Netezza seems to 	waver as to whether to describe “Restrict” and “Visibility” 	as being two engines or one.)</li>
<li>Miscellaneous plumbing.</li>
</ul>
<p style="margin-bottom: 0in;">
<p style="margin-bottom: 0in;">If I understood correctly, each Netezza FPGA has two each of the engines in parallel.</p>
<p style="margin-bottom: 0in;"><em><strong>Related link</strong></em></p>
<ul>
<li>An August, 2009 post on <a href="http://www.dbms2.com/2009/08/08/netezza-fpga/" >what Netezza does in its FPGA</a></li>
</ul>
]]></content:encoded>
			<wfw:commentRss>http://www.dbms2.com/2010/06/21/netezza-silicon-balance/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>A partial overview of Netezza database software technology</title>
		<link>http://www.dbms2.com/2010/06/21/netezza-database-software-technology-overview/</link>
		<comments>http://www.dbms2.com/2010/06/21/netezza-database-software-technology-overview/#comments</comments>
		<pubDate>Mon, 21 Jun 2010 11:57:35 +0000</pubDate>
		<dc:creator>Curt Monash</dc:creator>
				<category><![CDATA[Data warehouse appliances]]></category>
		<category><![CDATA[Data warehousing]]></category>
		<category><![CDATA[Netezza]]></category>
		<category><![CDATA[Theory and architecture]]></category>
		<category><![CDATA[Workload management]]></category>

		<guid isPermaLink="false">http://www.dbms2.com/?p=2322</guid>
		<description><![CDATA[Netezza is having its user conference Enzee Universe in Boston  Monday–Wednesday, June 21-23, and naturally will be announcing new products there, and otherwise providing hooks and inducements to get itself written about. (The preliminary count is seven press releases in all.) To get a head start, I stopped by Netezza Thursday for meetings that [...]]]></description>
			<content:encoded><![CDATA[<p style="margin-bottom: 0in;">Netezza is having its user conference Enzee Universe in Boston  Monday–Wednesday, June 21-23, and naturally will be announcing new products there, and otherwise providing <a href="http://www.dbms2.com/2010/06/21/notes-on-a-spate-of-netezza-related-blog-posts/" >hooks and inducements to get itself written about</a>. (The preliminary count is seven press releases in all.) To get a head start, I stopped by Netezza Thursday for meetings that included a 3 ½ hour session with 10 or so senior engineers, and have exchanged some clarifying emails since.  <span id="more-2322"></span></p>
<p style="margin-bottom: 0in;">It might be best to start with some Netezza product introduction and naming housekeeping:</p>
<ul>
<li>Netezza isn&#8217;t changing the 	hardware on any of its existing systems at this time. Rather, 	Netezza&#8217;s product upgrades are contained in <strong>a software-only </strong><span style="font-weight: normal;">release &#8230;</span></li>
<li>… except 	that it isn&#8217;t a “release,” but rather a <strong>“wave.”</strong> There 	are three points to that terminological distinction:
<ul>
<li>The advanced 	analytics part doesn&#8217;t depend on the new database platform software.</li>
<li>Individual 	functions in the advanced analytics part don&#8217;t necessarily depend on 	advances in the analytics platform.</li>
<li>It plays on 	the surfboard-centric naming of Netezza&#8217;s appliances. <img src='http://www.dbms2.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' /> </li>
</ul>
</li>
<li>Netezza has wisely scrapped <a href="http://www.dbms2.com/2010/02/22/netezza-twinfin/" >its 	prior plan to make its advanced-analytics capabilities be a 	chargeable add-on to it core appliance products</a>. Rather, Netezza 	is going to offer <strong>advanced analytics as part of its core product.</strong> Part of the reason is that the interest in these capabilities is 	broader than Netezza first anticipated. The name for this is is 	something like <strong>i-Class advanced analytics capabilities.</strong></li>
<li>There is a “relea<span style="font-weight: normal;">se” 	in all this too, namely </span><strong>NPS 6.0</strong><span style="font-weight: normal;"> (Netezza Performance Software). That&#8217;s the core DBMS technology. </span></li>
<li><span style="font-weight: normal;">It&#8217;s 	all to be shipped in Q3.</span></li>
</ul>
<p style="margin-bottom: 0in; font-weight: normal;">Highlights of our NPS 6.0 conversation include:</p>
<ul>
<li>As promised, Netezza has improved 	its <strong>compression</strong> significantly. Because this was anticipated, 	this upgrade was planned for in the design of <a href="http://www.dbms2.com/2009/07/30/netezza-new-product-family/" >the systems Netezza 	started introducing last summer</a>. Consequently, the reduction in 	I/O produced by compression translates almost directly into better 	performance – the silicon is now more fully loaded than it was 	before, but few if any actual silicon bottlenecks have been 	introduced by the I/O improvement.</li>
<li>Netezza&#8217;s other big performance 	enhancement is the introduction of <strong>clustered base tables, </strong><span style="font-weight: normal;">which 	it says can reduce I/O by an order of magnitude or better.</span></li>
<li>Netezza says that there are 	individual queries in which the enhancements take query performance 	up 30-40X. (Presumably, those would be ones for which clustered base 	tables are a big win.)</li>
<li>More interestingly, Netezza says 	that <strong>overall performance is improved by &gt;2X.</strong> That&#8217;s 	queries, load, backup, and everything else all blended together.</li>
<li>Underpinning 	all this, Netezza went from 125 MHz to a blend of 125 and 250 MHz in 	its FPGA clock speeds. Also, the width of the FPGA onboard data path 	went from 16 to 32 bits. Netezza suggests that the naive calculation 	which says this could increase FPGA throughput 4X isn&#8217;t entirely 	misleading.</li>
<li>Netezza is 	pretty content with its <strong>workload management</strong> capabilities for 	queries, but nonetheless keeps adding features. <a href="http://www.dbms2.com/2009/07/18/netezza-on-concurrency-and-workload-management/" >Workload management</a> has not yet been extended to cover all the non-query parts of the 	analytic functionality.</li>
<li>Netezza 	continues to enhance its <strong>cost-based optimizer and query planner.</strong></li>
<li><span style="font-weight: normal;">Netezza 	has long used an </span><strong>internal networking</strong><span style="font-weight: normal;"> approach that&#8217;s rather different from TCP/IP. Netezza views TCP/IP&#8217;s 	strength as recovering gracefully if there&#8217;s congestion. However, 	Netezza would rather do whatever it takes to preclude congestion in 	the first place, except perhaps in rare edge cases. I&#8217;m not aware of 	what enhancements, if any, have been made to Netezza&#8217;s internal 	networking specifically in NPS 6.0.</span></li>
</ul>
<p style="margin-bottom: 0in;"><span style="font-style: normal;">The basic idea of clustered base tables (“base tables” are ones that are not, for example, materialized views) is to </span><span style="font-style: normal;"><strong>range partition in multiple dimensions at once.</strong></span><span style="font-style: normal;"> Then you rule out (as in don&#8217;t retrieve) all those blocks that fail a match in any one of the cluster dimensions.</span><span style="font-style: normal;"><span style="font-weight: normal;"> Netezza says its customers were doing a lot of work to simulate this benefit by multiple sorts; Netezza&#8217;s implementation will now handle that much more automatically. Netezza says that talking to customers revealed that 4-5 cluster dimensions was almost always the most somebody would need; they will ship support for 4. That makes sense. In most cases, you&#8217;d want to cluster on the answers to “W” questions – Who, What, Where, When (but probably not Why), in one dimension each. However, Netezza does call out as an ideal use case geospatial, precisely because 2 (or more rarely 3) dimensions each have “equal weight.”</span></span></p>
<p style="margin-bottom: 0in;"><span style="font-style: normal;"><span style="font-weight: normal;">I don&#8217;t know how other vendors implement clustered base tables, but in Netezza&#8217;s case it&#8217;s via a </span></span><span style="font-style: normal;"><strong>space-filling curve.</strong></span><span style="font-style: normal;"><span style="font-weight: normal;"> (Actually, they called it a “Hilbert space-filling curve,” but I oppose that phrasing, as it&#8217;s apt to lead to extremely incorrect use of the term “Hilbert space.”) I.e., data is mapped to 4-tuples (say) in line with the dimensions, which are then sorted in a linear order in line with a space-filling curve. Happily, Netezza hasn&#8217;t experienced problems clustering columns that have particularly challenging cardinality or skew.</span></span></p>
<p style="margin-bottom: 0in;">If I understood correctly, you can only zone map (and presumably cluster) on integers and dates right now, but that will change soon. <em>(Edit: In blog comments and email, Tim Greenwood of Netezza explained to me that the NPS 6.0 workarounds to that were much more robust than I realized.)</em></p>
<p style="margin-bottom: 0in;">Netezza put a lot of work for NPS 6 into something it calls <strong>“table grooming,”</strong> which amounts to recopying tables in more beneficial form. Uses for table grooming – which is a manually initiated process – include but probably aren&#8217;t limited to:</p>
<ul>
<li>Clustering tables and, as needed, 	reclustering them.</li>
<li>Getting rid of data that was 	deleted. (Netezza has Postgres-style multiversion concurrency 	control – MVCC – but no time-travel, so keeping around deleted 	data is a waste of space.)</li>
<li>Recompressing data from 	Compress Engine 1 to Compress Engine 2.</li>
<li>Alter Table</li>
</ul>
<p style="margin-bottom: 0in;">The core ideas of table grooming include:</p>
<ul>
<li>The Netezza NPS software copies 	rows from one place to another.</li>
<li>Netezza NPS then updates the 	appropriate metadata.</li>
<li>Metadata updates are 	transactional, even though the actual data movement is not.</li>
</ul>
<p style="margin-bottom: 0in;">This can be done part of a table at a time. Reads and loads are unaffected by the process, or at least not blocked. Delete commits are indeed blocked during a reorg, but Netezza guesses that the blocks hold for a few minutes during the grooming of a clustered base table, 10-15 seconds if space is being reclaimed, and something similar for an Alter Table.</p>
<p style="margin-bottom: 0in;">And finally, here are some notes on Netezza&#8217;s query optimization and planning.</p>
<ul>
<li>Netezza has a traditional 	<strong>cost-based optimizer,</strong> in which all operations have estimated 	costs, measured in microseconds, irrespective of which parts of the 	system (CPU, I/O, network, whatever) they most stress. (I have 	trouble imagining how a cost-based optimizer could work differently 	from that without incurring huge computational costs.)</li>
<li><a href="http://www.dbms2.com/2010/06/21/netezza-silicon-balance/" >Netezza&#8217;s bottleneck is almost 	always disk I/O</a>.</li>
<li>Netezza&#8217;s optimizer is not/no 	longer based on the PostgreSQL optimizer.</li>
<li>Netezza does a lot of <strong>query 	transformation.</strong> Key points include:
<ul>
<li>Netezza joins are usually very 	cheap.</li>
<li>Filtered scans are cheap too.</li>
<li>More expensive in Netezza are data 	redistribution (duh), sorts, and unfiltered scans.</li>
<li>Most expensive of all are 	intermediate result sets that don&#8217;t fit into memory.</li>
</ul>
</li>
<li>Specific examples of Netezza query 	transformation include:
<ul>
<li>Pushing predicates out to nodes.&#8217;</li>
<li>Flattening query trees and 	eliminating subqueries.</li>
<li>Rewriting windowed aggregates to 	be joins + grouped aggregates.</li>
<li>(New in 6.0) Transforming outer 	joins into other kinds.</li>
</ul>
</li>
<li>Netezza does real-time sampling to 	help with query planning. (But this is only worth doing for queries 	that are estimated to be expensive.) Zone maps (and clustering too?) 	are invoked as part of deciding where to sample. Sampling was for 	scans only prior to NPS 6.0, and will now be done for joins as well.</li>
</ul>
<p style="margin-bottom: 0in;"><em><strong>Related links</strong></em></p>
<ul>
<li><a href="http://www.dbms2.com/2010/06/21/notes-on-a-spate-of-netezza-related-blog-posts/" >Notes on this week&#8217;s spate of Netezza-related blog posts</a></li>
<li><a href="http://www.dbms2.com/2010/06/21/netezza-ibm-db2-compression/" >How Netezza (and IBM) do database compression</a></li>
<li><a href="http://www.dbms2.com/2010/06/21/netezza-silicon-balance/" >Netezza&#8217;s silicon balance</a></li>
</ul>
]]></content:encoded>
			<wfw:commentRss>http://www.dbms2.com/2010/06/21/netezza-database-software-technology-overview/feed/</wfw:commentRss>
		<slash:comments>13</slash:comments>
		</item>
		<item>
		<title>Notes on a spate of Netezza-related blog posts</title>
		<link>http://www.dbms2.com/2010/06/21/notes-on-a-spate-of-netezza-related-blog-posts/</link>
		<comments>http://www.dbms2.com/2010/06/21/notes-on-a-spate-of-netezza-related-blog-posts/#comments</comments>
		<pubDate>Mon, 21 Jun 2010 11:55:33 +0000</pubDate>
		<dc:creator>Curt Monash</dc:creator>
				<category><![CDATA[Data warehouse appliances]]></category>
		<category><![CDATA[Data warehousing]]></category>
		<category><![CDATA[Netezza]]></category>
		<category><![CDATA[Presentations]]></category>

		<guid isPermaLink="false">http://www.dbms2.com/?p=2318</guid>
		<description><![CDATA[Fearing that last year&#8217;s tight travel budgets would hamper attendance, Netezza – like a number of other vendors – decided to forgo a traditional user conference. Instead, it took its Enzee Universe show on the road, essentially spreading the conference across eight cities. I was asked to keynote six of the installments.
After the first one, [...]]]></description>
			<content:encoded><![CDATA[<p style="margin-bottom: 0in;">Fearing that last year&#8217;s tight travel budgets would hamper attendance, Netezza – like a number of other vendors – decided to forgo a traditional user conference. Instead, it took its Enzee Universe show on the road, essentially spreading the conference across eight cities. I was asked to <a href="http://www.dbms2.com/2009/07/30/netezza-enzee-universe/" >keynote</a> six of the installments.</p>
<p style="margin-bottom: 0in;">After the first one, Netezza Marketing VP Tim Young took me aside for two pieces of constructive criticism. The surprising one* was that he felt I had been INSUFFICIENTLY critical of Netezza. Since then, every other conversation we&#8217;ve had about content creation has also featured ringing reassurances that Tim truly wants independent, non-pandering work.</p>
<p style="margin-bottom: 0in;"><em>*The unsurprising one was that I&#8217;d rushed. Well, duh. After months of telling me I had a 1 hour slot, Netezza cut me to ½ hour a few days beforehand. And my talk had been designed to be high-speed even in the longer time slot … </em></p>
<p style="margin-bottom: 0in;">As a result, I accepted a subsequent gig from Netezza that I would barely consider from most other vendors. Namely, for this year&#8217;s Enzee Universe – <a href="http://www.netezza.com/userconference/agenda.html" onclick="javascript:pageTracker._trackPageview('/www.netezza.com');">June 21-23, aka Monday-Wednesday of this week, at the Westin Waterfront Hotel in Boston</a> – I would do some contemporaneous blogging. The parameters we agreed on included:  <span id="more-2318"></span></p>
<ul>
<li>I would just blog here on <a href="http://www.dbms2.com" >DBMS2</a>, with 	Netezza allowed to reuse posts in their entirety on its site(s).</li>
<li>I also would give a talk on the 	conference&#8217;s last day.</li>
<li>I wouldn&#8217;t say much about 	conference sessions, because:
<ul>
<li>I&#8217;m not a session-attending kind 	of guy. (I wasn&#8217;t particularly good at sitting still in class in 8<sup>th</sup> grade. I haven&#8217;t gotten much better since. And <a href="http://www.strategicmessaging.com/powerpoints/2008/02/02/" onclick="javascript:pageTracker._trackPageview('/www.strategicmessaging.com');">I have a huge 	aversion to other people&#8217;s uninterruptible PowerPoints</a>.)</li>
<li>I think Netezza&#8217;s sessions are 	just as hype-filled as anybody else&#8217;s. (Much as I enjoyed traveling 	around the world with Netezza last year, it was painful hearing Jim 	Baum claim in city after city that Netezza boasts a 50X performance 	advantage vs. the competition.)</li>
</ul>
</li>
<li>Rather, I&#8217;d base things much more 	on individual conversations and meetings.</li>
<li>Because I didn&#8217;t see how 	turnaround time could work otherwise, we&#8217;d have some of those 	meetings beforehand, and others early in the conference.</li>
</ul>
<p style="margin-bottom: 0in;">That last bit didn&#8217;t exactly wholly work out; for the second consecutive year Netezza pulled a surprise schedule switch a few days beforehand. But:</p>
<ul>
<li>I did have extensive, fascinating 	meetings at Netezza&#8217;s offices on Thursday, which were the fodder for 	multiple posts going up today.</li>
<li>I have a nice meeting schedule set 	up for Tuesday.</li>
<li>There should be plenty of 	opportunity for hallway and exhibit-floor conversation as the 	conference progresses.</li>
<li>I even have my own private 	conference room, with a lovely name (the “Paine Room”).</li>
</ul>
<p style="margin-bottom: 0in;">So far as I know, the rest of the plan is still operative.</p>
<p style="margin-bottom: 0in;">Posts already written as I draft this one include:</p>
<ul>
<li><a href="http://www.dbms2.com/2010/06/21/netezza-database-software-technology-overview/" >A long discussion of Netezza&#8217;s 	technology, focusing on the database parts</a></li>
<li><a href="http://www.dbms2.com/2010/06/21/netezza-ibm-db2-compression/" >A discussion of Netezza&#8217;s and 	IBM&#8217;s compression strategies</a></li>
<li><a href="http://www.dbms2.com/2010/06/21/netezza-silicon-balance/" >Notes on how Netezza balances 	its silicon and uses its FPGAs</a></li>
<li><a href="http://www.dbms2.com/2010/06/21/data-warehouse-load-latency/" >A quickie on data warehouse 	loading latency</a></li>
</ul>
<p style="margin-bottom: 0in;">I still need to write one focusing on Netezza&#8217;s advanced analytics strategy, and plan to edit in a link to it when it&#8217;s up.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.dbms2.com/2010/06/21/notes-on-a-spate-of-netezza-related-blog-posts/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
	</channel>
</rss>
