<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>DBMS2 -- DataBase Management System Services &#187; Netezza</title>
	<atom:link href="http://www.dbms2.com/category/products-and-vendors/netezza/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.dbms2.com</link>
	<description>Choices in data management and analysis</description>
	<lastBuildDate>Tue, 16 Mar 2010 17:52:48 +0000</lastBuildDate>
	<generator>http://wordpress.org/?v=2.8.4</generator>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
			<item>
		<title>February 2010 data warehouse DBMS news roundup</title>
		<link>http://www.dbms2.com/2010/02/22/data-warehouse-dbms-news-roundup/</link>
		<comments>http://www.dbms2.com/2010/02/22/data-warehouse-dbms-news-roundup/#comments</comments>
		<pubDate>Mon, 22 Feb 2010 08:30:23 +0000</pubDate>
		<dc:creator>Curt Monash</dc:creator>
				<category><![CDATA[Analytic technologies]]></category>
		<category><![CDATA[Aster Data]]></category>
		<category><![CDATA[Data warehousing]]></category>
		<category><![CDATA[Netezza]]></category>
		<category><![CDATA[Teradata]]></category>
		<category><![CDATA[Vertica Systems]]></category>

		<guid isPermaLink="false">http://www.dbms2.com/?p=1628</guid>
		<description><![CDATA[February is usually a busy month for data warehouse DBMS product releases, product announcements, and other real or contrived data warehouse DBMS news, and it can get pretty confusing trying to keep those categories of “news” apart.*  This year is no exception, although several vendors – including Teradata and Netezza – are taking “rolling thunder” [...]]]></description>
			<content:encoded><![CDATA[<p>February is usually a busy month for data warehouse DBMS product releases, product announcements, and other real or contrived data warehouse DBMS news, and it can get pretty confusing trying to keep those categories of “news” apart.*  This year is no exception, although several vendors – including Teradata and Netezza – are taking “rolling thunder” approaches, doing some of their announcements this month while holding others back for March or April.</p>
<p><em>*I probably have it worse than most people in that regard, because my clients run tentative feature lists and announcement schedules by me well in advance, which may get changed multiple times before the final dates roll around. I also occasionally miss some detail, if it wasn&#8217;t in a pre-briefing but gets added at the end.</em></p>
<p>Anyhow, the three big themes of this month&#8217;s announcements are probably:</p>
<ul>
<li><strong>Integrating different kinds of analytic processing into databases and DBMS. </strong></li>
<li><strong>Taking advantage of hardware advances.</strong></li>
<li><strong>Playing catchup</strong> in areas where small vendors&#8217; products weren&#8217;t mature yet.</li>
</ul>
<p><span id="more-1628"></span>For example, the three biggest data warehouse DBMS product announcements this month are probably:</p>
<ul>
<li><strong>Aster Data nCluster 4.5.</strong> Much like Aster&#8217;s prior release &#8212; <a href="../../../../../2009/10/30/aster-data-application-server-ncluster/">Aster Data nCluster 4.0</a> – <a href="http://www.dbms2.com/2010/02/22/aster-data-ncluster-4-5/" >Aster Data nCluster 4.5</a> has a major focus on integrating analytics and database processing. This time, the emphasis is on application development tools and pre-built analytic packages. In addition, Aster&#8217;s management tool GUIs have been upgraded, building on catch-up functionality in the Aster Data nCluster 4.0.</li>
<li><strong>Netezza&#8217;s “i” add-on to its existing TwinFin products.</strong> With <a href="../../../../../2010/02/22/netezza-twinfin/">Netezza TwinFin(i)</a>, Netezza becomes the second MPP RDBMS vendor with a comprehensive “Big Data Analytic Platform” kind of strategy. (Netezza would surely argue that it was the first, but that depends on how seriously one took <a href="../../../../../2007/09/27/the-netezza-developer-network/">Netezza&#8217;s prior attempt</a>.) Many of the details are different from Aster&#8217;s, of course, but the general philosophy is similar. So far, Netezza has announced one interesting proprietary library of analytic packages (for linear/matrix algebra), plus the port of 4,000 or so functions in open source libraries.</li>
<li><strong>Vertica 4.0.</strong> Vertica has had a highly innovative columnar DBMS architecture from the getgo, but at the cost of some restrictions or awkwardness in the relationship between data layout and SQL processing. Vertica says that <a href="../../../../../2010/02/22/vertica-4/">Vertica 4.0</a> fixes all that. In addition, it has some analytic processing enhancements, especially in the time series area, where Vertica doesn&#8217;t vigorously dispute that Sybase IQ previously had an advantage.</li>
</ul>
<p>In addition,</p>
<ul>
<li><strong>Teradata is announcing its Data Warehouse Appliance 2580, the successor to the Teradata 2550.</strong> This is purely a hardware refresh; Teradata&#8217;s hardware and software upgrades are not generally synced. The Teradata 2580 upgrades CPUs from Harpertown to Nehalem, includes 3X the RAM of its predecessor, and offers an option for 1 TB disks (thus lowering the bottom price/TB a lot, to $31K list).</li>
<li>Aster, Vertica, and ParAccel have all called attention to the fact that, if solid-state drives have interfaces like those of disk drives, and if a DBMS supports disk drives, then a DBMS also supports solid-state drives as well. At least Aster and ParAccel have signaled that they have at least one customer or prospect each interested in Fusion I/O&#8217;s solid-state technology, especially in the retail sector. This is basically a hardware matter as well, and a big deal only for those who were somehow unaware of <a href="../../../../../2010/01/31/flash-pcmsolid-state-memory-disk/">the impending dominance of solid-state memory technology</a>.</li>
<li>Sybase announced its <a href="../../../../../2010/02/05/sybase-aleri-rap/">Aleri</a> acquisition earlier this month.</li>
<li>Various vendors have bragged about various rankings, awards, or benchmarks, or – sometimes less tediously &#8212; about last year&#8217;s sales results.</li>
</ul>
]]></content:encoded>
			<wfw:commentRss>http://www.dbms2.com/2010/02/22/data-warehouse-dbms-news-roundup/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>TwinFin(i) – Netezza&#8217;s version of a parallel analytic platform</title>
		<link>http://www.dbms2.com/2010/02/22/netezza-twinfin/</link>
		<comments>http://www.dbms2.com/2010/02/22/netezza-twinfin/#comments</comments>
		<pubDate>Mon, 22 Feb 2010 08:21:13 +0000</pubDate>
		<dc:creator>Curt Monash</dc:creator>
				<category><![CDATA[Analytic technologies]]></category>
		<category><![CDATA[Aster Data]]></category>
		<category><![CDATA[Data warehouse appliances]]></category>
		<category><![CDATA[Data warehousing]]></category>
		<category><![CDATA[Hadoop]]></category>
		<category><![CDATA[MapReduce]]></category>
		<category><![CDATA[Netezza]]></category>
		<category><![CDATA[SAS Institute]]></category>
		<category><![CDATA[Teradata]]></category>

		<guid isPermaLink="false">http://www.dbms2.com/?p=1613</guid>
		<description><![CDATA[Much like Aster Data did in Aster 4.0 and now Aster 4.5, Netezza is announcing a general parallel big data analytic platform strategy. It is called Netezza TwinFin(i), it is a chargeable option for the Netezza TwinFin appliance, and many announced details are on the vague side, with Netezza promising more clarity at or before [...]]]></description>
			<content:encoded><![CDATA[<p>Much like Aster Data did in <a href="http://www.dbms2.com/2009/10/30/aster-data-application-server-ncluster/" >Aster 4.0</a> and now <a href="http://www.dbms2.com/2010/02/22/aster-data-ncluster-4-5/" >Aster 4.5</a>, Netezza is announcing a general parallel big data analytic platform strategy. It is called Netezza TwinFin(i), it is a chargeable option for the <a href="http://www.dbms2.com/2009/07/30/netezza-new-product-family/" >Netezza TwinFin</a> appliance, and many announced details are on the vague side, with Netezza promising more clarity at or before its Enzee Universe conference in June. At a high level, the Aster and Netezza approaches compare/contrast as follows:<span id="more-1613"></span></p>
<ul>
<li>Netezza&#8217;s software runs on well-designed proprietary hardware. Aster runs on hardware that&#8217;s more off-the-shelf.</li>
<li>Aster was first to ship, and will also be first to ship an IDE (Integrated Development Environment).</li>
<li>MapReduce is central to Aster&#8217;s approach. Netezza TwinFin(i) supports MapReduce too, specifically a Hadoop implementation, but I don&#8217;t get the sense that everything Netezza does is built on MapReduce underpinnings.</li>
<li>Both Aster and Netezza try to provide rich functionality for creating in-memory data structures parallel analytic programs can use. Both seem to let you escape from the pure relational-table paradigm more easily than, say, Teradata&#8217;s new persistent memory capabilities do.</li>
<li>Aster and Netezza have made different choices about what kinds of prebuilt analytic packages to offer. Netezza could actually leapfrog Aster in this regard, but let&#8217;s see where each vendor is by, say, mid-year. If you care about the details of built-in analytic functions, you really should consider executing non-disclosure agreements with both those companies.</li>
<li>Both Aster and Netezza stress that you can run analytic functions out-of-process, greatly reducing the chance that they crash the database. Netezza and I&#8217;m pretty sure also Aster also retain the option of running in-process, which provides maximum performance. (In Netezza&#8217;s case C++ is the only in-process language supported, and I think Aster has a similar limitation.)</li>
<li>Like Aster, Netezza is integrating SQL queries and other analytic processing under the same workload management rubric.</li>
<li>Much like Aster, Netezza is tap-dancing by implying much richer forthcoming SAS support than anything currently announced. (The crunch-per-paragraph ratio in either vendor&#8217;s SAS-related press releases to date is distressingly low.)</li>
</ul>
<p>More specifically, here are some highlights of what I know, am guessing, and/or am allowed to say about Netezza TwinFin(i) at this time.</p>
<ul>
<li>The foundation for the analytic add-ons in Netezza TwinFin(i) is some sort of low-level “analytic executables.” Not understanding exactly what these are is my biggest area of confusion in the whole TwinFin(i) stack. Are they all C++, with everything translated into same? Is there Java all the way down as an alternative? (E.g., Hadoop is written in Java.) Anyhow, whatever it is, it&#8217;s surely a big improvement on <a href="../../../../../2007/09/27/the-netezza-developer-network/">Netezza&#8217;s prior Verilog-based generation of analytic extensibility technology</a>.</li>
<li>The announced list of languages supported in Netezza TwinFin(i) is Java, Python, Fortran, R, and C/C++. More are coming.</li>
<li>Netezza has named a lot of analytic functions it is adding, and hinting about more to come. It has named <a href="http://cran.r-project.org/" onclick="javascript:pageTracker._trackPageview('/cran.r-project.org');">CRAN/R</a> and GNU libraries, saying those have 1900 or more functions each. Netezza has also built its own linear algebra library for TwinFin(i), called nzMatrix. And as previously noted, TwinFin(i) also boasts a Hadoop implementation.</li>
<li>I haven&#8217;t heard about much in the way of TwinFin(i)-specific IDE support.</li>
<li>I don&#8217;t really have details as to what kinds of in-memory data structures Netezza TwinFin(i) does or doesn&#8217;t support.</li>
</ul>
]]></content:encoded>
			<wfw:commentRss>http://www.dbms2.com/2010/02/22/netezza-twinfin/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>Intelligent Enterprise’s Editors’/Editor’s Choice list for 2010</title>
		<link>http://www.dbms2.com/2010/02/11/intelligent-enterprise-editors-choice-201/</link>
		<comments>http://www.dbms2.com/2010/02/11/intelligent-enterprise-editors-choice-201/#comments</comments>
		<pubDate>Thu, 11 Feb 2010 23:13:42 +0000</pubDate>
		<dc:creator>Curt Monash</dc:creator>
				<category><![CDATA[Analytic technologies]]></category>
		<category><![CDATA[Aster Data]]></category>
		<category><![CDATA[Business intelligence]]></category>
		<category><![CDATA[Cloudera]]></category>
		<category><![CDATA[Data warehousing]]></category>
		<category><![CDATA[Greenplum]]></category>
		<category><![CDATA[HP and Neoview]]></category>
		<category><![CDATA[IBM and DB2]]></category>
		<category><![CDATA[Infobright]]></category>
		<category><![CDATA[Ingres]]></category>
		<category><![CDATA[Intersystems and Cache']]></category>
		<category><![CDATA[Jaspersoft]]></category>
		<category><![CDATA[Kalido]]></category>
		<category><![CDATA[Mark Logic]]></category>
		<category><![CDATA[Microsoft and SQL*Server]]></category>
		<category><![CDATA[Netezza]]></category>
		<category><![CDATA[Open source]]></category>
		<category><![CDATA[Oracle]]></category>
		<category><![CDATA[Pentaho]]></category>
		<category><![CDATA[QlikTech and QlikView]]></category>
		<category><![CDATA[SAP AG]]></category>
		<category><![CDATA[Tableau Software]]></category>
		<category><![CDATA[Talend]]></category>
		<category><![CDATA[Teradata]]></category>
		<category><![CDATA[Vertica Systems]]></category>

		<guid isPermaLink="false">http://www.dbms2.com/?p=1578</guid>
		<description><![CDATA[As he has before, Intelligent Enterprise Editor Doug Henschen

Personally selected annual lists of 12 &#8220;Most influential&#8221; companies and 36 &#8220;Companies to watch&#8221; in analytics- and database-related sectors.
Made it clear that these are his personal selections.
Nonetheless has called it an Editors&#8217; Choice list, rather than Editor&#8217;s Choice.  

(Actually, he&#8217;s really called it an &#8220;award.&#8221;)
People advising [...]]]></description>
			<content:encoded><![CDATA[<p>As he has <a href="http://www.dbms2.com/2009/01/12/intelligent-enterprises-editorseditors-choice-list/" >before</a>, <em>Intelligent Enterprise</em> Editor Doug Henschen</p>
<ul>
<li>Personally selected <a href="http://intelligent-enterprise.informationweek.com/showArticle.jhtml;jsessionid=IANLOXCT2244BQE1GHPCKH4ATMY32JVN?articleID=222900034&amp;pgno=1" onclick="javascript:pageTracker._trackPageview('/intelligent-enterprise.informationweek.com');">annual lists</a> of 12 &#8220;Most influential&#8221; companies and 36 &#8220;Companies to watch&#8221; in analytics- and database-related sectors.</li>
<li>Made it clear that these are his personal selections.</li>
<li>Nonetheless has called it an Editors&#8217; Choice list, rather than Editor&#8217;s Choice. <img src='http://www.dbms2.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' /> </li>
</ul>
<p>(Actually, he&#8217;s really called it an &#8220;award.&#8221;)</p>
<p><span id="more-1578"></span>People advising Doug &#8212; who come to think of it actually are Contributing Editors to <em>Intelligent Enterprise</em> or something like that &#8212; included Cindi Howson, Seth Grimes, three others, and me.</p>
<p>And if past is prologue, I will now get a flood of PR emails calling my attention to this award that I already have both participated in and blogged about. <img src='http://www.dbms2.com/wp-includes/images/smilies/icon_wink.gif' alt=';)' class='wp-smiley' /> </p>
<p>As usual, the sense:nonsense ratio on these lists was pleasingly high. Analytic DBMS vendors cited included IBM, Microsoft, Netezza, Oracle, Sybase, and Teradata in the &#8220;Most influential&#8221; group, with Aster, Greenplum, HP, Infobright, and Vertica among the &#8220;To watch&#8221; crowd. It&#8217;s tough to argue with those selections, whose most questionable element is probably the not-ridiculous supposition that HP could do something interesting over the coming year. Cloudera and Intersystems also made the list, deservedly.</p>
<p>All three of QlikTech, Tableau, and TIBCO made the list, which is appropriate given the potential for and interest in interactive data exploration technology.  The BI majors, independent or otherwise, were all on as well. In text mining, Doug included Attensity and Clarabridge, which I think is exactly right. (Plus OpenCalais.)  Upon reflection, I probably should have nominated Mark Logic, even though most of its business is non-enterprise; but hey, nobody&#8217;s perfect, and the same goes for lists. Open source was well represented, with Apache, Actuate, Jaspersoft, Eclipse, Infobright, Nuxeo and R all being cited (but not Ingres or Pentaho). Kalido made the list, with my endorsement, their silly I-CASE like marketing messaging notwithstanding.</p>
<p>Speaking of imperfections &#8212; there only are a few category names, and so category assignments can be pretty bizarre. (In an ideal world, middleware wouldn&#8217;t be included under &#8220;enterprise applications&#8221;.) Greenplum hasn&#8217;t really &#8220;extended&#8221; its DBMS with a &#8220;cloud&#8221; option. As much as I&#8217;d like Netezza to be more influential than SAP, that&#8217;s probably not the best way to rank them. And there are a number of &#8220;This company is on a roll!&#8221; kinds of comments that I wouldn&#8217;t necessarily endorse.</p>
<p>But those are all nitpicks. On the whole, it&#8217;s another nice job.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.dbms2.com/2010/02/11/intelligent-enterprise-editors-choice-201/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>Comments on the Gartner 2009/2010 Data Warehouse Database Management System Magic Quadrant</title>
		<link>http://www.dbms2.com/2010/02/10/gartner-magic-quadrant-data-warehouse-2009-2010/</link>
		<comments>http://www.dbms2.com/2010/02/10/gartner-magic-quadrant-data-warehouse-2009-2010/#comments</comments>
		<pubDate>Wed, 10 Feb 2010 23:28:39 +0000</pubDate>
		<dc:creator>Curt Monash</dc:creator>
				<category><![CDATA[Analytic technologies]]></category>
		<category><![CDATA[Aster Data]]></category>
		<category><![CDATA[Data warehouse appliances]]></category>
		<category><![CDATA[Data warehousing]]></category>
		<category><![CDATA[Exadata]]></category>
		<category><![CDATA[Greenplum]]></category>
		<category><![CDATA[HP and Neoview]]></category>
		<category><![CDATA[IBM and DB2]]></category>
		<category><![CDATA[Infobright]]></category>
		<category><![CDATA[Ingres]]></category>
		<category><![CDATA[Market share]]></category>
		<category><![CDATA[Netezza]]></category>
		<category><![CDATA[Open source]]></category>
		<category><![CDATA[Oracle]]></category>
		<category><![CDATA[Pricing]]></category>
		<category><![CDATA[Sybase]]></category>
		<category><![CDATA[Teradata]]></category>
		<category><![CDATA[illuminate Solutions]]></category>

		<guid isPermaLink="false">http://www.dbms2.com/?p=1553</guid>
		<description><![CDATA[At intervals of little over a year, Gartner Group publishes a Data Warehouse Database Management System Magic Quadrant. Gartner&#8217;s 2009 data warehouse DBMS Magic Quadrant &#8212; actually, January 2010 &#8212; is now out.* For many reasons, including those I noted in my comments on Gartner&#8217;s 2008 Data Warehouse DBMS Magic Quadrant, the Gartner quadrant pictures [...]]]></description>
			<content:encoded><![CDATA[<p>At intervals of little over a year, Gartner Group publishes a Data Warehouse Database Management System Magic Quadrant. <a href="http://www.gartner.com/technology/media-products/reprints/greenplum/173535.html" onclick="javascript:pageTracker._trackPageview('/www.gartner.com');">Gartner&#8217;s 2009 data warehouse DBMS Magic Quadrant</a> &#8212; actually, January 2010 &#8212; is now out.* For many reasons, including those I noted in <a href="http://www.dbms2.com/2009/01/12/gartners-2008-data-warehouse-database-management-system-magic-quadrant-is-out/" >my comments on Gartner&#8217;s 2008 Data Warehouse DBMS Magic Quadrant</a>, the Gartner quadrant pictures are a bad use of good research. Rather than rehash that this year, I&#8217;ll merely call out some points in the surrounding commentary that I find interesting or just plain strange.<span id="more-1553"></span></p>
<p><em>*Links to Gartner Magic Quadrants commonly break, but that one worked at the time of this posting.</em></p>
<ul>
<li>Gartner thinks that data warehouse appliances are on the rise, due to their simplicity.</li>
<li>Gartner correctly says that <a href="http://www.softwarememories.com/2008/09/15/database-machines/" onclick="javascript:pageTracker._trackPageview('/www.softwarememories.com');">Teradata has been a data warehouse appliance vendor from the getgo</a>.</li>
<li>Gartner characterizes IBM as being an appliance vendor as well.</li>
<li>Gartner suggests that HP is having trouble living up to its technical promises for Neoview.</li>
<li>Gartner further suggests &#8212; no surprise here &#8212; that HP Neoview has had very few new customers past its initial wave.</li>
<li>Gartner notes IBM&#8217;s difficulties in selling data warehouse installations of DB2, despite what on paper is great-sounding technology.</li>
<li>Gartner says &#8212; also no surprise &#8212; that illuminate &#8220;has seen little success in North America since opening its first office in the U.S. over two years ago.&#8221;</li>
<li>Ingres has evidently gotten a few BI-centric &#8220;appliance&#8221; deals, e.g. with Jaspersoft. But basically Ingres isn&#8217;t doing well in data warehousing.</li>
<li>Gartner does say Ingres has &#8220;the strongest open-source DBMS offering for data warehousing.&#8221; Being very literal about &#8220;open source,&#8221; that&#8217;s a defensible claim &#8212; but it&#8217;s pretty irrelevant in a world where <a href="http://www.dbms2.com/2009/10/19/greenplum-free-single-node-edition/" >Greenplum Single-Node Edition</a> can be had for free. It also waves away all the data mart use cases in which Infobright Community Edition shines.</li>
<li>Gartner says that Netezza is working out as a &#8220;complex workload&#8221; enterprise data warehouse provider, according to reference checks, in addition to its established success in data mart scenarios.</li>
<li>Gartner says Oracle&#8217;s offering has finally become &#8220;accepted&#8221; in the market for databases &gt;50 TB. I guess I can live with that fairly weak claim, but <a href="http://www.dbms2.com/2009/09/19/oracle-database-siz/" >I wouldn&#8217;t go much further than that</a>.</li>
<li>Gartner asserts that, unlike software-only Oracle, Oracle Exadata isn&#8217;t significantly harder to administer than &#8220;other mixed OLTP/OLAP DBMS vendors,&#8221; because Exadata is fast enough you don&#8217;t need to jump through all those hoops any more to get tolerable performance. The money quote is &#8220;one reference reported reducing the number of indexes by a factor of 100 to fewer than five.&#8221; Note, however, that Gartner does not seem to assert that Exadata&#8217;s ease of use rivals that of the newer analytic DBMS specialists.</li>
<li>Gartner confirms <a href="http://www.dbms2.com/2009/02/01/oracle-says-they-do-onsite-exadata-pocs-after-all/" >Oracle&#8217;s reluctance to do onsite Exadata POCs</a>, but says it is not absolute. This is roughly compatible with what I&#8217;m hearing elsewhere, and indeed with Oracle own claims to be ramping up availability of Exadata POC hardware.</li>
<li>Gartner&#8217;s criteria for inclusion include at least 10 different organizations having a product &#8220;in production.&#8221; Thus, the big surprise was ParAccel being included. The money quote there is &#8220;With approximately 20 customers in the pharmaceutical, retail, financial and media/advertising analytics sectors, ParAccel has a good reference base.&#8221; That assessment is difficult to reconcile with other information, but I&#8217;ve been told Gartner is sticking to its guns. That assessment would be even harder to believe if those 20 references were all alleged to be true production customers.</li>
<li>Gartner notes that you basically can&#8217;t run a 1 TB+ MySQL data warehouse without sharding. (Of course, Infobright has an alternative, and up to a small number of terabytes so does Kickfire.)</li>
<li>Gartner reports that at least some customers are pleased with Sybase IQ&#8217;s mixed workload/enterprise data warehouse capabilities.</li>
<li>Gartner correctly notes that <a href="http://www.dbms2.com/2009/10/05/oracle-exadata-2-capacity-pricing/" >Oracle Exadata is a price-competition challenge for Teradata</a>.</li>
<li>Gartner notes that 20% of Vertica&#8217;s customers are outside the US. While not shocking, that&#8217;s more than I realized.</li>
<li>Gartner notes something I don&#8217;t think I&#8217;ve posted yet, which is that Vertica has a customer with 300 TB of data. (The identity is a deep dark secret, but if I told you you probably wouldn&#8217;t recognize the name anyway.)</li>
</ul>
<p>As does any such piece, the Gartner Data Warehouse DBMS Magic Quadrant also has outright errors.  For example:</p>
<ul>
<li>Aster Data isn&#8217;t really &#8220;the newest entrant to the DBMS data warehouse world.&#8221;</li>
<li>Aster&#8217;s SQL/MapReduce was not new in Release 4.0.</li>
<li>Greenplum isn&#8217;t yet pushing down code to the storage tier.</li>
<li>I&#8217;m not sure what kind of database-tier parallelism Gartner is claiming is new in Oracle in 11g Release 2 &#8212; but I doubt it&#8217;s really new. Rather, what Oracle has done recently is <a href="http://www.dbms2.com/2010/01/22/oracle-database-hardware-strategy/" >make parallelism less administratively cumbersome</a>.</li>
<li>Vertica wasn&#8217;t really the first DBMS in the cloud. At most it was the first pure-play analytic DBMS to get there.</li>
</ul>
]]></content:encoded>
			<wfw:commentRss>http://www.dbms2.com/2010/02/10/gartner-magic-quadrant-data-warehouse-2009-2010/feed/</wfw:commentRss>
		<slash:comments>4</slash:comments>
		</item>
		<item>
		<title>Netezza Skimmer</title>
		<link>http://www.dbms2.com/2010/01/25/netezza-skimmer/</link>
		<comments>http://www.dbms2.com/2010/01/25/netezza-skimmer/#comments</comments>
		<pubDate>Mon, 25 Jan 2010 14:39:00 +0000</pubDate>
		<dc:creator>Curt Monash</dc:creator>
				<category><![CDATA[Data mart outsourcing]]></category>
		<category><![CDATA[Data warehouse appliances]]></category>
		<category><![CDATA[Data warehousing]]></category>
		<category><![CDATA[Netezza]]></category>
		<category><![CDATA[Pricing]]></category>

		<guid isPermaLink="false">http://www.dbms2.com/?p=1441</guid>
		<description><![CDATA[As I previously complained, last week wasn&#8217;t a very convenient time for me to have briefings. So when Netezza emailed to say it would release its new entry-level Skimmer appliance this morning, while I asked for and got a Friday afternoon briefing, I kept it quick and basic.
That said, highlights of my Netezza Skimmer briefing [...]]]></description>
			<content:encoded><![CDATA[<p>As I previously <a href="http://www.dbms2.com/2010/01/22/oracle-database-hardware-strategy/" >complained</a>, last week wasn&#8217;t a very convenient time for me to have briefings. So when Netezza emailed to say it would release its new entry-level Skimmer appliance this morning, while I asked for and got a Friday afternoon briefing, I kept it quick and basic.</p>
<p>That said, highlights of my Netezza Skimmer briefing included:</p>
<ul>
<li>In essence, Netezza Skimmer is 1/3 of Netezza&#8217;s previously smallest appliance, for 1/3 the price.</li>
<li>I.e., Netezza Skimmer has 1 S-blade and 9 disks, vs. 3 S-blades and 24 disks on the Netezza TwinFin 3.</li>
<li>With 1 disk reserved as a hot spare, that boils down to a 1:1:1 ratio among CPU cores, FPGA cores, and 1-terabyte disks on Netezza skimmer. The same could pretty much be said of Netezza TwinFin, the occasional hot-spare disk notwithstanding.</li>
<li>Netezza Skimmer costs $125K.</li>
<li>With 2.8 or so TB of space for user data before compression, that&#8217;s right in line with the <a href="http://www.dbms2.com/2009/07/30/the-netezza-price-point/" >Netezza price point</a> of slightly &lt;$20K/terabyte of user data.</li>
<li>That assumes Netezza&#8217;s usual 2.25X compression. I forgot to ask when 4X compression was actually being shipped.</li>
<li>I forgot to ask, but it seems obvious that Netezza Skimmer uses identical or substantially similar components to Netezza TwinFin&#8217;s.</li>
<li>Netezza Skimmer is 7 rack units high.</li>
<li>In place of the SMP hosts on TwinFin Systems, Netezza Skimmer has a host blade.</li>
<li>Netezza (specifically Phil Francisco) mentioned that when Kalido uses Netezza Skimmer for its appliance, there will be an additional host computer, but when it uses TwinFin for the same software, the built-in host will suffice. (Even so, I suspect it might be too strong to say that Skimmer&#8217;s built-in host computer is underpowered.)</li>
<li>Netezza also suggested that more appliance OEMs are coming down the pike specifically focused on the affordable Skimmer.</li>
</ul>
<p><span id="more-1441"></span>Obviously, Netezza Skimmer isn&#8217;t breaking any new technical ground. If Netezza had just called Skimmer &#8220;TwinFin 1,&#8221; nobody should have objected. So the main news here is that you can buy a Netezza box for $125K, plug it in, load a few terabytes of data, and be good to go with a pretty solid data warehouse.  For enterprises and data mart outsourcers with databases of the appropriate size, that could be a pretty attractive deal.</p>
<p>Is Netezza Skimmer as cheap as buying your own hardware and putting (free) <a href="http://www.dbms2.com/2009/10/19/greenplum-free-single-node-edition/" >Greenplum Single-Node Edition</a> software on it? Not even close, especially since Greenplum&#8217;s free option limits you to lower overall compute power. Does Netezza Skimmer have as high availability as more expensive alternatives? In some cases, surely not. Skimmer is neither the cheapest thing around nor an utterly high-end product.</p>
<p>But Netezza Skimmer belongs on a lot of short lists even so.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.dbms2.com/2010/01/25/netezza-skimmer/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Reports of perfectly-balanced hardware configurations are greatly exaggerated</title>
		<link>http://www.dbms2.com/2009/10/25/data-warehouse-balanced-hardware-configuration/</link>
		<comments>http://www.dbms2.com/2009/10/25/data-warehouse-balanced-hardware-configuration/#comments</comments>
		<pubDate>Sun, 25 Oct 2009 04:00:27 +0000</pubDate>
		<dc:creator>Curt Monash</dc:creator>
				<category><![CDATA[Data warehouse appliances]]></category>
		<category><![CDATA[Data warehousing]]></category>
		<category><![CDATA[Exadata]]></category>
		<category><![CDATA[Kickfire]]></category>
		<category><![CDATA[Netezza]]></category>
		<category><![CDATA[Oracle]]></category>
		<category><![CDATA[Teradata]]></category>

		<guid isPermaLink="false">http://www.dbms2.com/?p=1165</guid>
		<description><![CDATA[Data warehouse appliance and software appliance vendors like to claim that they&#8217;ve worked out just the right hardware configuration(s), and that a single configuration is correct for a fairly broad range of workloads. But there are a lot of reasons to be dubious about that. Specific vendor evidence includes:

Teradata ascribes 	considerable importance to a Virtual [...]]]></description>
			<content:encoded><![CDATA[<p style="margin-bottom: 0in;">Data warehouse appliance and software appliance vendors like to claim that they&#8217;ve worked out just the right hardware configuration(s), and that a single configuration is correct for a fairly broad range of workloads. But there are a lot of reasons to be dubious about that. Specific vendor evidence includes:</p>
<ul>
<li><strong>Teradata</strong> ascribes 	considerable importance to a <a href="http://www.dbms2.com/2008/10/14/teradata-virtual-storage//" >Virtual 	Storage</a> technology whose main purpose is to allow mixing of 	heterogeneous storage devices in a single system. And the discussion 	rarely suggests that these parts will be in a rigid fixed 	relationship.</li>
<li><strong>Netezza</strong> &#8212; as Teradata 	keeps reminding me &#8212; often sells boxes with the expectation that 	they won&#8217;t be filled with data, so as to increase spindle count and hence performance.</li>
<li><strong>Oracle/Sun</strong> have dropped 	some comments about Exadata being more flexibly configured going 	forward.</li>
<li><strong>Kickfire&#8217;s</strong> <a href="../2009/10/18/kickfire-capacity-and-pricing/">new 	“high-end” appliance</a> lets you attach fairly arbitrary 	amounts of external storage.</li>
<li>And of course, <strong>software-only 	analytic DBMS vendors</strong> run their software in all sorts of 	hardware and storage environments.</li>
</ul>
<p style="margin-bottom: 0in;">What&#8217;s more, the claim never made a lot of sense anyway. With the rarest of exceptions, even a single data warehouse&#8217;s workload will contain different queries that strain different parts of the system in different ratios. Calculating the “ideal” hardware configuration for that single workload would be forbiddingly difficult. And even if one could calculate it, it almost surely would be different than another user&#8217;s “ideal” configuration. How a single hardware configuration can be “ideally balanced” for a broad class of use cases boggles the imagination.</p>
<p style="margin-bottom: 0in;">
]]></content:encoded>
			<wfw:commentRss>http://www.dbms2.com/2009/10/25/data-warehouse-balanced-hardware-configuration/feed/</wfw:commentRss>
		<slash:comments>6</slash:comments>
		</item>
		<item>
		<title>Oracle Exadata 2 capacity pricing</title>
		<link>http://www.dbms2.com/2009/10/05/oracle-exadata-2-capacity-pricing/</link>
		<comments>http://www.dbms2.com/2009/10/05/oracle-exadata-2-capacity-pricing/#comments</comments>
		<pubDate>Mon, 05 Oct 2009 12:20:19 +0000</pubDate>
		<dc:creator>Curt Monash</dc:creator>
				<category><![CDATA[Analytic technologies]]></category>
		<category><![CDATA[Columnar database management]]></category>
		<category><![CDATA[Data warehouse appliances]]></category>
		<category><![CDATA[Data warehousing]]></category>
		<category><![CDATA[Database compression]]></category>
		<category><![CDATA[Exadata]]></category>
		<category><![CDATA[Netezza]]></category>
		<category><![CDATA[Oracle]]></category>
		<category><![CDATA[Pricing]]></category>
		<category><![CDATA[Teradata]]></category>

		<guid isPermaLink="false">http://www.dbms2.com/?p=1021</guid>
		<description><![CDATA[Summary of Oracle Exadata 2 capacity pricing
Analyzing Oracle Exadata pricing is always harder than one would first think. But I&#8217;ve finally gotten around to doing an Oracle Exadata 2 pricing spreadsheet. The main takeaways are:

If 	we believe Oracle&#8217;s claims of 10X compression, Exadata 2 costs more 	per terabyte of user data than Netezza TwinFin &#8212; [...]]]></description>
			<content:encoded><![CDATA[<p><strong><span style="font-style: normal;">Summary of Oracle Exadata 2 capacity pricing</span></strong></p>
<p><span style="font-style: normal;">Analyzing Oracle Exadata pricing is always harder than one would first think. But I&#8217;ve finally gotten around to doing an Oracle Exadata 2 pricing </span><a href="http://www.monash.com/uploads/Oracle-Exadata-pricing-estimates-Oct-2009.xls" onclick="javascript:pageTracker._trackPageview('/www.monash.com');">spreadsheet</a>.<span style="font-style: normal;"> The main takeaways are:</span></p>
<ul>
<li><span style="font-style: normal;">If 	we believe Oracle&#8217;s claims of 10X compression, Exadata 2 costs more 	per terabyte of user data than Netezza TwinFin &#8212; $22-26K/TB vs. 	TwinFin&#8217;s &lt;$20K &#8212; but less than the Teradata 2550.</span></li>
<li><span style="font-style: normal;">These 	figures are highly sensitive to assumptions about Oracle&#8217;s </span><a href="http://www.dbms2.com/2009/09/03/oracle-11g-exadata-hybrid-columnar-compression/" >hybrid 	columnar compression</a><span style="font-style: normal;">.</span></li>
<li><span style="font-style: normal;">Similarly, 	if Netezza or Teradata were to significantly upgrade their own 	compression, the price comparison would look quite different.</span></li>
<li><span style="font-style: normal;">Options 	such as Data Mining or Oracle Spatial add 12% or so each to 	Exadata&#8217;s total system price.</span></li>
</ul>
<p><strong><span style="font-style: normal;">Longer version</span></strong></p>
<p><span style="font-style: normal;">When Oracle introduced Exadata last year it was, well, <a href="../2008/09/30/oracle-database-machine-exadata-pricing-part-2/">expensive</a>. Exadata 2 has now been announced, and it is significantly cheaper than Exadata 1 per terabyte of user data, based on:</span></p>
<ul>
<li>Similar overall pricing</li>
<li>Twice the disk capacity</li>
<li>Better compression</li>
</ul>
<p style="margin-bottom: 0in;"><strong><span id="more-1021"></span>Compression is the big question mark.</strong> Row-based DBMS vendors have traditionally been, if anything, conservative in their compression claims, although Netezza recently went with a not-sandbagged 2.25X compression estimate to get below <a href="http://www.dbms2.com/2009/07/30/the-netezza-price-point/" >the $20K/terabyte price point</a>. <span style="font-style: normal;">Columnar software vendors have tended to be more aggressive, with figures of 10X or more casually thrown around, or 40-50X for archival storage. But since columnar vendors sell mainly on a software-only basis, those claims haven&#8217;t generally shown up in per-terabyte total system cost comparisons, which most commonly focus on data warehouse appliance product lines.*</span></p>
<p style="margin-bottom: 0in;"><em>*Kickfire, the one columnar pure-play appliance vendor, hast to date been quite conservative in its compression marketing claims.</em></p>
<p style="margin-bottom: 0in;"><span style="font-style: normal;">Oracle, however, recently announced a feature called </span><em>hybrid columnar compression</em><span style="font-style: normal;">, and is now making compression claims with the usual columnar grandiosity. Oracle&#8217;s story is 10X compression, and they&#8217;re sticking to it, perhaps because 10 is such a nice round number.* <span style="text-decoration: line-through;">Since Oracle hybrid columnar compression is part of 11g Release 2, and isn&#8217;t Exadata-specific, w</span>We can hope to eventually get a sense from the field of what levels of compression are actually realistic. <em>(Edit: Actually, it seems that <a href="http://blog.tanelpoder.com/2009/09/01/oracle-11gr2-has-been-released-and-with-column-oriented-storage-option/" onclick="javascript:pageTracker._trackPageview('/blog.tanelpoder.com');">hybrid columnar compression only works with Exadata</a>, at least at this time.)</em> But for now, we don&#8217;t have much to go on except Oracle&#8217;s claims.</span></p>
<p style="margin-bottom: 0in;"><em>*Greg Rahn of Oracle tweeted me yesterday that one customer is getting 12-17X compression on &#8220;dimensional model&#8221; data. That sounds comparable to <a href="http://www.dbms2.com/2008/09/24/vertica-finally-spells-out-its-compression-claims/" >Vertica&#8217;s claim of 20X on &#8220;marketing analytics&#8221; and 30X on &#8220;consumer data&#8221; datasets</a>.<br />
</em>
</p>
<p style="margin-bottom: 0in;"><span style="font-style: normal;"><strong>Based on 10X compression</strong></span><span style="font-style: normal;"> (vs. Netezza&#8217;s 2.25X and Teradata&#8217;s lower figure), </span><span style="font-style: normal;"><strong>Oracle Exadata 2 is somewhat more expensive than Netezza TwinFin, and significantly cheaper than Teradata&#8217;s 2550.</strong></span> Specifically, Oracle Exadata 2 comes in around $22K/TB of user data for a full rack, and $26K/TB for a quarter rack, which is the Exadata 2 configuration more comparable to a TwinFin rack in user data capacity. This is if you look at the Exadata hardware version that uses 600 GB SAS drives (vs. 1 TB SAS drives for TwinFin and 300 GB SAS drives for Teradata). With 2 TB SATA drives, at the same system pricing, Exadata prices are 70% lower, getting down to $6K/TB for a full rack. You can see how I got these figures on my Oracle Exadata 2 pricing spreadsheet linked above.</p>
<p style="margin-bottom: 0in;">Obviously, price/terabyte is just one metric. Throughput is often even more important, but also is a lot harder to quantify simply. Oracle Exadata 2 offers more raw I/O than Netezza TwinFin. Netezza TwinFin, with its FPGA-based pipelining, probably has more processing oomph than Oracle Exadata. Oracle&#8217;s compression could lead to better use of RAM cache. And so it goes.</p>
<p style="margin-bottom: 0in;"><span style="font-style: normal;">Meanwhile, two factors that in my opinion don&#8217;t matter much to the analysis are:</span></p>
<ul>
<li><em>The 	re-usability of Oracle licenses on other hardware.</em><span style="font-style: normal;"> Most of Exadata&#8217;s cost is either for hardware, or for server 	software that&#8217;s priced on a per-core basis. Neither of those is 	going to manage much (or any) more data three years from now than it 	can today.</span></li>
<li><em>What 	Oracle claims as pricing metrics.</em><span style="font-style: normal;"> Oracle&#8217;s comments on Exadata pricing generally sow confusion, which 	is why I do my own spreadsheets.</span></li>
</ul>
<p style="margin-bottom: 0in;"><em><strong>Related links</strong></em></p>
<ul>
<li><a href="http://www.dbms2.com/2009/09/25/the-hunt-for-oracle-exadata-production-references/" >The hunt for Oracle Exadata production references</a> (but hopefully some will be revealed at Oracle Open World)</li>
<li><a href="http://www.dbms2.com/2009/09/30/facts-and-rumors/" >A favorable rumor about Exadata sales</a></li>
<li><a href="http://www.dbms2.com/2009/09/29/integration-oltp-data-warehousing-exadata-2/" >Issues in integrating OLTP and data warehousing in a single system</a></li>
<li>James Kobielus of Forrester tweets that <a href="http://twitter.com/jameskobielus/status/4627010008" onclick="javascript:pageTracker._trackPageview('/twitter.com');">there are &#8220;plenty&#8221; of pleased Exadata users</a></li>
<li>More on <a href="http://www.dbms2.com/2009/10/06/oracle-and-vertica-on-compression-and-other-physical-data-layout-features/" >Oracle Exadata hybrid columnar compression</a></li>
</ul>
<p style="margin-bottom: 0in;">
]]></content:encoded>
			<wfw:commentRss>http://www.dbms2.com/2009/10/05/oracle-exadata-2-capacity-pricing/feed/</wfw:commentRss>
		<slash:comments>13</slash:comments>
		</item>
		<item>
		<title>Facts and rumors</title>
		<link>http://www.dbms2.com/2009/09/30/facts-and-rumors/</link>
		<comments>http://www.dbms2.com/2009/09/30/facts-and-rumors/#comments</comments>
		<pubDate>Wed, 30 Sep 2009 06:21:44 +0000</pubDate>
		<dc:creator>Curt Monash</dc:creator>
				<category><![CDATA[Benchmarks and POCs]]></category>
		<category><![CDATA[DATAllegro]]></category>
		<category><![CDATA[Data warehouse appliances]]></category>
		<category><![CDATA[Data warehousing]]></category>
		<category><![CDATA[Exadata]]></category>
		<category><![CDATA[Market share]]></category>
		<category><![CDATA[Microsoft and SQL*Server]]></category>
		<category><![CDATA[Netezza]]></category>
		<category><![CDATA[Oracle]]></category>
		<category><![CDATA[ParAccel]]></category>
		<category><![CDATA[Petabyte-scale data management]]></category>
		<category><![CDATA[Specific users]]></category>
		<category><![CDATA[Telecommunications]]></category>

		<guid isPermaLink="false">http://www.dbms2.com/?p=963</guid>
		<description><![CDATA[
Vertica is putting out a press 	release today touting its 100th customer, and talking of triple 	digit growth last year.
Multiple sources have told me that 	the DATAllegro system is being thrown out of Dell, so evidently Dell is telling this to one and all. If that goes 	through, this would presumably leave TEOCO as DATAllegro&#8217;s [...]]]></description>
			<content:encoded><![CDATA[<ul>
<li>Vertica is putting out a press 	release today touting its 100th customer, and talking of triple 	digit growth last year.</li>
<li>Multiple sources have told me that 	the DATAllegro system is being thrown out of <a href="http://www.dbms2.com/2009/03/02/closing-the-book-on-the-datallegro-customer-base/" >Dell</a>, so evidently Dell is telling this to one and all. If that goes 	through, this would presumably leave <a href="http://www.dbms2.com/2008/05/23/data-warehouse-appliance-power-user-teoco/" >TEOCO</a> as DATAllegro&#8217;s single happy 	customer. (I haven&#8217;t checked with Microsoft for its view.)</li>
<li>A rumor has it that Infiniband 	technology vendor Voltaire, Ltd. privately claims triple-digit sales 	of switches for Exadata 1 (I think that one would be one switch per Exadata installation, not per rack). Based just on a quick glance, this is far from confirmed by 	Voltaire&#8217;s earnings <a href="http://seekingalpha.com/article/135775-voltaire-ltd-q1-2009-earnings-call-transcript" onclick="javascript:pageTracker._trackPageview('/seekingalpha.com');">conference 	call</a> <a href="http://seekingalpha.com/article/152278-voltaire-q2-2009-earnings-transcript" onclick="javascript:pageTracker._trackPageview('/seekingalpha.com');">transcripts</a> or <a href="http://sec.gov/cgi-bin/browse-edgar?action=getcompany&amp;CIK=0001401678&amp;owner=exclude&amp;count=40" onclick="javascript:pageTracker._trackPageview('/sec.gov');">SEC 	filings</a>. However, the most recent transcript does seem to 	indicate Voltaire got multiple Exadata deals in the 	telecommunications sector, and suggests some Exadata penetration in 	other sectors as well.</li>
<li>I was told of a 	classified-agency user that has &gt;1 petabyte of data on Exadata 1 	and 600 terabytes or so on Netezza. My not-obviously-biased source says 	the agency is distinctly happier with Netezza than Exadata.</li>
<li>Like <a href="http://paraccel.com/data_warehouse_blog/?p=104" onclick="javascript:pageTracker._trackPageview('/paraccel.com');">ParAccel</a>, 	<a href="http://www.theregister.co.uk/2009/09/29/tpc_slaps_oracle/" onclick="javascript:pageTracker._trackPageview('/www.theregister.co.uk');">Oracle 	just got dinged for TPC-related misbehavior</a>.</li>
<li>Rumor has it that Sun has no 	intention of helping ParAccel rerun its withdrawn TPC-H benchmark.</li>
<li>ParAccel has withdrawn the claim 	from its home page to be the &#8220;CERTIFIED&#8221; price-performance 	leader. This seems to confirm that the claim was a reference to the 	TPC-H. In my opinion, that was a gross misrepresentation of what the 	TPC-H shows.</li>
</ul>
<p style="margin-bottom: 0in;">
]]></content:encoded>
			<wfw:commentRss>http://www.dbms2.com/2009/09/30/facts-and-rumors/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>What Nielsen really uses in data warehousing DBMS</title>
		<link>http://www.dbms2.com/2009/09/29/a-c-nielsen-data-warehousing-dbms/</link>
		<comments>http://www.dbms2.com/2009/09/29/a-c-nielsen-data-warehousing-dbms/#comments</comments>
		<pubDate>Tue, 29 Sep 2009 15:04:26 +0000</pubDate>
		<dc:creator>Curt Monash</dc:creator>
				<category><![CDATA[Analytic technologies]]></category>
		<category><![CDATA[Data mart outsourcing]]></category>
		<category><![CDATA[Data warehouse appliances]]></category>
		<category><![CDATA[Data warehousing]]></category>
		<category><![CDATA[Netezza]]></category>
		<category><![CDATA[Oracle]]></category>
		<category><![CDATA[Specific users]]></category>

		<guid isPermaLink="false">http://www.dbms2.com/?p=950</guid>
		<description><![CDATA[In its latest earnings call, Oracle made a reference to The Nielsen Company that was &#8212; to put it politely &#8212; rather confusing. I just plopped down in a chair next to Greg Goff, who evidently runs data warehousing at Nielsen, and had a quick chat. Here&#8217;s the real story.

The Nielsen Company has over half [...]]]></description>
			<content:encoded><![CDATA[<p>In its latest earnings call, Oracle made <a href="../2009/09/19/oracle-database-siz/">a reference to The Nielsen Company</a> that was &#8212; to put it politely &#8212; rather confusing. I just plopped down in a chair next to Greg Goff, who evidently runs data warehousing at Nielsen, and had a quick chat. Here&#8217;s the real story.</p>
<ul>
<li>The Nielsen Company has over half a 	petabyte of data on Netezza in the US. This installation is growing.</li>
<li>The Nielsen Company indeed has 45 	terabytes or whatever of data on Oracle in its European (Customer) 	Information Factory. This is not particularly growing. Nielsen&#8217;s 	Oracle data warehouse has been built up over the past 9 years. It&#8217;s 	not new. It&#8217;s certainly not on Exadata, nor planned to move to 	Exadata.</li>
<li>These are not single-instance 	databases. Nielsen&#8217;s biggest single Netezza database is 20 terabytes 	or so of user data, and its biggest single Oracle database is 10 	terabytes or so.</li>
<li>Much (most?) of the rest of the 	installations are customer data marts and the like, based in each 	case on the “big” central database. (That&#8217;s actually a classic 	<a href="../2009/06/08/the-future-of-data-marts/">data 	mart use case</a>.) Greg said that Netezza&#8217;s capabilities to spin 	out those databases seemed pretty good.</li>
<li>That 10 terabyte Oracle data 	warehouse instance requires a lot of partitioning effort and so on 	in the usual way.</li>
<li>Nielsen has no immediate plans to 	replace Oracle with Netezza.</li>
<li>Nielsen actually has 800 terabytes 	or so of Netezza equipment. Some of that is kept more lightly loaded, 	for performance.</li>
</ul>
]]></content:encoded>
			<wfw:commentRss>http://www.dbms2.com/2009/09/29/a-c-nielsen-data-warehousing-dbms/feed/</wfw:commentRss>
		<slash:comments>5</slash:comments>
		</item>
		<item>
		<title>Oracle gives a few customer database size examples</title>
		<link>http://www.dbms2.com/2009/09/19/oracle-database-siz/</link>
		<comments>http://www.dbms2.com/2009/09/19/oracle-database-siz/#comments</comments>
		<pubDate>Sun, 20 Sep 2009 00:40:52 +0000</pubDate>
		<dc:creator>Curt Monash</dc:creator>
				<category><![CDATA[Analytic technologies]]></category>
		<category><![CDATA[Data warehousing]]></category>
		<category><![CDATA[Exadata]]></category>
		<category><![CDATA[Netezza]]></category>
		<category><![CDATA[Oracle]]></category>
		<category><![CDATA[Specific users]]></category>
		<category><![CDATA[Telecommunications]]></category>
		<category><![CDATA[Web analytics]]></category>
		<category><![CDATA[Yahoo]]></category>

		<guid isPermaLink="false">http://www.dbms2.com/?p=905</guid>
		<description><![CDATA[In its recent quarterly conference call, Oracle said (as per the Seeking Alpha transcript):
AC Neilsen, for instance, we deployed a 45-terabyte data [mart], they called it; Adidas, 13 terabytes; Australian Bureau of Statistics, 250 terabytes; and of course, some of our high-end ones that you have probably heard of in the past, AT&#38;T, 250 terabytes; [...]]]></description>
			<content:encoded><![CDATA[<p>In its recent quarterly conference call, Oracle said (as per <a href="http://seekingalpha.com/article/161887-oracle-f1q10-qtr-end-8-31-09-earnings-call-transcript?page=5" onclick="javascript:pageTracker._trackPageview('/seekingalpha.com');">the Seeking Alpha transcript</a>):</p>
<blockquote><p>AC Neilsen, for instance, we deployed a 45-terabyte data [mart], they called it; Adidas, 13 terabytes; Australian Bureau of Statistics, 250 terabytes; and of course, some of our high-end ones that you have probably heard of in the past, AT&amp;T, 250 terabytes; Yahoo!, 700 terabytes &#8212; just gives you an idea of the size of the databases that are out there and how they are growing, and that’s driving the need for greater throughput.</p></blockquote>
<p>I don&#8217;t know what&#8217;s being counted there, but I wouldn&#8217;t be surprised if those were legit user-data figures.</p>
<p>Some other notes:</p>
<ul>
<li><span style="text-decoration: line-through;">The Yahoo database is of course Yahoo&#8217;s first-generation data warehouse, which has been largely superseded by <a href="http://www.dbms2.com/2009/07/06/yahoo-is-up-to-10-petabytes-now/" >an internal system more than 10X that size</a>.</span> <em>(Edit: Actually, Greg Rahn of Oracle says below that it&#8217;s a different database.)</em></li>
<li>I&#8217;m keynoting the Netezza road show this month, and Nielsen is up there on stage touting Netezza. <em>(Edit: <a href="http://www.dbms2.com/2009/09/29/a-c-nielsen-data-warehousing-dbms/" >Nielsen indeed does the overwhelming majority of its data warehousing on Netezza</a>.)</em></li>
<li>I&#8217;d be surprised if AT&amp;T&#8217;s largest data warehouse were &#8220;only&#8221; 250 terabytes in size. <em>(Edit: Actually, I am told the database in question is 310 TB of user data and growing. More later, hopefully.)</em></li>
<li>Oracle didn&#8217;t exactly say that those were Exadata installations.</li>
</ul>
]]></content:encoded>
			<wfw:commentRss>http://www.dbms2.com/2009/09/19/oracle-database-siz/feed/</wfw:commentRss>
		<slash:comments>9</slash:comments>
		</item>
	</channel>
</rss>

<!-- Dynamic page generated in 0.326 seconds. -->
<!-- Cached page generated by WP-Super-Cache on 2010-03-16 15:37:31 -->
