<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>DBMS 2 : DataBase Management System Services &#187; Pricing</title>
	<atom:link href="http://www.dbms2.com/category/pricing/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.dbms2.com</link>
	<description>Choices in data management and analysis</description>
	<lastBuildDate>Wed, 08 Feb 2012 22:51:11 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.0.3</generator>
		<item>
		<title>Notes on the Oracle Big Data Appliance</title>
		<link>http://www.dbms2.com/2012/01/10/notes-on-the-oracle-big-data-appliance/</link>
		<comments>http://www.dbms2.com/2012/01/10/notes-on-the-oracle-big-data-appliance/#comments</comments>
		<pubDate>Wed, 11 Jan 2012 01:32:39 +0000</pubDate>
		<dc:creator>Curt Monash</dc:creator>
				<category><![CDATA[Cloudera]]></category>
		<category><![CDATA[Hadoop]]></category>
		<category><![CDATA[MapReduce]]></category>
		<category><![CDATA[Oracle]]></category>
		<category><![CDATA[Pricing]]></category>

		<guid isPermaLink="false">http://www.dbms2.com/?p=5809</guid>
		<description><![CDATA[Oracle announced its Big Data Appliance. Specs may be found in the Oracle Big Data Appliance press release. Beyond that: The most important software on the Oracle Big Data Appliance is a full set of Cloudera Enterprise code. Oracle will do Tier 1 Cloudera/Hadoop support, while Cloudera handles Tiers 2 and 3. The key spec [...]]]></description>
			<content:encoded><![CDATA[<p>Oracle announced its Big Data Appliance. Specs may be found in <a href="http://www.oracle.com/us/corporate/press/1453721">the Oracle Big Data Appliance press release</a>. Beyond that:</p>
<ul>
<li>The most important software on the Oracle Big Data Appliance is a full set of <a href="../2012/01/10/a-couple-of-links-explaining-cloudera-manager/">Cloudera Enterprise</a> code. Oracle will do Tier 1 Cloudera/Hadoop support, while Cloudera handles Tiers 2 and 3.</li>
<li>The key spec ratios are 1 core/4 GB RAM/3 TB raw disk. That&#8217;s reasonably in line with <a href="http://www.dbms2.com/2011/06/04/hardware-for-hadoop/">Cloudera figures I published in June, 2010</a>.</li>
<li>This is really Oracle&#8217;s <a href="http://www.dbms2.com/2012/01/08/big-data-terminology-and-positioning/">multi-structured big data appliance</a>. Oracle&#8217;s relational big data appliance is Exadata, which has been out for years and has comparable capacity to Oracle&#8217;s new &#8220;Big Data Appliance.&#8221; (<a href="http://www.eweek.com/c/a/IT-Infrastructure/Oracle-Launches-ClouderaPowered-Big-Data-Appliance-172364/">Chris Preimesberger</a> made a similar point.)</li>
<li>The Oracle Big Data Appliance list price is $450,000 for 18 12-core servers, plus $54,000/year maintenance.
<ul>
<li>That&#8217;s around $25,000 per server (and associated storage).</li>
<li>That&#8217;s also around $2,000/core.</li>
<li>That&#8217;s also around $500/TB of spinning disk, before <a href="http://www.dbms2.com/2011/07/06/hadoop-hardware-and-compression/">compression</a>.</li>
<li>None of those per-unit figures sounds ridiculous &#8230;</li>
<li>&#8230; but because of Oracle&#8217;s appliance configuration there&#8217;s indeed a hefty minimum initial purchase.</li>
</ul>
</li>
</ul>
<p><a href="http://www.zdnet.com/blog/btl/oracle-rolls-out-big-data-play-with-aggressive-price-cloudera/66529"><span id="more-5809"></span>Peter Goldmacher</a> argues that, because of size and price point, the Oracle Big Data appliance is targeted for high-end deployments rather than starter/test/development set-ups. To first approximation, that makes sense, in that:</p>
<ul>
<li>The Oracle Big Data Appliance is in the petabyte range for data capacity, and &#8230;</li>
<li>&#8230; <a href="http://www.dbms2.com/2011/07/06/petabyte-hadoop-clusters/">the number of petabyte-scale Hadoop deployments is in the low tens</a>, and &#8230;</li>
<li>&#8230; many of those aren&#8217;t at Oracle shops anyway.</li>
</ul>
<p>Surely the Oracle Big Data Appliance isn&#8217;t designed for the 4-8 node play-with-Hadoop crowd.</p>
<p>On the the other hand, if you&#8217;re at a big, committed Oracle shop, and you want to do your first serious Hadoop deployment, why not go with the Oracle Big Data Appliance? You probably could save money with an alternative approach &#8212; but if your employers are committed to Oracle, saving money is surely not their greatest concern. Overpay by a bit; make your management happy with the Oracle logo; get Hadoop on your resume; prosper. That seems like a winning plan all the way around.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.dbms2.com/2012/01/10/notes-on-the-oracle-big-data-appliance/feed/</wfw:commentRss>
		<slash:comments>4</slash:comments>
		</item>
		<item>
		<title>Exasol update</title>
		<link>http://www.dbms2.com/2011/11/12/exasol-update/</link>
		<comments>http://www.dbms2.com/2011/11/12/exasol-update/#comments</comments>
		<pubDate>Sun, 13 Nov 2011 02:37:13 +0000</pubDate>
		<dc:creator>Curt Monash</dc:creator>
				<category><![CDATA[Benchmarks and POCs]]></category>
		<category><![CDATA[Columnar database management]]></category>
		<category><![CDATA[Data warehousing]]></category>
		<category><![CDATA[Database compression]]></category>
		<category><![CDATA[Exasol]]></category>
		<category><![CDATA[Market share and customer counts]]></category>
		<category><![CDATA[Pricing]]></category>
		<category><![CDATA[Software as a Service (SaaS)]]></category>
		<category><![CDATA[Specific users]]></category>
		<category><![CDATA[Sybase]]></category>
		<category><![CDATA[Workload management]]></category>

		<guid isPermaLink="false">http://www.dbms2.com/?p=5661</guid>
		<description><![CDATA[I last wrote about Exasol in 2008. After talking with the team Friday, I&#8217;m fixing that now. The general theme was as you&#8217;d expect: Since last we talked, Exasol has added some new management, put some effort into sales and marketing, got some customers, kept enhancing the product and so on. Top-level points included: Exasol&#8217;s [...]]]></description>
			<content:encoded><![CDATA[<p><a href="../../../../../2008/08/16/exasol-technical-briefing/">I last wrote about Exasol in 2008</a>. After talking with the team Friday, I&#8217;m fixing that now. <img src='http://www.dbms2.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  The general theme was as you&#8217;d expect: Since last we talked, Exasol has added some new management, put some effort into sales and marketing, got some customers, kept enhancing the product and so on.</p>
<p>Top-level points included:</p>
<ul>
<li>Exasol&#8217;s technical philosophy is substantially the same as before, albeit not with as extreme a focus on fitting everything in RAM.</li>
<li>Exasol believes its flagship DBMS EXASolution has great performance on a load-and-go basis.</li>
<li>Exasol has 25 EXASolution customers, all in Germany.*</li>
<li>5 of those are &#8220;cloud&#8221; customers, at hosting providers engaged by Exasol.</li>
<li>EXASolution database sizes now range from the low 100s of gigabytes up to 30 terabytes.</li>
<li>Pretty much the whole company is in Nuremberg.</li>
</ul>
<p><span id="more-5661"></span><em>*That excludes some money from Hitachi. Exasol&#8217;s Hitachi partnership is still in limbo, an apparent casualty of the world economic crisis.</em></p>
<p>On the technical side:</p>
<ul>
<li>As noted in my 2008 post, EXASolution is a columnar, no-head-node MPP (Massively Parallel Processing) DBMS.</li>
<li>The main way EXASolution compresses data is via dictionary/tokenization. 5:1 is a typical compression ratio before mirroring and so on, out of a 2-10:1 range.</li>
<li>EXASolution writes data to blocks in memory that are smaller than what is otherwise its preferred size (1/2 to 5 megabytes). These are sent to disk, where merge eventually happens. Exasol insists that write performance has always been fully satisfactory to customers to date.</li>
<li>EXASolution doesn&#8217;t have much in the way of performance tuning knobs. Exasol says they aren&#8217;t needed, and says that one really can start an EXASolution POC (Proof of Concept) in a day or so.</li>
<li>EXASolution doesn&#8217;t have much in the way of workload management capabilities, except what&#8217;s automagic (e.g., short query bias). However, it does collect statistics you can query via your favorite BI tool.</li>
<li>EXASolution doesn&#8217;t have much in the way of <a href="../../../../../2011/02/24/analytic-platforms/">analytic platform</a> capabilities, although there is some Lua-based scripting. However, there&#8217;s something NDA in the analytic platform area Coming Soon.*</li>
</ul>
<p>In general, the whole thing sounds somewhat like ParAccel, at least at a high level.</p>
<p><em>*Exasol is not and never has been our client, but we can keep secrets for them even so.</em></p>
<p>Naturally, Exasol believes EXASolution has fine concurrency, with at least one customer routinely running 2000 concurrent users, 200 concurrent sessions (via connection pooling), and 5-10 concurrent queries. Another customer has 3500 Cognos users. 1-200 concurrent queries appears to be the record peak load. Anyhow, Exasol says that plans to offer real workload management could be accelerated if a need were discovered.</p>
<p>Exasol says it almost never loses POCs, but admits that it competes fairly rarely against Vertica and ParAccel, no doubt for reasons of geography. Exasol boasts one visible Sybase IQ replacement (Sony Music).</p>
<p>While Exasol&#8217;s sales to date have been in Germany, there are plans to change that soon. At least one sales cycle is well underway in Eastern Europe. Offices in other Germanic countries are planned. Existing customers are planning to deploy additional copies outside Germany. Discussions are underway regarding other geographies, e.g. English-speaking ones.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.dbms2.com/2011/11/12/exasol-update/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Vertica Community Edition</title>
		<link>http://www.dbms2.com/2011/10/18/vertica-community-edition/</link>
		<comments>http://www.dbms2.com/2011/10/18/vertica-community-edition/#comments</comments>
		<pubDate>Tue, 18 Oct 2011 15:48:10 +0000</pubDate>
		<dc:creator>Curt Monash</dc:creator>
				<category><![CDATA[Pricing]]></category>
		<category><![CDATA[Vertica Systems]]></category>

		<guid isPermaLink="false">http://www.dbms2.com/?p=5491</guid>
		<description><![CDATA[The press release announcing Vertica&#8217;s Community Edition is a bit vague. And indeed, much of what I know about Vertica Community Edition is along the lines of &#8220;This is what I think will happen, but of course it could still change.&#8221; That said, I believe: Vertica Community Edition has all of regular Vertica&#8217;s features. However [...]]]></description>
			<content:encoded><![CDATA[<p>The press release announcing <a href="http://www.vertica.com/news/press/vertica-announces-community-edition-version-of-vertica-analytic-database/">Vertica&#8217;s Community Edition</a> is a bit vague. And indeed, much of what I know about Vertica Community Edition is along the lines of &#8220;This is what I think will happen, but of course it could still change.&#8221; That said, I believe:</p>
<ul>
<li>Vertica Community Edition has all of regular Vertica&#8217;s features. However &#8230;</li>
<li>&#8230; HP Vertica reserves the right to open a feature gap in future releases.</li>
<li>The license restriction on Vertica Community Edition is that you&#8217;re limited to 1 terabyte of data, and 3 nodes. I imagine that&#8217;s for one production copy, and you&#8217;re perfectly free to also set up mirrors for test, development, disaster recovery, and so on. However &#8230;</li>
<li>&#8230; HP Vertica would be annoyed if you stuck a free copy of Vertica on each of 50 nodes and managed the whole thing via, say, Hadapt.</li>
<li>HP Vertica plans to be very generous with true academic researchers, suspending or waiving limits on database size and node count. Not coincidentally, Vertica Community Edition is being announced at <a href="http://www.dbms2.com/2011/09/20/xldb-the-one-conference-i-like-to-go-to/">XLDB</a>, where Vertica is also a top-level sponsor. (I introduced Vertica and XLDB&#8217;s Jacek Becla to each other as soon as I heard about Vertica&#8217;s Community Edition plans.)</li>
<li>The only support available for Vertica Community Edition is through forums. This could change.</li>
</ul>
<p>I&#8217;m a big supporter of the Vertica Community Edition idea, for four reasons:</p>
<ul>
<li>It should now be easier to download and evaluate Vertica.</li>
<li>Vertica Community Edition could be a big help to academic researchers.</li>
<li>Vertica could now be more appealing to some of the &#8220;Omigod, we&#8217;re outgrowing Oracle Standard Edition and we don&#8217;t want to pay up for Oracle Enterprise Edition/Exadata&#8221; crowd.</li>
<li>People are under the impression that what Vertica actually charges today resembles its <a href="http://www.dbms2.com/2009/04/25/vertica-pricing-and-customer-metrics/">long-ago list prices</a>. This announcement may help puncture Vertica&#8217;s outdated pricing image.</li>
</ul>
]]></content:encoded>
			<wfw:commentRss>http://www.dbms2.com/2011/10/18/vertica-community-edition/feed/</wfw:commentRss>
		<slash:comments>6</slash:comments>
		</item>
		<item>
		<title>salesforce.com, force.com, database.com, data.com, heroku.com &#8212; notes and context</title>
		<link>http://www.dbms2.com/2011/09/15/salesforce-force-database-data-heroku/</link>
		<comments>http://www.dbms2.com/2011/09/15/salesforce-force-database-data-heroku/#comments</comments>
		<pubDate>Thu, 15 Sep 2011 16:09:08 +0000</pubDate>
		<dc:creator>Curt Monash</dc:creator>
				<category><![CDATA[Market share and customer counts]]></category>
		<category><![CDATA[Pricing]]></category>
		<category><![CDATA[Software as a Service (SaaS)]]></category>
		<category><![CDATA[salesforce.com]]></category>

		<guid isPermaLink="false">http://www.dbms2.com/?p=5236</guid>
		<description><![CDATA[As previously noted, I attended Dreamforce, the user conference for my clients at salesforce.com. When I work with them, I focus primarily on database.com and related businesses. I&#8217;ve had to struggle a bit, however, to sort out the various pieces, and specifically the differences among: salesforce.com. This is the parent company, and the runaway leader [...]]]></description>
			<content:encoded><![CDATA[<p>As previously noted, I attended <a href="http://www.texttechnologies.com/2011/09/14/social-technology-in-the-enterprise/">Dreamforce</a>, the user conference for my clients at salesforce.com. When I work with them, I focus primarily on database.com and related businesses. I&#8217;ve had to struggle a bit, however, to sort out the various pieces, and specifically the differences among:</p>
<ul>
<li><strong>salesforce.com.</strong> This is the parent company, and the runaway leader in the SaaS (Software as a Service) enterprise application market, especially in the area of CRM (Customer Relationship Management).</li>
<li><strong>force.com.</strong> This is salesforce.com&#8217;s application development stack split out for other SaaS vendors to use, both inside and outside the CRM segment. It can be referred to as a PaaS offering (Platform as a Service). force.com relies on a proprietary salesforce.com language called APEX, which has a strong stored procedure/ database trigger orientation.</li>
<li><strong>database.com.</strong> This is the database part of force.com, spun out separately in general availability as of Dreamforce two weeks ago.</li>
<li><strong>data.com.</strong> Also launched at Dreamforce (and based, if I understand correctly, on an acquisition), this is a provider of 3rd-party data you might use as inputs to your CRM systems.</li>
<li><strong>Heroku.</strong> Another salesforce.com acquisition, Heroku is in essence a PaaS competitor to force.com. Heroku is focused on Ruby and Java, and supports a number of DBMS, SQL and NoSQL alike.</li>
<li><strong><a href="http://appexchange.salesforce.com/home">AppExchange</a>.</strong> This is a marketplace for things designed to integrate with salesforce.com (and perhaps also apps built on force.com). The latest claim is that there are 1200+ AppExchange offerings.</li>
<li><strong>The complete set of SaaS apps built on force.com.</strong> A <a href="http://www.salesforce.com/au/assets/pdf/Force.com_Multitenancy_WP_101508.pdf">2008 white paper</a> refers to 47,000 organizations being &#8220;supported&#8221; by force.com. Recently I&#8217;ve heard a figure just under 100,000. I&#8217;m not clear as to what that metric measures &#8212; aggregate users of SaaS apps built  via force.com? Clearly there are a lot of SaaS apps built on force.com, with actual customers, but I don&#8217;t know how big &#8220;a lot&#8221; is. (Perhaps a salesforce.com person could chime into the comment thread with some clarity.)</li>
</ul>
<p><span id="more-5236"></span>The pricing for force.com and database.com is clearly designed for enterprise SaaS applications whose users will be inside customer organizations. If you want to do something public-facing, prices are prohibitive without a special deal. That&#8217;s the bad news. The good news is that salesforce.com says, publicly and privately, that it&#8217;s indeed open to cutting such volume pricing deals.</p>
<p>When I talked with CTOs and the like at some Dreamforce-exhibiting SaaS vendors, on the whole they seemed<strong> very happy with force.com.</strong> The one repeated complaint was that force.com imposed <strong>unpleasant rate limits</strong> (e.g., number of API calls). Working around those limits involves unnatural acts of coding, phones calls to helpful salesforce.com staffers to get the limits raised, or both. When I talked with salesforce.com cofounder Parker Harris, he seemed painfully aware of the problem, and indicated that relaxing the limits is an important technical goal.</p>
<p>The other force.com weakness I uncovered was expected &#8212; while it may be great as long as your application matches its implicit assumptions, there are <strong>some things it can&#8217;t easily do.</strong> This has been a recurring issue since database-oriented 4GLs (Fourth-Generation Languages) came around in the 1980s. For example, one firm wanted a Flash UI &#8212; I&#8217;m not sure why &#8212; and went outside force.com for that part of the application.</p>
<p><em><strong>Related link</strong></em></p>
<ul>
<li>The <a href="http://www.dbms2.com/2011/09/15/database-architecture-salesforce-com-force-com-and-database/">database architecture of salesforce.com, force.com, and database.com</a></li>
</ul>
]]></content:encoded>
			<wfw:commentRss>http://www.dbms2.com/2011/09/15/salesforce-force-database-data-heroku/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>Kaminario goes (mainly) flash</title>
		<link>http://www.dbms2.com/2011/09/14/kaminario-goes-mainly-flash/</link>
		<comments>http://www.dbms2.com/2011/09/14/kaminario-goes-mainly-flash/#comments</comments>
		<pubDate>Wed, 14 Sep 2011 09:30:53 +0000</pubDate>
		<dc:creator>Curt Monash</dc:creator>
				<category><![CDATA[Kaminario]]></category>
		<category><![CDATA[OLTP]]></category>
		<category><![CDATA[Pricing]]></category>
		<category><![CDATA[Solid-state memory]]></category>

		<guid isPermaLink="false">http://www.dbms2.com/?p=5227</guid>
		<description><![CDATA[Kaminario, which used to be in the business of solid state storage via DRAM, now is emphasizing hybrid DRAM/flash storage appliances instead. The reason is evidently price. Per terabyte of primary storage (before mirroring onto disk and so on): A Kaminario K2 DRAM-only appliance costs $100K. A Kaminario K2 flash-only appliance costs $30K (but nobody [...]]]></description>
			<content:encoded><![CDATA[<p>Kaminario, which used to be in the business of solid state storage via DRAM, now is emphasizing hybrid DRAM/flash storage appliances instead. The reason is evidently price. <strong>Per terabyte of primary storage</strong> (before mirroring onto disk and so on):</p>
<ul>
<li>A Kaminario K2 DRAM-only appliance costs <strong>$100K.</strong></li>
<li>A Kaminario K2 flash-only appliance costs $30K (but nobody buys that configuration).</li>
<li>A typical Kaminario K2 hybrid DRAM/flash appliance might cost <strong>$35K</strong> (which tells us that there&#8217;s a lot more flash than DRAM).</li>
</ul>
<p>Kaminario positions DRAM as where you focus your most write-intensive/ bottlenecking loads, such as logging or <a href="../../../../../2010/08/16/vertica-flash-temp-space/">temp space</a>, with the primary benefit being performance and a secondary benefit being slowing the wear on your flash.</p>
<p><span id="more-5227"></span><em>If you want even your mirrors to be on flash &#8212; which Kaminario says greatly reduces the temporary performance hit in case of a failure &#8212; there will be an additional charge. Perhaps Kaminario will dig up a price number and post it in the comment thread.</em></p>
<p>The flash comes in via Fusion-io cards. Kaminario stresses that it sells a SAN (Storage Area Network) kind of offering, as opposed to the shared-nothing way one might otherwise use Fusion-io cards in servers&#8217; PCIe slots. Kamanario further asserts its built-in high availability is both smoother and less costly than Texas Memory Systems or Violin Memory alternatives; Kaminario is generally proud of its high availability features, down to redundant uninterruptible power supplies. Apparently the sweet spot of Kaminario&#8217;s market is single-chassis 5-6 TB systems, but Kaminario asserts seamless elasticity even if you grow into a second chassis.</p>
<p>Price resistance seems to have gotten strongly in the way of Kaminario&#8217;s growth, although the company was evasive about customer counts and the like. But it does now have 60+ employees and an aggressive hiring plan, vs. &lt;50 when <a href="../../../../../2010/10/19/introduction-to-kaminario/">I wrote about Kaminario a year ago</a>. I do believe that many enterprises would benefit from<strong> throwing solid-state storage at certain performance problems,</strong> at least as a band-aid, while they contemplate software changes.* But evidently Kaminario has had difficulties &#8212; especially at the DRAM-only price point &#8212; getting customers to agree, or at least to agree that Kaminario K2 was a sufficiently cost-effective way to address the issue.</p>
<p><em>*If you like, you can regard this as <strong>deferring repayment of your technical debt.</strong></em></p>
<p>Kaminario&#8217;s comments about how its technology is or will be applied are all over the place (again, I think part of this is due to having a small number of customers overall, and wanting to conceal how small that number is). But in general Kaminario has seen more OLTP (OnLine Transaction Processing) than analytic uptake, which contributes to them thinking that low latency is a bigger deal than raw IOPS (Input/Output Per Second). Certainly Kaminario is focused on database applications of some kind or other, generally running on big-name DBMS such as Oracle or Microsoft SQL Server</p>
]]></content:encoded>
			<wfw:commentRss>http://www.dbms2.com/2011/09/14/kaminario-goes-mainly-flash/feed/</wfw:commentRss>
		<slash:comments>6</slash:comments>
		</item>
		<item>
		<title>Eight kinds of analytic database (Part 1)</title>
		<link>http://www.dbms2.com/2011/07/05/eight-kinds-of-analytic-database-part-1/</link>
		<comments>http://www.dbms2.com/2011/07/05/eight-kinds-of-analytic-database-part-1/#comments</comments>
		<pubDate>Tue, 05 Jul 2011 08:17:44 +0000</pubDate>
		<dc:creator>Curt Monash</dc:creator>
				<category><![CDATA[Analytic technologies]]></category>
		<category><![CDATA[Aster Data]]></category>
		<category><![CDATA[Benchmarks and POCs]]></category>
		<category><![CDATA[Business intelligence]]></category>
		<category><![CDATA[Buying processes]]></category>
		<category><![CDATA[Columnar database management]]></category>
		<category><![CDATA[Data warehouse appliances]]></category>
		<category><![CDATA[Data warehousing]]></category>
		<category><![CDATA[Database compression]]></category>
		<category><![CDATA[Database diversity]]></category>
		<category><![CDATA[Exadata]]></category>
		<category><![CDATA[Greenplum]]></category>
		<category><![CDATA[IBM and DB2]]></category>
		<category><![CDATA[Infobright]]></category>
		<category><![CDATA[Investment research and trading]]></category>
		<category><![CDATA[Log analysis]]></category>
		<category><![CDATA[MOLAP]]></category>
		<category><![CDATA[Microsoft and SQL*Server]]></category>
		<category><![CDATA[Netezza]]></category>
		<category><![CDATA[OLTP]]></category>
		<category><![CDATA[Oracle]]></category>
		<category><![CDATA[ParAccel]]></category>
		<category><![CDATA[Parallelization]]></category>
		<category><![CDATA[Petabyte-scale data management]]></category>
		<category><![CDATA[Predictive modeling and advanced analytics]]></category>
		<category><![CDATA[Pricing]]></category>
		<category><![CDATA[QlikTech and QlikView]]></category>
		<category><![CDATA[SAND Technology]]></category>
		<category><![CDATA[Scientific research]]></category>
		<category><![CDATA[Sybase]]></category>
		<category><![CDATA[Teradata]]></category>
		<category><![CDATA[Vertica Systems]]></category>
		<category><![CDATA[Web analytics]]></category>
		<category><![CDATA[Workload management]]></category>

		<guid isPermaLink="false">http://www.dbms2.com/?p=4868</guid>
		<description><![CDATA[Analytic data management technology has blossomed, leading to many questions along the lines of &#8220;So which products should I use for which category of problem?&#8221; The old EDW/data mart dichotomy is hopelessly outdated for that purpose, and adding a third category for &#8220;big data&#8221; is little help. Let&#8217;s try eight categories instead. While no categorization [...]]]></description>
			<content:encoded><![CDATA[<p>Analytic data management technology has blossomed, leading to many questions along the lines of &#8220;So which products should I use for which category of problem?&#8221; The old EDW/data mart dichotomy is hopelessly outdated for that purpose, and adding a third category for &#8220;big data&#8221; is little help.</p>
<p>Let&#8217;s try eight categories instead. While <a href="http://www.strategicmessaging.com/no-market-categorization-is-ever-precise/2011/03/01/">no categorization is ever perfect</a>, these each have at least some degree of technical homogeneity. Figuring out which types of analytic database you have or need &#8212; and in most cases you&#8217;ll need several &#8212; is a great early step in your analytic technology planning.  <span id="more-4868"></span></p>
<p><strong><em>Enterprise data warehouse</em></strong> (Full or partial)</p>
<ul>
<li><em>Kinds of data likely to be included:</em> All, but especially operational</li>
<li><em>Likely use styles:</em> All</li>
<li><em>Canonical example:</em> Central EDW for a big enterprise</li>
<li><em>Stresses:</em> Concurrency, reliability, workload management</li>
</ul>
<p>The enterprise data warehouse (EDW) ideal says that you copy all your data into one place, and drive all decision-making from there. <a href="../../../../../2011/06/21/its-official-the-grand-central-edw-will-never-happen/">Full EDWs are pipedreams</a>. Still, a partial EDW makes sense for most large enterprises, and many indeed already have one. The first product lines to consider for classical EDWs are Teradata, DB2, Exadata, and maybe Microsoft SQL Server, especially if you&#8217;re going to stress concurrency and/or operational use cases.</p>
<p><strong><em>Traditional data mart</em></strong></p>
<ul>
<li><em>Kinds of data likely to be included:</em> All</li>
<li><em>Likely use styles:</em> Business intelligence, budgeting/consolidation, investigative</li>
<li><em>Examples:</em> Reporting servers, planning/consolidation servers, anything MOLAP, etc.</li>
<li><em>Stresses:</em> Performance, concurrency, TCO</li>
</ul>
<p>Whether or not you have something like an enterprise data warehouse, it&#8217;s common to have lighter-weight data marts as well. A traditional data mart might drive reports and dashboards. Or it might be specialized for budgeting, planning, and/or consolidation.  Some <a href="../../../../../2011/03/03/investigative-analytics/">investigative analytics</a> may be in the mix as well.</p>
<p>Any DBMS that can support an EDW can also support a data mart, but it may not be the most cost-effective way to do so. Columnar DBMS might have more attractive performance and TCO (Total Cost of Ownership); the same goes for Netezza. Some of them &#8212; e.g. Sybase IQ and <a href="../../../../../2011/06/20/vertica-release-5/">Vertica</a> &#8212; have excellent track records in concurrent usage as well. <a href="../../../../../2011/05/29/when-to-use-relational-database-management-system/">Ted Codd</a> pushed what amounts to MOLAP (Multidimensional OnLine Analytic Processing) systems for these use cases. But relational DBMS commonly do a better job, which is one reason most major MOLAP products have wound up at RDBMS companies.</p>
<p><strong><em>Investigative data mart &#8212; agile</em></strong></p>
<ul>
<li><em>Kinds of data likely to be included:</em> All, especially customer-centric</li>
<li><em>Likely use styles</em>: Investigative</li>
<li><em>Canonical example:</em> A few analysts getting a few TB to examine</li>
<li><em>Stresses:</em> Ease of setup/load, ease of admin, price/performance</li>
</ul>
<p>Besides the traditional data mart, there are at least two other kinds. Both are focused on investigative analytics, but they&#8217;re differentiated by database size.</p>
<p>If you have just a few analysts,* looking at no more than a few terabytes of data (perhaps even just some gigabytes) &#8212; and if that data is &#8220;single-subject&#8221; and fairly homogenous &#8212; your watchwords should be &#8220;cheap&#8221;, &#8220;easy&#8221;, and &#8220;fast&#8221;. You don&#8217;t need to invest in much hardware, in expensive software, in much administrative effort (the analysts can be their own DBAs),  nor should you endure much set-up time. Just grab a product, grab some data, and start running queries (or extracts into the statistical tool of your choice).</p>
<p><em>*If you have dozens or even hundreds of analysts hitting the same database, you&#8217;re probably back to the more concurrency-oriented scenarios outlined above.</em></p>
<p>Infobright is often cost-effective among columnar analytic DBMS. Other vendors might cut you a price break as well. If you have multiple terabytes of data, don&#8217;t rule out Netezza&#8217;s lowest-end products (even if they&#8217;d really rather sell you something bigger). Or, if you&#8217;re in the sub-terabyte range, maybe you can get by with an in-memory BI tool such as QlikView, and not do anything special on the DBMS side at all.</p>
<p><strong><em>Investigative data mart &#8212; big</em></strong></p>
<ul>
<li><em>Kinds of data likely to be included:</em> All, especially customer-centric, logs, financial trade, scientific</li>
<li><em>Likely use styles</em>: Investigative</li>
<li><em>Canonical example:</em> Single-subject 20 TB &#8211; 20 PB relational database<em></em></li>
<li><em>Stresses:</em> Performance, scale-out, analytic functionality</li>
</ul>
<p>But if you&#8217;re looking at tens of terabytes of relational data, or even more, you really do have a &#8220;big data&#8221; problem. Performance and scalability are major challenges, usually best addressed by MPP (Massively Parallel Processing) systems, such as Netezza, Vertica, Aster Data, ParAccel, Teradata, or Greenplum. Performance POCs (Proofs Of Concept) are a big part of the buying process. Vendor price negotiations are crucial too.</p>
<p><em>Actually, in the low tens of terabytes you might be able to get away with a shared-disk system that has excellent compression &#8212; e.g., columnar products like Sybase IQ, Infobright, or SAND, rather than just Vertica and ParAccel.</em></p>
<p>Assuming you have affordable, scalable query performance, the competitive differentiator can switch to additional analytic functionality. Aster, Netezza, ParAccel, Vertica, and Greenplum either offer full <a href="../../../../../2011/02/24/analytic-platforms/">analytic platforms</a>, or seem to be on the path to doing so. Teradata, which now owns Aster Data, offers substantial built-in analytic capability in its traditional products as well, and the same goes for Sybase IQ.</p>
<p><em>Continued in <a href="http://www.dbms2.com/2011/07/05/eight-kinds-of-analytic-database-part-2/">Part 2</a>,</em><em> where we cover some of the more difficult use cases.</em></p>
]]></content:encoded>
			<wfw:commentRss>http://www.dbms2.com/2011/07/05/eight-kinds-of-analytic-database-part-1/feed/</wfw:commentRss>
		<slash:comments>4</slash:comments>
		</item>
		<item>
		<title>Observations on Oracle pricing</title>
		<link>http://www.dbms2.com/2011/06/24/observations-on-oracle-pricing/</link>
		<comments>http://www.dbms2.com/2011/06/24/observations-on-oracle-pricing/#comments</comments>
		<pubDate>Fri, 24 Jun 2011 06:45:45 +0000</pubDate>
		<dc:creator>Curt Monash</dc:creator>
				<category><![CDATA[Exadata]]></category>
		<category><![CDATA[Oracle]]></category>
		<category><![CDATA[Pricing]]></category>

		<guid isPermaLink="false">http://www.dbms2.com/?p=4816</guid>
		<description><![CDATA[A couple of months ago, Oracle asked me to pull some observations on pricing until after the earnings call that just occurred, and I grudgingly acquiesced. In the interim, more information on Oracle pricing has emerged (including in the comment thread to that post). The original notes are: Oracle disputes some common claims about its [...]]]></description>
			<content:encoded><![CDATA[<p>A couple of months ago, Oracle asked me to pull some observations on pricing until after the <a href="http://www.dbms2.com/2011/06/24/forthcoming-oracle-appliances/">earnings call</a> that just occurred, and <a href="http://www.dbms2.com/2011/05/03/oracle-exadata-business-technology/">I grudgingly acquiesced</a>. In the interim, <a href="http://www.dbms2.com/2011/06/15/notes-and-links-june-15-2011/">more information on Oracle pricing</a> has emerged (including in the comment thread to that post). The original notes are:</p>
<p>Oracle disputes some common claims about its cost and pricing. In particular, <strong>Oracle software maintenance costs a fixed 22% of your annual license price, </strong>so<strong> if you get a discount on your licenses, it ripples through to your maintenance.</strong> This is true even if you have an all-you-can-eat ULA (Unlimited License Agreement).</p>
<ul>
<li>Based on that, Oracle contends      that Exadata isn’t all that expensive if you have a suitable ULA. You have      to buy the hardware and the storage software, but the database server      software is effectively free. (Whether your use of additional licenses      affect the price of your ULA when it comes up for renewal might, of      course, be a different matter.)</li>
<li>Nothing in that discussion      obviates the point that if you’re just using Oracle Standard Edition,      upgrading to Oracle Enterprise Edition, associated chargeable options,      and/or Exadata can be seriously expensive.</li>
</ul>
]]></content:encoded>
			<wfw:commentRss>http://www.dbms2.com/2011/06/24/observations-on-oracle-pricing/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Notes and links, June 15, 2011</title>
		<link>http://www.dbms2.com/2011/06/15/notes-and-links-june-15-2011/</link>
		<comments>http://www.dbms2.com/2011/06/15/notes-and-links-june-15-2011/#comments</comments>
		<pubDate>Wed, 15 Jun 2011 11:07:32 +0000</pubDate>
		<dc:creator>Curt Monash</dc:creator>
				<category><![CDATA[1010data]]></category>
		<category><![CDATA[Analytic technologies]]></category>
		<category><![CDATA[Aster Data]]></category>
		<category><![CDATA[Business intelligence]]></category>
		<category><![CDATA[Columnar database management]]></category>
		<category><![CDATA[Data models and architecture]]></category>
		<category><![CDATA[Data warehousing]]></category>
		<category><![CDATA[Exadata]]></category>
		<category><![CDATA[Oracle]]></category>
		<category><![CDATA[Pricing]]></category>
		<category><![CDATA[Specific users]]></category>
		<category><![CDATA[Teradata]]></category>
		<category><![CDATA[Theory and architecture]]></category>

		<guid isPermaLink="false">http://www.dbms2.com/?p=4722</guid>
		<description><![CDATA[Five things:  Back in April, Steve Miller suggested that approximate BI could be a growing trend, gaining speed at the expense of (often false anyway) precision. That idea of course goes well with Infobright&#8217;s recent released Rough Query feature, and also with Datameer&#8217;s year-earlier pitch. Aster Data (now a Teradata company) is positioning itself as [...]]]></description>
			<content:encoded><![CDATA[<p>Five things:  <span id="more-4722"></span></p>
<p>Back in April, Steve Miller suggested that <a href="http://www.information-management.com/blogs/business_intelligence_big_data_analytics_approximate_BI-10020170-1.html">approximate BI</a> could be a growing trend, gaining speed at the expense of (often false anyway) precision. That idea of course goes well with Infobright&#8217;s recent released <a href="../../../../../2011/06/14/infobright-4-0/">Rough Query</a> feature, and also with <a href="../../../../../2010/04/16/introduction-to-datameer/">Datameer&#8217;s year-earlier pitch</a>.</p>
<p>Aster Data (now a Teradata company) is positioning itself as <a href="http://www.asterdata.com/blog/2011/06/13/multi-structured-data-platform-capabilities-required-for-big-data-analytics/">analyzing multi-structured data</a> &#8212; which is my second-choice term, behind the more precise but odder-sounding &#8220;<a href="../../../../../2011/05/17/poly-structured-database/">poly-structured</a>.&#8221; I hope &#8220;poly-structured&#8221; wins, and plan to keep using it myself; but I recognize that &#8220;multi-structured&#8221; may actually be the one that prevails.</p>
<p>Barbara Darrow wrote a great piece on <a href="http://searchdatacenter.techtarget.com/news/2240036530/Oracle-pitches-cut-rate-Exadata-hardware-to-boost-sales">Oracle Exadata pricing</a>. Highlights include:</p>
<ul>
<li>Routine Oracle software discounts are high.</li>
<li>Exadata discounts are higher.</li>
<li>Big/referenceable customers get the best Exadata discounts. The term &#8220;extremely deep&#8221; was used. (I&#8217;ve also heard that from Oracle competitors, with the term &#8220;free&#8221; even coming up, hyperbolically or otherwise.)</li>
<li>Oracle&#8217;s hardware maintenance pricing is forcing users to trash Sun gear, even when it&#8217;s working. One guy told the story of literally crying as the Sun boxes were taken away.</li>
<li>Oracle&#8217;s 22% of license maintenance fee goes up to 27% after two years. I didn&#8217;t know that.</li>
</ul>
<p>Oracle has been making considerable messaging fuss around a win in Japan, where <a href="../../../../../2011/02/02/exadata-notes/">Softbank replaced years-old Teradata systems with vastly less new Exadata gear</a>. I blogged that this is hardly an apples-to-apples comparison. During <a href="../../../../../2011/05/03/oracle-exadata-business-technology/">my visit last April</a>, Oracle pushed back, in particular pointing out that the Softbank division that awarded the deal was very separate from the one that was an Oracle reseller. But Monday Teradata shared with me a counter-pushback, asserting that during the recent worldwide recession, Softbank assigned its underemployed systems integration division to do internal projects &#8212; including the data warehouse upgrade. I.e., Teradata stands by its claim that this replacement was strongly influenced by the Softbank/Oracle partnership.</p>
<p>If you&#8217;re analytically inclined, Kx Systems has some interesting ideas, manifested in kdb+ and so on. A <a href="http://queue.acm.org/detail.cfm?id=1531242">2009 ACM article</a> seems as good a starting point as any, the company&#8217;s website probably aside. Confusingly, <a href="http://kx.com/index.php">Kx</a> is small company that evidently does most of its selling through a couple of much larger partners. Also, 1010data happens to be built on an older version of Kx&#8217;s technology.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.dbms2.com/2011/06/15/notes-and-links-june-15-2011/feed/</wfw:commentRss>
		<slash:comments>4</slash:comments>
		</item>
		<item>
		<title>Hardware for Hadoop</title>
		<link>http://www.dbms2.com/2011/06/04/hardware-for-hadoop/</link>
		<comments>http://www.dbms2.com/2011/06/04/hardware-for-hadoop/#comments</comments>
		<pubDate>Sat, 04 Jun 2011 22:47:12 +0000</pubDate>
		<dc:creator>Curt Monash</dc:creator>
				<category><![CDATA[Cloudera]]></category>
		<category><![CDATA[Hadoop]]></category>
		<category><![CDATA[Pricing]]></category>
		<category><![CDATA[Storage]]></category>
		<category><![CDATA[Yahoo]]></category>

		<guid isPermaLink="false">http://www.dbms2.com/?p=4610</guid>
		<description><![CDATA[After suggesting that there&#8217;s little point to Hadoop appliances, it occurred to me to look into what kinds of hardware actually are used with Hadoop. So far as I can tell: Hadoop nodes today tend to run on fairly standard boxes. Hadoop nodes in the past have tended to run on boxes that were light [...]]]></description>
			<content:encoded><![CDATA[<p>After suggesting that <a href="http://www.dbms2.com/2011/06/02/why-you-would-want-an-appliance-and-when-you-wouldnt/">there&#8217;s little point to Hadoop appliances</a>, it occurred to me to look into what kinds of hardware actually are used with Hadoop. So far as I can tell:</p>
<ul>
<li>Hadoop nodes today tend to run on fairly standard boxes.</li>
<li>Hadoop nodes in the past have tended to run on boxes that were light with respect to RAM.</li>
<li>The number of spindles per core on Hadoop node boxes is going up even as disks get bigger.</li>
</ul>
<p><span id="more-4610"></span>A key input comes from Cloudera, who to my joy delegated the questions to Omer Trajman, who wrote:</p>
<blockquote><p>Most Hadoop deployments today use systems with dual socket and quad or  hex cores (8 or 12 cores total, 16 or 24 hyper-threaded). Storage has  increased as well with 6-8 spindles being common and some deployments  going to 12 spindles. These are SATA disks with between 1TB and 2TB  capacity. The amount of RAM varies depending on the application. 24GB is  common as is 36GB – all ECC RAM. HBase clusters may have more RAM so  they can cache more data. Some customers put Hadoop on their “standard  box” which may not be perfectly balanced (e.g. more RAM, less disk) and  needs to be altered slightly to meet the above specs. The new Dell C2100  series and the HP SL170 series are both popular server lines for  Hadoop.</p>
<p>For a year ago perspective, see this post: <a href="http://www.cloudera.com/blog/2010/03/clouderas-support-team-shares-some-basic-hardware-recommendations/" target="_blank">http://www.cloudera.com/blog/2010/03/clouderas-support-team-shares-some-basic-hardware-recommendations/</a></p></blockquote>
<p>Bullet points from that year-ago link include:</p>
<ul>
<blockquote>
<li>4 1TB hard disks in a JBOD (Just a Bunch Of Disks) configuration</li>
<li>2 quad core CPUs, running at least 2-2.5GHz</li>
<li>16-24GBs of RAM (24-32GBs if you’re considering HBase)</li>
<li>Gigabit Ethernet</li>
</blockquote>
</ul>
<p>So basically we&#8217;re talking in the range of 2-3 GB of RAM per core &#8212; and 1 spindle per core, up from perhaps half a spindle per core a year ago.</p>
<p>Meanwhile, a 2009 <a href="https://opencirrus.org/system/files/OpenCirrusHadoop2009.ppt">Yahoo  slide deck</a> refers to &#8220;500 nodes, 4000 cores, 3TB RAM, 1.5PB disk&#8221;;  that divides out to 8 cores, 6 GB of RAM, and 3 TB of disk per node, all  on &#8220;commodity hardware.&#8221; By 2010 Yahoo was evidently up to <a href="http://twitter.com/#!/marin_dimitrov/status/12900368052">2 GB of RAM per core</a>.</p>
<p>There are lots of data points on the <a href="http://wiki.apache.org/hadoop/PoweredBy">Apache Hadoop wiki</a>, but many seem a few years old, and I don&#8217;t immediately see how to time-stamp them. Overall, they seem consistent with the trends I noted at the top of the post.</p>
<p>One thing I haven&#8217;t done is attempted to price any of these systems.</p>
<p>Contributions in the comment thread would be warmly appreciated.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.dbms2.com/2011/06/04/hardware-for-hadoop/feed/</wfw:commentRss>
		<slash:comments>10</slash:comments>
		</item>
		<item>
		<title>Slashdot venting thread about Oracle/Sun hardware</title>
		<link>http://www.dbms2.com/2011/05/26/slashdot-venting-thread-about-oraclesun-hardware/</link>
		<comments>http://www.dbms2.com/2011/05/26/slashdot-venting-thread-about-oraclesun-hardware/#comments</comments>
		<pubDate>Thu, 26 May 2011 08:51:13 +0000</pubDate>
		<dc:creator>Curt Monash</dc:creator>
				<category><![CDATA[Oracle]]></category>
		<category><![CDATA[Pricing]]></category>

		<guid isPermaLink="false">http://www.dbms2.com/?p=4565</guid>
		<description><![CDATA[Slashdot has what amounts to a venting thread about Oracle/Sun hardware. The one consistent favorable theme is that Sun hardware is good stuff if you want to run Oracle. Otherwise, comments repeatedly say: Product discounts are down, effectively creating a price increase. Service prices are way up for some customers, because cheaper service options have [...]]]></description>
			<content:encoded><![CDATA[<p>Slashdot has what amounts to <a href="http://hardware.slashdot.org/story/11/05/25/0146217/After-a-Lull-Sun-Server-Business-Grows-Under-Oracle?utm_source=headlines&amp;utm_medium=email">a venting thread about Oracle/Sun hardware</a>. The one consistent favorable theme is that Sun hardware is good stuff if you want to run Oracle. Otherwise, comments repeatedly say:</p>
<ul>
<li>Product discounts are down, effectively creating a price increase.</li>
<li>Service prices are way up for some customers, because cheaper service options have been eliminated.</li>
<li>Service quality is unsatisfactory.</li>
<li>Oracle is difficult to do business with.</li>
</ul>
<p>So far, I haven&#8217;t seen any comments to the effect &#8220;I don&#8217;t know what you guys are talking about; we&#8217;re perfectly happy with Sun&#8221;, but surely those will come too.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.dbms2.com/2011/05/26/slashdot-venting-thread-about-oraclesun-hardware/feed/</wfw:commentRss>
		<slash:comments>4</slash:comments>
		</item>
	</channel>
</rss>

