<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>DBMS 2 : DataBase Management System Services &#187; Oracle</title>
	<atom:link href="http://www.dbms2.com/category/products-and-vendors/oracle/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.dbms2.com</link>
	<description>Choices in data management and analysis</description>
	<lastBuildDate>Wed, 08 Feb 2012 12:22:57 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.0.3</generator>
		<item>
		<title>Microsoft SQL Server 2012 and enterprise database choices in general</title>
		<link>http://www.dbms2.com/2012/01/24/microsoft-sql-server-2012/</link>
		<comments>http://www.dbms2.com/2012/01/24/microsoft-sql-server-2012/#comments</comments>
		<pubDate>Tue, 24 Jan 2012 14:42:34 +0000</pubDate>
		<dc:creator>Curt Monash</dc:creator>
				<category><![CDATA[Data warehousing]]></category>
		<category><![CDATA[IBM and DB2]]></category>
		<category><![CDATA[Microsoft and SQL*Server]]></category>
		<category><![CDATA[Mid-range]]></category>
		<category><![CDATA[MySQL]]></category>
		<category><![CDATA[NoSQL]]></category>
		<category><![CDATA[Oracle]]></category>

		<guid isPermaLink="false">http://www.dbms2.com/?p=5859</guid>
		<description><![CDATA[Microsoft is launching SQL Server 2012 on March 7. An IM chat with a reporter resulted, and went something like this. Reporter: [Care to comment]? CAM: SQL Server is an adequate product if you don&#8217;t mind being locked into the Microsoft stack. For example, the ColumnStore feature is very partial, given that it can&#8217;t be [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://www.sqlserverlaunch.com/ww/Home">Microsoft is launching SQL Server 2012 on March 7</a>. An IM chat with a reporter resulted, and went something like this.</p>
<p><strong>Reporter: [Care to comment]?</strong><br />
<strong>CAM:</strong> SQL Server is an adequate product if you don&#8217;t mind being locked into the Microsoft stack. For example, the ColumnStore feature is very partial, given that <a href="http://msdn.microsoft.com/en-us/library/gg492088%28v=sql.110%29.aspx#Update">it can&#8217;t be updated</a>; but Oracle doesn&#8217;t have columnar storage at all.</p>
<p><strong>Reporter: Is the lock-in overall worse than IBM DB2, Oracle?</strong><br />
<strong>CAM:</strong> Microsoft locks you into an operating system, so yes.</p>
<p><strong>Reporter: Is this release something larger Oracle or IBM shops could consider as a lower-cost alternative a co-habitation scenario, in the event they&#8217;re mulling whether to buy more Oracle or IBM licenses?</strong><br />
<strong>CAM:</strong> If they have a strong Microsoft-stack investment already, sure. Otherwise, why?</p>
<p><strong>Reporter: [How about] just cost?</strong><br />
<strong>CAM:</strong> DB2 works just as well to keep Oracle honest as SQL Server does, and without a major operating system commitment. For analytic databases you want an analytic DBMS or appliance anyway.</p>
<p>Best is to have one major vendor of OTLP/general-purpose DBMS, a web DBMS, a DBMS for disposable projects (that may be the same as one of the first two), plus however many different analytic data stores you need to get the job done.</p>
<p>By &#8220;web DBMS&#8221; I mean MySQL, NewSQL, or NoSQL. Actually, you might need more than one product in that area.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.dbms2.com/2012/01/24/microsoft-sql-server-2012/feed/</wfw:commentRss>
		<slash:comments>6</slash:comments>
		</item>
		<item>
		<title>Notes on the Oracle Big Data Appliance</title>
		<link>http://www.dbms2.com/2012/01/10/notes-on-the-oracle-big-data-appliance/</link>
		<comments>http://www.dbms2.com/2012/01/10/notes-on-the-oracle-big-data-appliance/#comments</comments>
		<pubDate>Wed, 11 Jan 2012 01:32:39 +0000</pubDate>
		<dc:creator>Curt Monash</dc:creator>
				<category><![CDATA[Cloudera]]></category>
		<category><![CDATA[Hadoop]]></category>
		<category><![CDATA[MapReduce]]></category>
		<category><![CDATA[Oracle]]></category>
		<category><![CDATA[Pricing]]></category>

		<guid isPermaLink="false">http://www.dbms2.com/?p=5809</guid>
		<description><![CDATA[Oracle announced its Big Data Appliance. Specs may be found in the Oracle Big Data Appliance press release. Beyond that: The most important software on the Oracle Big Data Appliance is a full set of Cloudera Enterprise code. Oracle will do Tier 1 Cloudera/Hadoop support, while Cloudera handles Tiers 2 and 3. The key spec [...]]]></description>
			<content:encoded><![CDATA[<p>Oracle announced its Big Data Appliance. Specs may be found in <a href="http://www.oracle.com/us/corporate/press/1453721">the Oracle Big Data Appliance press release</a>. Beyond that:</p>
<ul>
<li>The most important software on the Oracle Big Data Appliance is a full set of <a href="../2012/01/10/a-couple-of-links-explaining-cloudera-manager/">Cloudera Enterprise</a> code. Oracle will do Tier 1 Cloudera/Hadoop support, while Cloudera handles Tiers 2 and 3.</li>
<li>The key spec ratios are 1 core/4 GB RAM/3 TB raw disk. That&#8217;s reasonably in line with <a href="http://www.dbms2.com/2011/06/04/hardware-for-hadoop/">Cloudera figures I published in June, 2010</a>.</li>
<li>This is really Oracle&#8217;s <a href="http://www.dbms2.com/2012/01/08/big-data-terminology-and-positioning/">multi-structured big data appliance</a>. Oracle&#8217;s relational big data appliance is Exadata, which has been out for years and has comparable capacity to Oracle&#8217;s new &#8220;Big Data Appliance.&#8221; (<a href="http://www.eweek.com/c/a/IT-Infrastructure/Oracle-Launches-ClouderaPowered-Big-Data-Appliance-172364/">Chris Preimesberger</a> made a similar point.)</li>
<li>The Oracle Big Data Appliance list price is $450,000 for 18 12-core servers, plus $54,000/year maintenance.
<ul>
<li>That&#8217;s around $25,000 per server (and associated storage).</li>
<li>That&#8217;s also around $2,000/core.</li>
<li>That&#8217;s also around $500/TB of spinning disk, before <a href="http://www.dbms2.com/2011/07/06/hadoop-hardware-and-compression/">compression</a>.</li>
<li>None of those per-unit figures sounds ridiculous &#8230;</li>
<li>&#8230; but because of Oracle&#8217;s appliance configuration there&#8217;s indeed a hefty minimum initial purchase.</li>
</ul>
</li>
</ul>
<p><a href="http://www.zdnet.com/blog/btl/oracle-rolls-out-big-data-play-with-aggressive-price-cloudera/66529"><span id="more-5809"></span>Peter Goldmacher</a> argues that, because of size and price point, the Oracle Big Data appliance is targeted for high-end deployments rather than starter/test/development set-ups. To first approximation, that makes sense, in that:</p>
<ul>
<li>The Oracle Big Data Appliance is in the petabyte range for data capacity, and &#8230;</li>
<li>&#8230; <a href="http://www.dbms2.com/2011/07/06/petabyte-hadoop-clusters/">the number of petabyte-scale Hadoop deployments is in the low tens</a>, and &#8230;</li>
<li>&#8230; many of those aren&#8217;t at Oracle shops anyway.</li>
</ul>
<p>Surely the Oracle Big Data Appliance isn&#8217;t designed for the 4-8 node play-with-Hadoop crowd.</p>
<p>On the the other hand, if you&#8217;re at a big, committed Oracle shop, and you want to do your first serious Hadoop deployment, why not go with the Oracle Big Data Appliance? You probably could save money with an alternative approach &#8212; but if your employers are committed to Oracle, saving money is surely not their greatest concern. Overpay by a bit; make your management happy with the Oracle logo; get Hadoop on your resume; prosper. That seems like a winning plan all the way around.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.dbms2.com/2012/01/10/notes-on-the-oracle-big-data-appliance/feed/</wfw:commentRss>
		<slash:comments>4</slash:comments>
		</item>
		<item>
		<title>A couple of links explaining Cloudera Manager</title>
		<link>http://www.dbms2.com/2012/01/10/a-couple-of-links-explaining-cloudera-manager/</link>
		<comments>http://www.dbms2.com/2012/01/10/a-couple-of-links-explaining-cloudera-manager/#comments</comments>
		<pubDate>Tue, 10 Jan 2012 22:23:22 +0000</pubDate>
		<dc:creator>Curt Monash</dc:creator>
				<category><![CDATA[Cloudera]]></category>
		<category><![CDATA[Data warehouse appliances]]></category>
		<category><![CDATA[Hadoop]]></category>
		<category><![CDATA[MapReduce]]></category>
		<category><![CDATA[Oracle]]></category>

		<guid isPermaLink="false">http://www.dbms2.com/?p=5798</guid>
		<description><![CDATA[Predictably, I wasn&#8217;t pre-briefed on the details of Oracle&#8217;s Big Data Appliance announcement today, and an inquiry to partner Cloudera doesn&#8217;t happen to have been immediately answered.* But anyhow, it&#8217;s clear from coverage by Larry Dignan and Derrick Harris that Oracle&#8217;s Big Data Appliance includes: Some version of Cloudera Manager (I&#8217;m guessing more or less [...]]]></description>
			<content:encoded><![CDATA[<p>Predictably, I wasn&#8217;t pre-briefed on the details of Oracle&#8217;s Big Data Appliance announcement today, and an inquiry to partner Cloudera doesn&#8217;t happen to have been immediately answered.* But anyhow, it&#8217;s clear from coverage by <a href="http://www.zdnet.com/blog/btl/oracle-rolls-out-big-data-play-with-aggressive-price-cloudera/66529">Larry Dignan</a> and <a href="http://gigaom.com/cloud/cloudera-brings-the-hadoop-to-oracles-big-data-appliance/">Derrick Harris</a> that Oracle&#8217;s Big Data Appliance includes:</p>
<ul>
<li>Some version of Cloudera Manager (I&#8217;m guessing more or less the best one).*</li>
<li>Some version of Apache Hadoop (I&#8217;m guessing the same distribution that Cloudera prefers to use).*</li>
<li>Some kind of support.</li>
</ul>
<p>In other words, it&#8217;s a lot like getting Cloudera Enterprise,* plus some hardware, plus some other stuff.</p>
<p><em>*Edit: About 2 minutes after I posted this, I got email from Cloudera CEO Mike Olson. Yes, the Oracle Big Data Appliance bundles Cloudera Enterprise.</em></p>
<p>That raises an anyway recurring question: <strong>What exactly is Cloudera Manager?</strong> <span id="more-5798"></span>When asked, I&#8217;ve always tended to mumble something like: <strong>Um, it&#8217;s management stuff. </strong>There&#8217;s an overview on <a href="http://www.cloudera.com/products-services/tools/">the Cloudera Manager product page</a>, but it doesn&#8217;t really say much, even if you click on the Data Sheet link. More helpful, I think, is <a href="http://www.cloudera.com/blog/2011/12/cloudera-manager-3-7-released/">a December post on Cloudera&#8217;s busy blog</a>. Technically, the post is about the new features in the Cloudera Manager 3.7 point release, but more generally it helps to explain what Cloudera Manager does, in areas such as (and these bullet points are all direct quotes):</p>
<ul>
<li> Automated Hadoop Deployment</li>
<li> Centralized Management</li>
<li> Configuration Management</li>
<li> Service Monitoring</li>
<li> Log Search</li>
<li> Events and Alerts</li>
<li> Configuration versioning and Audit trails</li>
<li> Activity Monitoring</li>
<li> Operational Reports</li>
</ul>
<p>Taken together,<strong> those two Cloudera links do a pretty good job of explaining Cloudera Manager, and illustrating why a Hadoop user would want to have either Cloudera Manager or a similar competitive offering.</strong></p>
<p><em>Edit: The day after I originally made this post, Cloudera put up another post <a href="http://www.cloudera.com/blog/2012/01/cloudera-manager-thank-you-customers/">directly explaining what Cloudera Manager is about</a>.<br />
</em></p>
]]></content:encoded>
			<wfw:commentRss>http://www.dbms2.com/2012/01/10/a-couple-of-links-explaining-cloudera-manager/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Big data terminology and positioning</title>
		<link>http://www.dbms2.com/2012/01/08/big-data-terminology-and-positioning/</link>
		<comments>http://www.dbms2.com/2012/01/08/big-data-terminology-and-positioning/#comments</comments>
		<pubDate>Mon, 09 Jan 2012 01:35:57 +0000</pubDate>
		<dc:creator>Curt Monash</dc:creator>
				<category><![CDATA[Cassandra]]></category>
		<category><![CDATA[Data models and architecture]]></category>
		<category><![CDATA[Data warehousing]]></category>
		<category><![CDATA[Exadata]]></category>
		<category><![CDATA[Facebook]]></category>
		<category><![CDATA[Google]]></category>
		<category><![CDATA[HBase]]></category>
		<category><![CDATA[Hadoop]]></category>
		<category><![CDATA[Log analysis]]></category>
		<category><![CDATA[MarkLogic]]></category>
		<category><![CDATA[Market share and customer counts]]></category>
		<category><![CDATA[NoSQL]]></category>
		<category><![CDATA[Oracle]]></category>
		<category><![CDATA[Splunk]]></category>
		<category><![CDATA[Yahoo]]></category>

		<guid isPermaLink="false">http://www.dbms2.com/?p=5768</guid>
		<description><![CDATA[Recently, I observed that Big Data terminology is seriously broken. It is reasonable to reduce the subject to two quasi-dimensions: Bigness &#8212; Volume, Velocity, size Structure &#8212; Variety, Variability, Complexity given that High-velocity &#8220;big data&#8221; problems are usually high-volume as well.* Variety, variability, and complexity all relate to the simply-structured/poly-structured distinction. But the conflation should [...]]]></description>
			<content:encoded><![CDATA[<p>Recently, I observed that <a href="../../../../../2011/09/11/big-data-has-jumped-the-shark/">Big Data terminology is seriously broken</a>. It is reasonable to reduce the subject to two quasi-dimensions:</p>
<ul>
<li><strong>Bigness</strong> &#8212; Volume, Velocity, size</li>
<li><strong>Structure</strong> &#8212; Variety, Variability, Complexity</li>
</ul>
<p>given that</p>
<ul>
<li>High-velocity &#8220;big data&#8221; problems are usually high-volume as well.*</li>
<li>Variety, variability, and complexity all relate to the <a href="../../../../../2011/05/17/poly-structured-database/">simply-structured/poly-structured</a> distinction.</li>
</ul>
<p>But the conflation should stop there.</p>
<p><em>*Low-volume/high-velocity problems are commonly referred to as <a href="../2011/08/25/renaming-cep-or-not/">&#8220;event processing&#8221; and/or &#8220;streaming&#8221;</a>.</em></p>
<p>When people claim that bigness and structure are the same issue, they oversimplify into mush. So I think we need four pieces of terminology, reflective of a 2&#215;2 matrix of possibilities. For want of better alternatives, my suggestions are:</p>
<ul>
<li><strong>Relational big data</strong> is data of high volume that fits well into a relational DBMS.</li>
<li><strong>Multi-structured big data</strong> is data of high volume that doesn&#8217;t fit well into a relational DBMS. <em>Alternative: Poly-structured big data.</em></li>
<li><strong>Conventional relational data</strong> is data of not-so-high volume that fits well into a relational DBMS. <em>Alternatives: Ordinary/normal/smaller relational data.</em></li>
<li><strong>Smaller poly-structured data</strong> is data for which <a href="../../../../../2011/07/31/dynamic-fixed-schema-databases/">dynamic schema</a> capabilities are important, but which doesn&#8217;t rise to &#8220;big data&#8221; volume.</li>
</ul>
<p><span id="more-5768"></span>Notes on all this include:</p>
<ul>
<li>&#8220;Relational big data&#8221; is commonly what you need a scalable analytic relational DBMS for. But there are non-analytic use cases as well.</li>
<li>The paradigmatic example of &#8220;multi-structured big data&#8221; is log files. Thus, multi-structured big data is commonly what you need a <a href="../../../../../2011/06/04/dirty-data-stored-dirt-cheap/">big bit bucket</a> for.</li>
<li>One might want to equate non-analytic relational big data technology to &#8220;NewSQL&#8221;. However, I&#8217;m struggling to think of a database size range in which the entire NewSQL industry can match Oracle&#8217;s market share alone.</li>
<li>One might want to equate non-analytic multi-structured big data technology to &#8220;NoSQL&#8221;. However:
<ul>
<li>&#8220;NoSQL&#8221; is also used to encompass not-so-big-data use cases, such as prototyping in MongoDB.</li>
<li><a href="../../../../../2011/10/02/defining-nosql/">&#8220;NoSQL&#8221; has non-ACID/low(er)-data-integrity connotations</a> that aren&#8217;t appropriate for all non-relational systems.</li>
</ul>
</li>
<li>Up to a point, you can analyze relational big data in a conventional relational DBMS, but an analytic RDBMS will usually win on TCO (Total Cost of Ownership). In particular, reasonable thresholds for moving an analytic database off Oracle might be:
<ul>
<li>1-2 terabytes if you&#8217;ve never bought anything past Oracle Standard Edition.</li>
<li>5-10 terabytes if you&#8217;re already paying for Oracle Enterprise Edition.</li>
<li>A lot higher than that if you actually find Oracle Exadata to be cost-effective.</li>
</ul>
</li>
<li>Depending on how big one acknowledges as &#8220;big&#8221;, the market share leader in &#8220;big bit bucket&#8221; use cases is either Splunk or Hadoop.</li>
<li>If we look at multi-structured big data management overall, MarkLogic joins the list of market share contenders, as do various NoSQL alternatives.</li>
<li>It is wrong to say that the large web companies invented &#8220;big data&#8221; technology. But it is more reasonable to say they invented much of &#8220;multi-structured big data&#8221; management. In particular (and this is just a partial list), Google, Amazon, Yahoo, Facebook, et al. can reasonably be credited with Hadoop, Cassandra, HBase and various predecessors to same.</li>
</ul>
]]></content:encoded>
			<wfw:commentRss>http://www.dbms2.com/2012/01/08/big-data-terminology-and-positioning/feed/</wfw:commentRss>
		<slash:comments>8</slash:comments>
		</item>
		<item>
		<title>Some big-vendor execution questions, and why they matter</title>
		<link>http://www.dbms2.com/2011/11/21/big-vendor-execution-analytics/</link>
		<comments>http://www.dbms2.com/2011/11/21/big-vendor-execution-analytics/#comments</comments>
		<pubDate>Mon, 21 Nov 2011 11:01:20 +0000</pubDate>
		<dc:creator>Curt Monash</dc:creator>
				<category><![CDATA[Business intelligence]]></category>
		<category><![CDATA[Cognos]]></category>
		<category><![CDATA[Columnar database management]]></category>
		<category><![CDATA[Data warehouse appliances]]></category>
		<category><![CDATA[Data warehousing]]></category>
		<category><![CDATA[Exadata]]></category>
		<category><![CDATA[HP and Neoview]]></category>
		<category><![CDATA[Hadoop]]></category>
		<category><![CDATA[IBM and DB2]]></category>
		<category><![CDATA[In-memory DBMS]]></category>
		<category><![CDATA[Investment research and trading]]></category>
		<category><![CDATA[Memory-centric data management]]></category>
		<category><![CDATA[Netezza]]></category>
		<category><![CDATA[NoSQL]]></category>
		<category><![CDATA[Oracle]]></category>
		<category><![CDATA[SAP AG]]></category>
		<category><![CDATA[Vertica Systems]]></category>

		<guid isPermaLink="false">http://www.dbms2.com/?p=5704</guid>
		<description><![CDATA[When I drafted a list of key analytics-sector issues in honor of look-ahead season, the first item was &#8220;execution of various big vendors&#8217; ambitious initiatives&#8221;.  By &#8220;execute&#8221; I mean mainly: &#8220;Deliver products that really meet customers&#8217; desires and needs.&#8221; &#8220;Successfully convince them that you&#8217;re doing so &#8230;&#8221; &#8220;&#8230; at an attractive overall cost.&#8221; Vendors mentioned [...]]]></description>
			<content:encoded><![CDATA[<p>When I drafted a list of key analytics-sector issues in honor of <a href="http://www.dbms2.com/2011/11/21/analytic-trends-in-2012-qa/">look-ahead season</a>, the first item was &#8220;execution of various big vendors&#8217; ambitious initiatives&#8221;.  By &#8220;execute&#8221; I mean mainly:</p>
<ul>
<li>&#8220;Deliver products that really meet customers&#8217; desires and needs.&#8221;</li>
<li> &#8220;Successfully convince them that you&#8217;re doing so &#8230;&#8221;</li>
<li>&#8220;&#8230; at an attractive overall cost.&#8221;</li>
</ul>
<p>Vendors mentioned here are Oracle, SAP, HP, and IBM. Anybody smaller got left out due to the length of this post. Among the bigger omissions were:</p>
<ul>
<li>salesforce.com (multiple subjects).</li>
<li><a href="../../../../../2011/04/21/sas-hpa-does-make-sense-after-all/">SAS HPA</a>.</li>
<li><a href="../../../../../2011/08/21/hadoop-evolution/">The evolution of Hadoop</a>.</li>
</ul>
<p><span id="more-5704"></span><strong>A (lingering) issue for SAP and Oracle alike</strong></p>
<p>As I noted in January of this year, <a href="../../../../../2011/01/03/the-six-useful-things-you-can-do-with-analytic-technology/">integration of business intelligence into operational apps is making very slow progress</a>. Even so, it&#8217;s a huge part of the apparent strategy at SAP and Oracle alike, as well it should be. Much of the benefit from automating routine desk work has already happened. The areas ripest for exploitation are the ones where analytics are part of the equation.</p>
<p>Given the lack of tangible progress, why do I think this is a genuine area of Oracle and SAP emphasis? Three reasons of many are:</p>
<ul>
<li>Why else did SAP buy Business Objects?</li>
<li>If they&#8217;re not trying to <a href="../../../../../2011/03/30/short-request-and-analytic-processing/">integrate operational apps and analytics</a>, why else does SAP&#8217;s emphasis on HANA make sense?</li>
<li>Without business intelligence in the picture, how does Oracle&#8217;s integrated-stack story promise any direct user benefits?*</li>
</ul>
<p><em>*As opposed to IT concerns &#8212; integration, administration, TCO (Total Cost of Ownership), etc.</em></p>
<p>After so many years of disappointment, I&#8217;m not going to forecast 2012 as a pivotal year for <strong>the integration of business intelligence into operational applications.</strong> But if one of SAP or Oracle ever does get a significant BI/operational app integration advantage over the other, it could be a major competitive advantage in those application market segments that are still up for grabs. It also is an opportunity for both vendors to gain BI market share in their respective application customer bases.</p>
<p><strong>A more urgent issue for SAP</strong></p>
<p>SAP has put huge amounts of credibility on the line for HANA, the integration of two different and not particularly mature in-memory database technologies. So far, it is difficult to find evidence that HANA is robust enough for widespread adoption. Whether or not SAP can fix that is a huge open question, which could have significant impact on the course of several technology areas: applications, business intelligence, in-memory DBMS, and maybe even hardware.</p>
<p>Based on current information, which is admittedly partial, I&#8217;m a short-term pessimist on HANA. Longer-term, I&#8217;m on record as saying that <a href="../../../../../2011/05/23/databases-ram/">traditional databases will eventually wind up in RAM</a>. SAP will surely get that technology right some day, whether or not the way it does so has anything to do with present-day HANA code.</p>
<p><strong>Four more issues for Oracle </strong></p>
<p>Oracle&#8217;s ambitions are near-endless, and so also therefore is its list of execution challenges. Four in the analytics area that I find particularly interesting are:</p>
<ul>
<li><strong>True hybrid columnar DBMS.</strong> <a href="../../../../../2011/09/22/teradata-columnar-compression/">I was guessing that Oracle, like Teradata, would announce true hybrid columnar the week of Oracle OpenWorld</a>. I was wrong. But if Oracle can&#8217;t bring out true hybrid columnar DBMS functionality relatively soon, Exadata will lose credibility as a competitor to more specialized analytic DBMS.</li>
<li><strong>Oracle Exalytics.</strong> With Exalytics in the mix, Oracle&#8217;s technology stack has HANA-like potential. But will Exalytics even ship in 2012? (I think so.) Will it be good for much in the first release? (I&#8217;m skeptical.)</li>
<li><strong>Oracle&#8217;s Big Data Appliance</strong>. I&#8217;m skeptical both about <a href="../../../../../2011/10/20/more-notes-on-oracle-nosql/">Oracle&#8217;s NoSQL product</a> &#8212; <a href="http://www.infoworld.com/d/data-explosion/first-look-oracle-nosql-database-179107">a favorable InfoWorld review</a> notwithstanding &#8212; and <a href="../../../../../2011/09/23/hadoop-appliances/">Hadoop appliances</a>. But if I&#8217;m wrong, and Oracle can successfully embrace/extend the new non-relational paradigms, then it really might regain control over the evolution of data management.</li>
<li><strong><a href="../../../../../2011/10/18/oracle-is-buying-endeca/">Oracle&#8217;s Endeca acquisition</a></strong> &#8212; will Oracle prove me wrong and integrate Endeca effectively into its overall analytic product line? If it does, we might finally see effective text (and eventually speech) navigation of enterprise software. (But as with all Oracle issues cited here, this is something that probably won&#8217;t amount to much in 2012 even if it does later go well.)</li>
</ul>
<p><strong>Three issues for IBM</strong></p>
<p>Like Oracle, IBM is a huge company with many ambitions and hence many execution challenges. The biggest of those is surely: <strong>How effective can IBM be at selling outside its existing customer base?</strong> I don&#8217;t hear as much competitively about IBM DataStage, IBM SPSS or now IBM Netezza as I did when their vendors were independent companies. Even Cognos may not be much of an exception to the rule, although it has its own large customer base outside of IBM&#8217;s traditional one. (To lesser extents , the same is of course true of Netezza and numerous other IBM acquisitions.)</p>
<p>Another general issue for IBM is <strong>substantively integrating its various product lines,</strong> at least to the extent that makes sense. DB2/Netezza integration sounds good, but even that is a matter more of product marketing (the admirable part of that discipline) more than of actual technology. Other integrations (e.g. Cognos/DB2 in various bundles) have tended toward the dubious side.*</p>
<p><em>*I&#8217;m still waiting for IBM to get back to me with examples of how Cognos/DB2 joint tuning amounts to anything. It&#8217;s been more than a year, so I&#8217;m glad I didn&#8217;t hold my breath.</em></p>
<p>In a somewhat narrower vein, I wonder: <strong><a href="../../../../../2011/11/10/cep-streaming-catchup/">Will IBM be able to gain traction for InfoSphere Streams</a>? </strong>And if so, when and where will the traction be?</p>
<p><strong>Will HP screw up Vertica?</strong></p>
<p>Vertica has a very attractive product offering. It&#8217;s perhaps <a href="../../../../../2011/06/20/columnar-dbms-vendor-customer-metrics/">the most scalable analytic DBMS outside of Teradata</a>, running on the hardware of your reasonable choice.  It&#8217;s also the one I recommend most often to clients in the 1-50 terabyte range.</p>
<p>So far HP doesn&#8217;t seem to have done much to leadfoot Vertica. (About all I&#8217;ve heard from competitors is that Vertica seems to have faded somewhat in the financial services market, and there could be multiple explanations if that is indeed true.) But if HP Vertica does somehow manage to botch things, opportunities will open up for a range of columnar analytic DBMS competitors.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.dbms2.com/2011/11/21/big-vendor-execution-analytics/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>NoSQL notes</title>
		<link>http://www.dbms2.com/2011/10/23/nosql-notes/</link>
		<comments>http://www.dbms2.com/2011/10/23/nosql-notes/#comments</comments>
		<pubDate>Mon, 24 Oct 2011 04:20:27 +0000</pubDate>
		<dc:creator>Curt Monash</dc:creator>
				<category><![CDATA[Basho and Riak]]></category>
		<category><![CDATA[Cassandra]]></category>
		<category><![CDATA[Cloudera]]></category>
		<category><![CDATA[Clustering]]></category>
		<category><![CDATA[Couchbase]]></category>
		<category><![CDATA[HBase]]></category>
		<category><![CDATA[Market share and customer counts]]></category>
		<category><![CDATA[MongoDB and 10gen]]></category>
		<category><![CDATA[NoSQL]]></category>
		<category><![CDATA[Open source]]></category>
		<category><![CDATA[Oracle]]></category>
		<category><![CDATA[Parallelization]]></category>

		<guid isPermaLink="false">http://www.dbms2.com/?p=5522</guid>
		<description><![CDATA[Last week I visited with James Phillips of Couchbase, Max Schireson and Eliot Horowitz of 10gen, and Todd Lipcon, Eric Sammer, and Omer Trajman of Cloudera. I guess it&#8217;s time for a round-up NoSQL post. Views of the NoSQL market horse race are reasonably consistent, with perhaps some elements of “Where you stand depends upon [...]]]></description>
			<content:encoded><![CDATA[<p>Last week I visited with James Phillips of Couchbase, Max Schireson and Eliot Horowitz of 10gen, and Todd Lipcon, Eric Sammer, and Omer Trajman of Cloudera. I guess it&#8217;s time for a round-up NoSQL post. <img src='http://www.dbms2.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' /> </p>
<p>Views of the NoSQL market horse race are reasonably consistent, with perhaps some elements of “Where you stand depends upon where you sit.”</p>
<ul>
<li>As      James tells it, NoSQL is simply a three-horse race between Couchbase,      MongoDB, and Cassandra.</li>
<li>Max      would include HBase on the list.</li>
<li>Further,      Max pointed out that metrics such as job listings suggest MongoDB has the      most development activity, and Couchbase/Membase/CouchDB perhaps have      less.</li>
<li>The Cloudera      guys remarked on some serious HBase adopters.*</li>
<li>Everybody      I spoke with agreed that Riak had little current market presence, although      some Basho guys could surely be found who&#8217;d disagree.</li>
</ul>
<p><span id="more-5522"></span><em>*I hope to do a separate post on HBase adoption soon. In connection with that, any info on HBase adoption by Facebook (said to be very heavy), Twitter, et al. would be much appreciated.</em></p>
<p>The reasons for using NoSQL of course are, in some order, <a href="../../../../../2011/07/31/dynamic-fixed-schema-databases/">dynamic schemas</a>, scale-out, and open source. <a href="http://www.dbms2.com/2011/10/23/transparent-relational-oltp-scale-out/">I find the scale-out argument somewhat bogus</a>,* but the data model one is very real. Depending on whom you talk with, the most important point about dynamic schemas may actually be that they’re changeable, or it may just be that you don’t have to specify a schema at the time of initial application design. MongoDB gets particular praise as a good platform on which to throw something together quickly, although predictions as to how far the application will then scale may differ depending on whether you’re talking with, say, Max or Todd.</p>
<p><em>*It’s fair to say that NoSQL systems are more proven in scale-out than most relational DBMS. Even so, I would cringe at any line of reasoning that concluded one should adopt NoSQL because it is more mature than relational alternatives.</em></p>
<p>Finally, I was perhaps too extreme when <a href="../../../../../2011/10/20/more-notes-on-oracle-nosql/">I suggested there was no good reason for Oracle to have adopted the major key/minor key approach it took in its NoSQL offering</a>. Todd offered a reason why that approach – which he characterized as similar to Project Voldemort’s – could make sense:</p>
<ul>
<li>If you      have some kind of global secondary index, it’s hard to maintain that index      consistently without what amounts to distributed transactions.</li>
<li>If you      want to avoid the overhead of those, one alternative is a column-group      system such as HBase or Cassandra. Those have no indexes at all, except in      the sense that a column is its own index.</li>
<li>Another      alternative is to load as much indexing information as you can into the      key of a key-value store.</li>
</ul>
<p>I’d be interested to learn about the Couchbase and MongoDB answers to that challenge.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.dbms2.com/2011/10/23/nosql-notes/feed/</wfw:commentRss>
		<slash:comments>12</slash:comments>
		</item>
		<item>
		<title>Transparent relational OLTP scale-out</title>
		<link>http://www.dbms2.com/2011/10/23/transparent-relational-oltp-scale-out/</link>
		<comments>http://www.dbms2.com/2011/10/23/transparent-relational-oltp-scale-out/#comments</comments>
		<pubDate>Mon, 24 Oct 2011 04:19:09 +0000</pubDate>
		<dc:creator>Curt Monash</dc:creator>
				<category><![CDATA[Clustering]]></category>
		<category><![CDATA[IBM and DB2]]></category>
		<category><![CDATA[MySQL]]></category>
		<category><![CDATA[NoSQL]]></category>
		<category><![CDATA[OLTP]]></category>
		<category><![CDATA[Open source]]></category>
		<category><![CDATA[Oracle]]></category>
		<category><![CDATA[Parallelization]]></category>
		<category><![CDATA[Schooner Information Technology]]></category>
		<category><![CDATA[dbShards and CodeFutures]]></category>

		<guid isPermaLink="false">http://www.dbms2.com/?p=5521</guid>
		<description><![CDATA[There’s a perception that, if you want (relatively) worry-free database scale-out, you need a non-relational/NoSQL strategy. That perception is false. In the analytic case it’s completely ridiculous, as has been demonstrated by Teradata, Vertica, Netezza, and various other MPP (Massively Parallel Processing) analytic DBMS vendors. And now it’s false for short-request/OLTP (OnLine Transaction Processing) use [...]]]></description>
			<content:encoded><![CDATA[<p>There’s a perception that, if you want (relatively) worry-free database scale-out, you need a non-relational/NoSQL strategy. That perception is false. In the analytic case it’s completely ridiculous, as has been demonstrated by <a href="../../../../../2011/09/24/confusion-about-teradatas-big-customers/">Teradata</a>, <a href="../../../../../2011/06/20/columnar-dbms-vendor-customer-metrics/">Vertica</a>, Netezza, and various other MPP (Massively Parallel Processing) analytic DBMS vendors. And now it’s false for <a href="../../../../../2011/03/02/short-request-processing/">short-request</a>/OLTP (OnLine Transaction Processing) use cases as well.</p>
<p>My favorite relational OLTP scale-out choice these days is <a href="http://www.dbms2.com/2011/10/23/schooner-pivots-further/">the SchoonerSQL/dbShards partnership</a>. Schooner Information Technology (SchoonerSQL) and Code Futures (dbShards) are young, small companies, but I’m not too concerned about that, because the APIs they want you to write to are just MySQL’s. The main scenarios in which I can see them failing are ones in which they are competitively leapfrogged, either by other small competitors – e.g. ScaleBase, Akiban, TokuDB, or ScaleDB &#8212; or by Oracle/MySQL itself. While that could suck for my clients Schooner and Code Futures, it would still provide users relying on MySQL scale-out with one or more good product alternatives.</p>
<p>Relying on non-MySQL NewSQL startups, by way of contrast, would leave me somewhat more concerned. (However, if their code is open sourced. you have at least some vendor-failure protection.) And big-vendor scale-out offerings, such as Oracle RAC or <a href="../../../../../2011/05/06/db2-oltp-scale-out-purescale/">DB2 pureScale</a>, may be more complex to deploy and administer than the MySQL and NewSQL alternatives.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.dbms2.com/2011/10/23/transparent-relational-oltp-scale-out/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>Schooner pivots further</title>
		<link>http://www.dbms2.com/2011/10/23/schooner-pivots-further/</link>
		<comments>http://www.dbms2.com/2011/10/23/schooner-pivots-further/#comments</comments>
		<pubDate>Mon, 24 Oct 2011 04:18:11 +0000</pubDate>
		<dc:creator>Curt Monash</dc:creator>
				<category><![CDATA[Clustering]]></category>
		<category><![CDATA[MySQL]]></category>
		<category><![CDATA[OLTP]]></category>
		<category><![CDATA[Oracle]]></category>
		<category><![CDATA[Parallelization]]></category>
		<category><![CDATA[Schooner Information Technology]]></category>
		<category><![CDATA[dbShards and CodeFutures]]></category>

		<guid isPermaLink="false">http://www.dbms2.com/?p=5523</guid>
		<description><![CDATA[Schooner Information Technology started out as a complete-system MySQL appliance vendor. Then Schooner went software-only, but continued to brag about great performance in configurations with solid-state drives. Now Schooner has pivoted further, and is emphasizing high availability, clustered performance, and other hardware-agnostic OLTP (OnLine Transaction Processing) features. Fortunately, Schooner has some interesting stuff in those [...]]]></description>
			<content:encoded><![CDATA[<p>Schooner Information Technology started out as a complete-system MySQL appliance vendor. Then <a href="../../../../../2011/01/28/schooner-software-onl/">Schooner went software-only, but continued to brag about great performance in configurations with solid-state drives</a>. Now Schooner has pivoted further, and is emphasizing high availability, clustered performance, and other hardware-agnostic OLTP (OnLine Transaction Processing) features. Fortunately, Schooner has some interesting stuff in those areas to talk about.</p>
<p>The short form of the SchoonerSQL (as Schooner’s product is now called) story goes roughly like this:</p>
<ul>
<li>SchoonerSQL      replicates data &#8212; synchronously if the replication target is local,      asynchronously if it is remote.</li>
<li>Local      synchronous replication provides high availability; remote asynchronous      replication provides disaster recovery.</li>
<li>SchoonerSQL’s      local synchronous replication also provides read scale-out.</li>
<li>Schooner      has a partnership with Code Futures/dbShards to provide write scale-out      via <a href="../../../../../2011/02/24/transparent-sharding/">transparent      sharding</a>.</li>
<li>SchoonerSQL      has some secret sauce in replication performance. This has the effect of      significantly increasing write performance (assuming you were going to      replicate anyway), because otherwise you might have to slow down the      master server&#8217;s write performance so that the slaves can keep up with it.</li>
<li>Schooner      believes it still has some single-server performance advantages as well.</li>
</ul>
<p><span id="more-5523"></span><em>Just to be clear here: Schooner is my client. Code Futures is my  client. I introduced them and suggested their partnership. I then  introduced them both to a user client, who was sufficiently impressed to  seriously evaluate both of them. Even so, some of the Schooner  replication story only became clear to me when I visited last Friday.</em></p>
<p>To flesh that out a bit more:</p>
<ul>
<li>Schooner      has been making performance tweaks to the MySQL/InnoDB stack for several      years. Some of them are still relevant, and can offer 50-100% performance      improvements, especially but not only if you use solid-state storage.</li>
<li>Schooner      has found a way to scale up the slave side of master-slave replication,      both in the synchronous and asynchronous cases.
<ul>
<li>The       core idea of SchoonerSQL parallel (i.e. scale-up) replication is       straightforward. The replication log streams in. A chunk is sent off to a       CPU core. The next chunk is examined for conflicts with the first chunk.       If there are none, it is sent off to a different core to be processed</li>
<li>Thus,       you can have both local replication/high availability and remote       replication/disaster recovery without slowing down writes on the master       the way you may have to with other alternatives.</li>
<li>Schooner       believes this feature alone can make SchoonerSQL 3X faster than MySQL       alternatives, at least for writes.</li>
</ul>
</li>
</ul>
<p>At least in casual conversation, Schooner synthesizes its 1.5-2X single-server figure and up-to-3X clustering figure into a single claim of frequent 2-5X speedup over generic MySQL/InnoDB. But the usual caveats about vendor-supplied performance numbers of course apply.</p>
<p>Finally, some housekeeping:</p>
<ul>
<li>At      Oracle’s polite request,* Schooner changed its product name to not mention      MySQL; hence the moniker SchoonerSQL.</li>
<li>SchoonerSQL      is being launched Monday.</li>
<li>Schooner      has determined that if its version numbers are different from MySQL’s,      confusion ensues. So the first ever version under the name SchoonerSQL is      SchoonerSQL 5.1, a factoid that Schooner is wisely omitting from the      official product launch press release.</li>
<li>The      synchronous part of SchoonerSQL’s replication story dates back to last      January’s product release.</li>
<li>Most      of the asynchronous part of SchoonerSQL story is new for Monday.</li>
<li>dbShards/SchoonerSQL      partnership engineering is still underway.</li>
</ul>
<p><em><span style="text-decoration: line-through;">*I mean that non-facetiously. </span><span style="text-decoration: line-through;">Schooner’s MySQL OEM contract was such that Oracle didn’t have a legal hammer to force the change.</span> Edit: Whoops! There turn out to have been inaccuracies in the original version of this footnote, which I now regret writing. The contract isn&#8217;t exactly OEM, and there actually were some trademark-based legal hammers.<span style="text-decoration: line-through;"><br />
</span></em></p>
]]></content:encoded>
			<wfw:commentRss>http://www.dbms2.com/2011/10/23/schooner-pivots-further/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
		<item>
		<title>More notes on Oracle NoSQL</title>
		<link>http://www.dbms2.com/2011/10/20/more-notes-on-oracle-nosql/</link>
		<comments>http://www.dbms2.com/2011/10/20/more-notes-on-oracle-nosql/#comments</comments>
		<pubDate>Thu, 20 Oct 2011 15:49:31 +0000</pubDate>
		<dc:creator>Curt Monash</dc:creator>
				<category><![CDATA[NoSQL]]></category>
		<category><![CDATA[Oracle]]></category>
		<category><![CDATA[Web analytics]]></category>

		<guid isPermaLink="false">http://www.dbms2.com/?p=5515</guid>
		<description><![CDATA[A reporter asked me for some thoughts on Oracle&#8217;s new NoSQL product. For the most part, I stand by my previous comments on Oracle NoSQL. Still, NoSQL in general deserves a place in Oracle shops, so it makes sense for Oracle to try to coopt it. Oracle&#8217;s core DBMS is not well suited to track [...]]]></description>
			<content:encoded><![CDATA[<p>A reporter asked me for some thoughts on Oracle&#8217;s new NoSQL product. For the most part, I stand by <a href="http://www.dbms2.com/2011/09/30/oracle-nosql/">my previous comments on Oracle NoSQL</a>. Still, NoSQL in general deserves a place in Oracle shops, so it makes sense for Oracle to try to coopt it.</p>
<p>Oracle&#8217;s core DBMS is not well suited to track interactions (e.g. web clicks), even in cases where it&#8217;s the choice for transactions; it&#8217;s unnecessarily heavyweight. What&#8217;s worse, <a href="http://www.dbms2.com/2010/09/16/chase-authentication-database-outage/">using the same database to store actions and interactions can lead to serious reliability problems.</a> If a better architecture is to dump the clicks into some NoSQL store, massage the information, and eventually put some derived data into a relational DBMS, then Oracle will naturally try to own each step of the data pipeline.</p>
<p><a href="http://www.dbms2.com/2011/07/31/dynamic-fixed-schema-databases/">Dynamic schemas</a> are another area of Oracle weakness, leading in some cases to outright <a href="http://www.dbms2.com/2011/07/27/mongodb-users-and-use-cases/">Oracle replacements</a>. However, pure key-value stores go too far to the opposite extreme; you should at least be able to index and retrieve data one field at a time. Based on what I&#8217;ve seen of Oracle&#8217;s marketing literature, that feature will be missing from the first release of Oracle&#8217;s NoSQL.* Until it&#8217;s in there, and until it works well, I don&#8217;t see why anybody should use Oracle&#8217;s NoSQL product.</p>
<p><em>*Frankly, that choice makes no sense to me on any level. Yet it&#8217;s the way Oracle seems to have elected to go &#8212; or, if it isn&#8217;t, then there&#8217;s somebody writing Oracle marketing collateral who&#8217;s clearly in the wrong line of work.</em></p>
]]></content:encoded>
			<wfw:commentRss>http://www.dbms2.com/2011/10/20/more-notes-on-oracle-nosql/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
		<item>
		<title>Oracle is buying Endeca</title>
		<link>http://www.dbms2.com/2011/10/18/oracle-is-buying-endeca/</link>
		<comments>http://www.dbms2.com/2011/10/18/oracle-is-buying-endeca/#comments</comments>
		<pubDate>Tue, 18 Oct 2011 16:09:56 +0000</pubDate>
		<dc:creator>Curt Monash</dc:creator>
				<category><![CDATA[Columnar database management]]></category>
		<category><![CDATA[Endeca]]></category>
		<category><![CDATA[Oracle]]></category>

		<guid isPermaLink="false">http://www.dbms2.com/?p=5502</guid>
		<description><![CDATA[Oracle is buying Endeca. The official talking points for the deal aren&#8217;t a perfect match for Endeca&#8217;s actual technology, but so be it. In that post, I wrote: &#8230; the Endeca paradigm is really to help you make your way through a structured database, where different portions of the database have different structures. Thus, at [...]]]></description>
			<content:encoded><![CDATA[<p>Oracle is buying Endeca. The <a href="http://www.oracle.com/us/corporate/press/517791">official talking points for the deal</a> aren&#8217;t a perfect match for <a href="http://www.dbms2.com/2011/04/18/endeca-topics/">Endeca&#8217;s actual technology</a>, but so be it.</p>
<p>In that post, I wrote:</p>
<blockquote><p>&#8230; the Endeca paradigm is really to help you make your way through a  structured database, where different portions of the database have  different structures. Thus, at various points in your journey, it  automagically provides you a list of choices as to where you could go  next.</p></blockquote>
<p>That kind of thing could help Oracle with apps like <a href="http://www.dbms2.com/2011/07/27/mongodb-users-and-use-cases/">the wireless telco product catalog deal MongoDB got</a>.</p>
<p>Going back to the Endeca-post quote well, Endeca itself said:</p>
<blockquote><p>Inside the MDEX Engine there is no overarching schema; each data record  carries its own metadata. This enables the rapid combination of a wide  range of structured and unstructured content into Latitude’s unified  data model. Once inside, the MDEX Engine derives common dimensions and  metrics from the available metadata, instantly exposing each for  high-performance refinement and analysis in the <a href="http://www.endeca.com/en/products/endeca_latitude/technology-overview/Discovery-Framework.html">Discovery Framework</a>.  Have a new data source? Simply add it and the MDEX Engine will create  new relationships where possible. Changes in source data schema? No  problem, adjustments on the fly are easy.</p></blockquote>
<p>And I pointed out that the MDEX engine was a columnar DBMS.</p>
<p>Meanwhile, <a href="http://www.dbms2.com/2011/09/22/hybrid-columnar-soundbites/">Oracle&#8217;s own columnar DBMS efforts have been disappointing</a>. Endeca could be an intended answer to that. However, while Oracle&#8217;s track record with standalone DBMS acquisitions is admirable (DEC RDB, MySQL, etc.), Oracle&#8217;s track record of integrating DBMS acquisitions into the Oracle product itself is not so good. (Express? Essbase? The text product line? None of that has gone particularly well.)</p>
<p>So <strong>while I would expect Endeca&#8217;s flagship e-commerce shopping engine products to flourish under Oracle&#8217;s ownership, I would be cautious about the integration of Endeca&#8217;s core technology into the Oracle product line.</strong></p>
]]></content:encoded>
			<wfw:commentRss>http://www.dbms2.com/2011/10/18/oracle-is-buying-endeca/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
	</channel>
</rss>

