<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>DBMS 2 : DataBase Management System Services &#187; In-memory DBMS</title>
	<atom:link href="http://www.dbms2.com/category/memory-centric-data-management/in-memory-dbms/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.dbms2.com</link>
	<description>Choices in data management and analysis</description>
	<lastBuildDate>Wed, 08 Feb 2012 22:51:11 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.0.3</generator>
		<item>
		<title>Comments on the analytic DBMS industry and Gartner&#8217;s Magic Quadrant for same</title>
		<link>http://www.dbms2.com/2012/02/08/gartner-magic-quadrant-data-warehouse-2011-2012/</link>
		<comments>http://www.dbms2.com/2012/02/08/gartner-magic-quadrant-data-warehouse-2011-2012/#comments</comments>
		<pubDate>Wed, 08 Feb 2012 17:17:32 +0000</pubDate>
		<dc:creator>Curt Monash</dc:creator>
				<category><![CDATA[Columnar database management]]></category>
		<category><![CDATA[Data mart outsourcing]]></category>
		<category><![CDATA[Data warehouse appliances]]></category>
		<category><![CDATA[Data warehousing]]></category>
		<category><![CDATA[Database compression]]></category>
		<category><![CDATA[Exadata]]></category>
		<category><![CDATA[Exasol]]></category>
		<category><![CDATA[In-memory DBMS]]></category>
		<category><![CDATA[Infobright]]></category>
		<category><![CDATA[Kognitio]]></category>
		<category><![CDATA[Market share and customer counts]]></category>
		<category><![CDATA[Microsoft and SQL*Server]]></category>
		<category><![CDATA[Open source]]></category>
		<category><![CDATA[Oracle]]></category>
		<category><![CDATA[ParAccel]]></category>
		<category><![CDATA[Software as a Service (SaaS)]]></category>
		<category><![CDATA[Sybase]]></category>
		<category><![CDATA[Teradata]]></category>
		<category><![CDATA[illuminate Solutions]]></category>

		<guid isPermaLink="false">http://www.dbms2.com/?p=5926</guid>
		<description><![CDATA[This year&#8217;s Gartner Magic Quadrant for Data Warehouse Database Management Systems is out.* I shall now comment, just as I did on the 2010, 2009, 2008, 2007, and 2006 Gartner Data Warehouse Database Management System Magic Quadrants, to varying extents. To frame the discussion, let me start by saying: In general, I regard Gartner Magic [...]]]></description>
			<content:encoded><![CDATA[<p>This year&#8217;s Gartner Magic Quadrant for Data Warehouse Database Management Systems is out.* I shall now comment, just as I did on the <a href="http://www.dbms2.com/2011/02/05/gartner-magic-quadrant-data-warehouse-database-management-2010/">2010</a>, <a href="../../../../../2010/02/10/gartner-magic-quadrant-data-warehouse-2009-2010/">2009</a>, <a href="../../../../../2009/01/12/gartners-2008-data-warehouse-database-management-system-magic-quadrant-is-out/">2008</a>, <a href="../../../../../2007/10/19/gartner-2007-magic-quadrant-for-data-warehouse-database-management-systems/">2007</a>, and <a href="../../../../../2006/10/03/vendor-segmentation-for-data-warehouse-dbms/">2006</a> Gartner Data Warehouse Database Management System Magic Quadrants, to varying extents. To frame the discussion, let me start by saying:</p>
<ul>
<li>In general, I regard Gartner Magic Quadrants as a bad use of good research.</li>
<li>Illustrating the uselessness of &#8212; or at least poor execution on &#8212; the  overall quadrant metaphor, a large majority of the vendors covered are  lined up near the line x = y, each outpacing the one below in both of  the quadrant&#8217;s dimensions.</li>
<li>I find fewer specifics to disagree with in this Gartner Magic Quadrant than in previous year&#8217;s versions. Two factors jump to mind as possible reasons:
<ul>
<li>This year&#8217;s Gartner Magic Quadrant for Data Warehouse Database Management Systems is somewhat less ambitious than others; while it gives as much company detail as its predecessors, it doesn&#8217;t add as much discussion of overall trends. So there&#8217;s less to (potentially) disagree with.</li>
<li><a href="http://www.dbms2.com/2010/12/28/evolving-definitions-and-technology-categories-for-2011/">Merv Adrian is now at Gartner</a>.</li>
</ul>
</li>
<li>Whatever the problems may be with Gartner&#8217;s approach, the whole thing comes out better than do <a href="http://www.dbms2.com/2011/02/11/comments-on-the-2011-forrester-wave-for-enterprise-data-warehouse-platforms/">Forrester&#8217;s failed imitations</a>.</li>
</ul>
<p><em>*At the time of this posting, I don&#8217;t yet have a link. However, I expect that to change quickly, and I plan to edit this paragraph accordingly. If nothing else, I hope people will drop links into the comment thread. </em></p>
<p>Specific company comments, roughly in line with Gartner&#8217;s rough single-dimensional rank ordering, include: <span id="more-5926"></span></p>
<ul>
<li>The Gartner Magic Quadrant&#8217;s comments on Teradata seem pretty fair. I don&#8217;t think I&#8217;m much in disagreement when I say:
<ul>
<li>Teradata has the richest, most mature analytic DBMS offering.</li>
<li>Teradata has an outstanding track record both for <a href="http://www.dbms2.com/2011/09/24/confusion-about-teradatas-big-customers/">managing large data volumes</a> and for high-concurrency mixed workloads.</li>
<li>Aster Data was a cool Teradata acquisition, even if Teradata/Aster synergies or integration have been nominal to date.</li>
<li>Teradata still needs to get out of its own way in marketing, positioning, packaging, and/or defining its premium-priced system vs. its more moderately-priced alternatives. Indeed, as necessary as this approach may have been to fending off encroachments by Netezza and others, what Teradata really needs to do is evolve to a more pick-your-own-node-combination mix-match kind of offering.</li>
</ul>
</li>
<li>Gartner has talked with a lot of Oracle Exadata users who say that the product works; Gartner has also stopped beating Oracle up for <a href="http://www.dbms2.com/2010/06/14/best-practices-analytic-database-poc/">its previous policy of almost never doing onsite POCs (Proofs of Concept)</a>; both parts of that ring true with me. But Gartner also rightly dings Oracle for various issues in cost and cumbersomeness. Overall, while I agree there are organizations for which Oracle should indeed be a top-ranked choice, there are many others who shouldn&#8217;t put Oracle on their short list.</li>
<li>Third in the Gartner MQ rankings is IBM.
<ul>
<li>Gartner gets so caught up in reciting the names of various IBM product offerings that it neglects to say much good about DB2 itself. (I tend to have a similar problem.)</li>
<li>But Gartner does mention concurrency as a strength. I agree, especially if we presume that that was a reference to DB2 rather than Netezza.</li>
<li>Gartner cites Netezza&#8217;s post-acquisition annual growth rate as 30%. Gartner seems to think this is a good number. I disagree, but in Netezza&#8217;s defense, it has had to endure IBM&#8217;s post-acquisition on-boarding process.</li>
</ul>
</li>
<li>Arguably fourth in the Gartner Data Warehouse Magic Quadrant rankings is EMC/Greenplum.
<ul>
<li>In general, Gartner likes the taste of Greenplum Kool-Aid.</li>
<li>Gartner neglects to ding Greenplum for concurrency challenges, which I view as an oversight given Gartner&#8217;s general stress on that area.</li>
<li>Gartner does ding Greenplum for support challenges.</li>
<li>Gartner neglects to praise Greenplum for true <a href="http://www.dbms2.com/2009/10/14/greenplum-hybrid-columnar/">hybrid row/columnar data management</a>, a feature shared by <a href="http://www.dbms2.com/2011/09/22/teradata-columnar-compression/">Teradata</a> and <a href="http://www.dbms2.com/2009/08/04/pax-analytica-row-and-column-stores-begin-to-come-together/">Vertica</a>, among others, but not by <a href="http://www.dbms2.com/2011/02/06/columnar-compression-database-storage/">Oracle</a>, DB2, or Netezza.</li>
<li>Gartner located a half-petabyte Greenplum database. This doesn&#8217;t surprise me, even though Greenplum has frequently made exaggerated claims about large-size database successes in the past.</li>
<li>Gartner reports a &gt;400 figure for Greenplum customers, which is plausible.</li>
</ul>
</li>
<li>In its first deviation from strict one-dimensional rank ordering, the Gartner Magic Quadrant ranks Sybase ahead of Greenplum in completeness of vision but behind in &#8220;ability to execute&#8221;.
<ul>
<li>If that were the other way around, it might make more sense. Greenplum promises anything and everything you might ever want for analytic data management or the associated analysis; but Sybase has vastly more analytic DBMS users than Greenplum does, running a variety of demanding workloads.</li>
<li>Gartner appears to think that Sybase IQ requires less database administration than I do.</li>
<li>Gartner seems concerned that SAP will position HANA and Sybase ASE as, between them, the only DBMS you&#8217;ll ever need, casting doubt on Sybase IQ&#8217;s future. I wouldn&#8217;t worry about that if you have a problem you want to solve today.</li>
</ul>
</li>
<li>The Gartner Magic Quadrant for Data Warehouse Database Management Systems ranks Microsoft sixth overall, despite noting that there isn&#8217;t a single production reference for Microsoft&#8217;s Parallel Data Warehouse. In support of this ranking, it for example cites the compression feature, which distinguishes Microsoft SQL Server from no other product on the list except Kognitio. If you have such an undemanding data warehousing problem that many different analytic DBMS could meet your needs, there&#8217;s a good chance Microsoft SQL Server can also do the job; and if you&#8217;ve bought into the Microsoft technology stack, you might as well keep going down that path. Otherwise, I don&#8217;t know why somebody should adopt Microsoft&#8217;s offering at this time.</li>
<li>Seventh along the main diagonal path in the Gartner Magic Quadrant is HP Vertica. I&#8217;d rank Vertica higher than that, but in fairness I note two execution concerns. First, HP has a lousy track record, both in acquisitions and in data warehousing/analytics. Second, Vertica is bad about answering my email. <img src='http://www.dbms2.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  Anyhow, Gartner doesn&#8217;t seem to have given Vertica credit either for <a href="http://www.dbms2.com/2011/06/20/columnar-dbms-vendor-customer-metrics/">its full customer count or for the multiple petabyte-scale databases Vertica runs</a>.</li>
<li>1010data is an outlier, with Gartner noting that it only partly fits in with other &#8220;Data Warehousing Database Management&#8221; companies, and hence kind of confessing that 1010data on the Magic Quadrant is somewhat arbitrary. Stuff like that is bound to happen, given <a href="http://www.strategicmessaging.com/no-market-categorization-is-ever-precise/2011/03/01/">the inherent difficulties of defining market categories</a>. Anyhow, my thoughts on 1010data include:
<ul>
<li>I&#8217;m nervous about the fact that 1010data doesn&#8217;t actually control its own DBMS technology, but rather relies on old code from the small private company KX Systems.</li>
</ul>
<ul>
<li> There are three main reasons to consider 1010data:
<ul>
<li>You want to enter the data mart outsourcing business in a casual way, and you like its SaaS offering.</li>
<li>You want to engage in <a href="http://www.dbms2.com/2010/05/15/stakeholder-facing-analytics/">stakeholder-facing analytics</a> in a casual way, and you like its SaaS offering.</li>
<li>You love 1010data&#8217;s particular set of interactive analytic features and performance.</li>
</ul>
</li>
</ul>
</li>
<li>Back to the main path winding along the Gartner Magic Quadrant main diagonal &#8212; next up is ParAccel. While I question some of the peripheral comments, I agree with Gartner&#8217;s core messages that:
<ul>
<li>ParAccel, the product, is blazingly fast in certain use cases.</li>
<li>ParAccel, the company, is dangerously small.</li>
</ul>
</li>
<li>Eighth on the Gartner MQ&#8217;s main path is Kognitio. This is too high. Kognitio positions itself as offering in-memory DBMS, yet stubbornly refuses to do any kind of data compression. That&#8217;s an awful combination of choices. As for using Kognitio&#8217;s data warehousing SaaS offering &#8212; why would you do that, when more modern products are available on a SaaS/cloud basis as well?</li>
<li>Ninth in the Gartner Magic Quadrant main rankings is SAND.
<ul>
<li>The SAND section is not a triumph of Gartner accuracy. For example:
<ul>
<li><a href="http://www.dbms2.com/2011/11/12/clarifying-sands-customer-metrics-positioning-and-technical-story/">Gartner completely missed the errors in SAND&#8217;s reported customer counts</a>.</li>
<li>Gartner refers to SAND as being &#8220;in existence for approximately nine years&#8221;, which is too low by at least a factor of 2.</li>
<li>Gartner says &#8220;SAND is a privately held company&#8221;, even though <a href="http://itmarketstrategy.com/2009/06/07/sand-technology-a-risky-bet/">Merv knows better than that</a>.</li>
</ul>
</li>
<li>Otherwise, Gartner&#8217;s opinion on SAND seems to boil down to &#8220;Interesting technology and ideas, but dangerously small company.&#8221; I agree.</li>
</ul>
</li>
<li>Tenth and too low in the Gartner MQ main rankings is Infobright.
<ul>
<li>At least by some metrics (e.g. customer count), Infobright isn&#8217;t as dangerously small as ParAccel, SAND, Kognitio, et al.</li>
<li>That said, Infobright is small and focused on <a href="http://www.dbms2.com/2010/12/30/examples-and-definition-of-machine-generated-data/">machine-generated data</a>. So I wouldn&#8217;t be confident in Infobright&#8217;s future technology path for human-generated data use cases.</li>
<li>Infobright&#8217;s performance is uneven &#8212; blazing in cases where the Knowledge Grid helps, but not necessarily stellar by analytic DBMS standards when full table scans are called for.</li>
<li>I agree with Gartner that the possibility of Oracle/MySQL future shenanigans is a concern. But while the energy behind MySQL forking efforts doesn&#8217;t seem too great right now, I&#8217;d expect them to revive and offer a successful escape path if it seemed Oracle was going to indeed play hardball.</li>
<li>Also, given that it&#8217;s already an open source vendor, there are various kinds of assurances Infobright could give that would also help alleviate customer concerns.</li>
</ul>
</li>
<li>Actian, formerly Ingres, took a big tumble in Gartner&#8217;s rankings versus last year, when I simply wrote &#8220;<a href="http://www.dbms2.com/2011/02/05/gartner-magic-quadrant-data-warehouse-database-management-2010/">What Gartner said in connection with <strong>Ingres</strong> is too inaccurate to deserve detailed attention</a>.&#8221; I&#8217;m even a little harsher about <a href="http://www.dbms2.com/2011/09/25/ingres-actian/">Ingres/Actian&#8217;s DBMS products and prospects</a> than Gartner is, but at least now we&#8217;re in the same ballpark.</li>
<li>Along with Infobright, ParAccel, and SAND, <a href="http://www.dbms2.com/2011/11/12/exasol-update/">Exasol</a> appears to be another of the &#8220;good columnar technology/small company&#8221; crowd. As with other such products, one should be careful about fit-and-finish features that are missing today, as there is no assurance they&#8217;ll be added in a timely manner going forward.</li>
<li>illuminate Solutions, which was on last year&#8217;s Gartner list, <a href="http://www.dbms2.com/2012/01/16/has-illuminate-solutions-joined-the-choir-invisible/">now appears to be an ex-company</a>.</li>
</ul>
]]></content:encoded>
			<wfw:commentRss>http://www.dbms2.com/2012/02/08/gartner-magic-quadrant-data-warehouse-2011-2012/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>Some big-vendor execution questions, and why they matter</title>
		<link>http://www.dbms2.com/2011/11/21/big-vendor-execution-analytics/</link>
		<comments>http://www.dbms2.com/2011/11/21/big-vendor-execution-analytics/#comments</comments>
		<pubDate>Mon, 21 Nov 2011 11:01:20 +0000</pubDate>
		<dc:creator>Curt Monash</dc:creator>
				<category><![CDATA[Business intelligence]]></category>
		<category><![CDATA[Cognos]]></category>
		<category><![CDATA[Columnar database management]]></category>
		<category><![CDATA[Data warehouse appliances]]></category>
		<category><![CDATA[Data warehousing]]></category>
		<category><![CDATA[Exadata]]></category>
		<category><![CDATA[HP and Neoview]]></category>
		<category><![CDATA[Hadoop]]></category>
		<category><![CDATA[IBM and DB2]]></category>
		<category><![CDATA[In-memory DBMS]]></category>
		<category><![CDATA[Investment research and trading]]></category>
		<category><![CDATA[Memory-centric data management]]></category>
		<category><![CDATA[Netezza]]></category>
		<category><![CDATA[NoSQL]]></category>
		<category><![CDATA[Oracle]]></category>
		<category><![CDATA[SAP AG]]></category>
		<category><![CDATA[Vertica Systems]]></category>

		<guid isPermaLink="false">http://www.dbms2.com/?p=5704</guid>
		<description><![CDATA[When I drafted a list of key analytics-sector issues in honor of look-ahead season, the first item was &#8220;execution of various big vendors&#8217; ambitious initiatives&#8221;.  By &#8220;execute&#8221; I mean mainly: &#8220;Deliver products that really meet customers&#8217; desires and needs.&#8221; &#8220;Successfully convince them that you&#8217;re doing so &#8230;&#8221; &#8220;&#8230; at an attractive overall cost.&#8221; Vendors mentioned [...]]]></description>
			<content:encoded><![CDATA[<p>When I drafted a list of key analytics-sector issues in honor of <a href="http://www.dbms2.com/2011/11/21/analytic-trends-in-2012-qa/">look-ahead season</a>, the first item was &#8220;execution of various big vendors&#8217; ambitious initiatives&#8221;.  By &#8220;execute&#8221; I mean mainly:</p>
<ul>
<li>&#8220;Deliver products that really meet customers&#8217; desires and needs.&#8221;</li>
<li> &#8220;Successfully convince them that you&#8217;re doing so &#8230;&#8221;</li>
<li>&#8220;&#8230; at an attractive overall cost.&#8221;</li>
</ul>
<p>Vendors mentioned here are Oracle, SAP, HP, and IBM. Anybody smaller got left out due to the length of this post. Among the bigger omissions were:</p>
<ul>
<li>salesforce.com (multiple subjects).</li>
<li><a href="../../../../../2011/04/21/sas-hpa-does-make-sense-after-all/">SAS HPA</a>.</li>
<li><a href="../../../../../2011/08/21/hadoop-evolution/">The evolution of Hadoop</a>.</li>
</ul>
<p><span id="more-5704"></span><strong>A (lingering) issue for SAP and Oracle alike</strong></p>
<p>As I noted in January of this year, <a href="../../../../../2011/01/03/the-six-useful-things-you-can-do-with-analytic-technology/">integration of business intelligence into operational apps is making very slow progress</a>. Even so, it&#8217;s a huge part of the apparent strategy at SAP and Oracle alike, as well it should be. Much of the benefit from automating routine desk work has already happened. The areas ripest for exploitation are the ones where analytics are part of the equation.</p>
<p>Given the lack of tangible progress, why do I think this is a genuine area of Oracle and SAP emphasis? Three reasons of many are:</p>
<ul>
<li>Why else did SAP buy Business Objects?</li>
<li>If they&#8217;re not trying to <a href="../../../../../2011/03/30/short-request-and-analytic-processing/">integrate operational apps and analytics</a>, why else does SAP&#8217;s emphasis on HANA make sense?</li>
<li>Without business intelligence in the picture, how does Oracle&#8217;s integrated-stack story promise any direct user benefits?*</li>
</ul>
<p><em>*As opposed to IT concerns &#8212; integration, administration, TCO (Total Cost of Ownership), etc.</em></p>
<p>After so many years of disappointment, I&#8217;m not going to forecast 2012 as a pivotal year for <strong>the integration of business intelligence into operational applications.</strong> But if one of SAP or Oracle ever does get a significant BI/operational app integration advantage over the other, it could be a major competitive advantage in those application market segments that are still up for grabs. It also is an opportunity for both vendors to gain BI market share in their respective application customer bases.</p>
<p><strong>A more urgent issue for SAP</strong></p>
<p>SAP has put huge amounts of credibility on the line for HANA, the integration of two different and not particularly mature in-memory database technologies. So far, it is difficult to find evidence that HANA is robust enough for widespread adoption. Whether or not SAP can fix that is a huge open question, which could have significant impact on the course of several technology areas: applications, business intelligence, in-memory DBMS, and maybe even hardware.</p>
<p>Based on current information, which is admittedly partial, I&#8217;m a short-term pessimist on HANA. Longer-term, I&#8217;m on record as saying that <a href="../../../../../2011/05/23/databases-ram/">traditional databases will eventually wind up in RAM</a>. SAP will surely get that technology right some day, whether or not the way it does so has anything to do with present-day HANA code.</p>
<p><strong>Four more issues for Oracle </strong></p>
<p>Oracle&#8217;s ambitions are near-endless, and so also therefore is its list of execution challenges. Four in the analytics area that I find particularly interesting are:</p>
<ul>
<li><strong>True hybrid columnar DBMS.</strong> <a href="../../../../../2011/09/22/teradata-columnar-compression/">I was guessing that Oracle, like Teradata, would announce true hybrid columnar the week of Oracle OpenWorld</a>. I was wrong. But if Oracle can&#8217;t bring out true hybrid columnar DBMS functionality relatively soon, Exadata will lose credibility as a competitor to more specialized analytic DBMS.</li>
<li><strong>Oracle Exalytics.</strong> With Exalytics in the mix, Oracle&#8217;s technology stack has HANA-like potential. But will Exalytics even ship in 2012? (I think so.) Will it be good for much in the first release? (I&#8217;m skeptical.)</li>
<li><strong>Oracle&#8217;s Big Data Appliance</strong>. I&#8217;m skeptical both about <a href="../../../../../2011/10/20/more-notes-on-oracle-nosql/">Oracle&#8217;s NoSQL product</a> &#8212; <a href="http://www.infoworld.com/d/data-explosion/first-look-oracle-nosql-database-179107">a favorable InfoWorld review</a> notwithstanding &#8212; and <a href="../../../../../2011/09/23/hadoop-appliances/">Hadoop appliances</a>. But if I&#8217;m wrong, and Oracle can successfully embrace/extend the new non-relational paradigms, then it really might regain control over the evolution of data management.</li>
<li><strong><a href="../../../../../2011/10/18/oracle-is-buying-endeca/">Oracle&#8217;s Endeca acquisition</a></strong> &#8212; will Oracle prove me wrong and integrate Endeca effectively into its overall analytic product line? If it does, we might finally see effective text (and eventually speech) navigation of enterprise software. (But as with all Oracle issues cited here, this is something that probably won&#8217;t amount to much in 2012 even if it does later go well.)</li>
</ul>
<p><strong>Three issues for IBM</strong></p>
<p>Like Oracle, IBM is a huge company with many ambitions and hence many execution challenges. The biggest of those is surely: <strong>How effective can IBM be at selling outside its existing customer base?</strong> I don&#8217;t hear as much competitively about IBM DataStage, IBM SPSS or now IBM Netezza as I did when their vendors were independent companies. Even Cognos may not be much of an exception to the rule, although it has its own large customer base outside of IBM&#8217;s traditional one. (To lesser extents , the same is of course true of Netezza and numerous other IBM acquisitions.)</p>
<p>Another general issue for IBM is <strong>substantively integrating its various product lines,</strong> at least to the extent that makes sense. DB2/Netezza integration sounds good, but even that is a matter more of product marketing (the admirable part of that discipline) more than of actual technology. Other integrations (e.g. Cognos/DB2 in various bundles) have tended toward the dubious side.*</p>
<p><em>*I&#8217;m still waiting for IBM to get back to me with examples of how Cognos/DB2 joint tuning amounts to anything. It&#8217;s been more than a year, so I&#8217;m glad I didn&#8217;t hold my breath.</em></p>
<p>In a somewhat narrower vein, I wonder: <strong><a href="../../../../../2011/11/10/cep-streaming-catchup/">Will IBM be able to gain traction for InfoSphere Streams</a>? </strong>And if so, when and where will the traction be?</p>
<p><strong>Will HP screw up Vertica?</strong></p>
<p>Vertica has a very attractive product offering. It&#8217;s perhaps <a href="../../../../../2011/06/20/columnar-dbms-vendor-customer-metrics/">the most scalable analytic DBMS outside of Teradata</a>, running on the hardware of your reasonable choice.  It&#8217;s also the one I recommend most often to clients in the 1-50 terabyte range.</p>
<p>So far HP doesn&#8217;t seem to have done much to leadfoot Vertica. (About all I&#8217;ve heard from competitors is that Vertica seems to have faded somewhat in the financial services market, and there could be multiple explanations if that is indeed true.) But if HP Vertica does somehow manage to botch things, opportunities will open up for a range of columnar analytic DBMS competitors.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.dbms2.com/2011/11/21/big-vendor-execution-analytics/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>McObject and eXtremeDB</title>
		<link>http://www.dbms2.com/2011/07/22/mcobject-extremedb/</link>
		<comments>http://www.dbms2.com/2011/07/22/mcobject-extremedb/#comments</comments>
		<pubDate>Fri, 22 Jul 2011 12:32:16 +0000</pubDate>
		<dc:creator>Curt Monash</dc:creator>
				<category><![CDATA[In-memory DBMS]]></category>
		<category><![CDATA[McObject]]></category>
		<category><![CDATA[Memory-centric data management]]></category>
		<category><![CDATA[Object]]></category>
		<category><![CDATA[Objectivity and Infinite Graph]]></category>
		<category><![CDATA[Telecommunications]]></category>
		<category><![CDATA[solidDB]]></category>

		<guid isPermaLink="false">http://www.dbms2.com/?p=5004</guid>
		<description><![CDATA[I talked with McObject yesterday. McObject has two product lines, both of which are something like in-memory DBMS &#8212; eXtremeDB, which is the main one, and Perst. McObject has been around since at least 2003, probably has no venture capital, and probably has a very low double-digit number of employees.* *I could be wrong in [...]]]></description>
			<content:encoded><![CDATA[<p>I talked with McObject yesterday. McObject has two product lines, both of which are something like in-memory DBMS &#8212; eXtremeDB, which is the main one, and <a href="../../../../../2008/06/08/perst/">Perst</a>. McObject has been around since at least 2003, probably has no venture capital, and probably has a very low double-digit number of employees.*</p>
<p><em>*I could be wrong in those guesses; as small companies go, McObject is unusually prone to secrecy games.</em></p>
<p>As best I understand:</p>
<ul>
<li>eXtremeDB is something like an in-memory <a href="../../../../../2011/05/21/object-oriented-database-management-systems-oodbms/">object-oriented DBMS</a>, designed to be embeddable.</li>
<li>However, much as with Objectivity and other old-school OODBMS, eXtremeDB winds up being more of a toolkit with which to build DBMS than a full DBMS.</li>
<li>eXtremeDB has a few indexing schemes. The main one is good old B-trees. One customer wanted Patricia tries, so they&#8217;re in there. (Perhaps not coincidentally, solidDB relies on Patricia tries.) At least one wanted R-trees, so they&#8217;re in there too.</li>
<li>eXtremeDB has long had the option of persistent logs.</li>
<li>eXtremeDB newly has a hybrid memory-centric option, in which you can have more data in the database than fits into RAM.</li>
<li>eXtremeDB newly has multi-master two-phase-commit clustering.</li>
</ul>
<p>My guess three years ago that <a href="../../../../../2008/05/13/mcobject-extremedb-a-soliddb-alternative/">eXtremeDB might emerge as an alternative to solidDB</a> seems to have been borne out. McObject CEO Steve Graves says that the core of McObject&#8217;s business is OEMs, in sectors such as telecom equipment and defense/aerospace. That&#8217;s exactly solidDB&#8217;s traditional market, except that <a href="../../../../../2007/12/21/ibm-acquires-soliddb/">solidDB got acquired by IBM and deemphasized it</a>.</p>
<p>I&#8217;ve said before that if I were starting a SaaS effort &#8212; and it wasn&#8217;t just focused on analytics &#8212; <a href="../../../../../2011/05/21/object-oriented-database-management-systems-oodbms/">I&#8217;d look at using a memory-centric OODBMS</a>. Perhaps eXtremeDB is worth looking at in such scenarios.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.dbms2.com/2011/07/22/mcobject-extremedb/feed/</wfw:commentRss>
		<slash:comments>9</slash:comments>
		</item>
		<item>
		<title>Soundbites: the Facebook/MySQL/NoSQL/VoltDB/Stonebraker flap, continued</title>
		<link>http://www.dbms2.com/2011/07/15/facebook-mysql-nosql-voltdb-stonebraker/</link>
		<comments>http://www.dbms2.com/2011/07/15/facebook-mysql-nosql-voltdb-stonebraker/#comments</comments>
		<pubDate>Fri, 15 Jul 2011 08:27:18 +0000</pubDate>
		<dc:creator>Curt Monash</dc:creator>
				<category><![CDATA[Akiban]]></category>
		<category><![CDATA[Cache]]></category>
		<category><![CDATA[Cassandra]]></category>
		<category><![CDATA[Clustrix]]></category>
		<category><![CDATA[Couchbase]]></category>
		<category><![CDATA[Data models and architecture]]></category>
		<category><![CDATA[Database diversity]]></category>
		<category><![CDATA[Facebook]]></category>
		<category><![CDATA[HBase]]></category>
		<category><![CDATA[In-memory DBMS]]></category>
		<category><![CDATA[Michael Stonebraker]]></category>
		<category><![CDATA[MongoDB and 10gen]]></category>
		<category><![CDATA[NoSQL]]></category>
		<category><![CDATA[Open source]]></category>
		<category><![CDATA[ScaleBase]]></category>
		<category><![CDATA[ScaleDB]]></category>
		<category><![CDATA[Schooner Information Technology]]></category>
		<category><![CDATA[Software as a Service (SaaS)]]></category>
		<category><![CDATA[Tokutek]]></category>
		<category><![CDATA[VoltDB and H-Store]]></category>
		<category><![CDATA[dbShards and CodeFutures]]></category>
		<category><![CDATA[memcached]]></category>

		<guid isPermaLink="false">http://www.dbms2.com/?p=4977</guid>
		<description><![CDATA[As a follow-up to the latest Stonebraker kerfuffle, Derrick Harris asked me a bunch of smart followup questions. My responses and afterthoughts include: Facebook et al. are in effect Software as a Service (SaaS) vendors, not enterprise technology users. In particular: They have the technical chops to rewrite their code as  needed. Unlike packaged software [...]]]></description>
			<content:encoded><![CDATA[<p>As a follow-up to the latest <a href="http://www.dbms2.com/2011/07/14/an-odd-claim-attributed-to-mike-stonebraker/">Stonebraker kerfuffle</a>, Derrick Harris asked me a bunch of smart followup questions. My responses and afterthoughts include:</p>
<ul>
<li>Facebook et al. are in effect Software as a Service (SaaS) vendors, not enterprise technology users. In particular:
<ul>
<li>They have the technical chops to rewrite their code as  needed.</li>
<li>Unlike packaged software vendors, they&#8217;re not answerable to anybody for keeping legacy code alive after a rewrite. That makes migration a lot easier.</li>
<li>If they want to write different parts of their system on different technical underpinnings, nobody can stop them. For example &#8230;</li>
<li>&#8230;  <a href="http://www.dbms2.com/2008/07/21/project-cassandra-facebook-open-sourced-quasi-dbms/">Facebook innovated Cassandra</a>, and is now heavily committed to HBase.</li>
</ul>
</li>
<li>It makes little sense to talk of Facebook&#8217;s use of &#8220;MySQL.&#8221; Better to talk of Facebook&#8217;s use of &#8220;MySQL +  memcached  + non-transparent sharding.&#8221; That said:
<ul>
<li>It&#8217;s hard to see why somebody today would use MySQL +  memcached  + non-transparent sharding for a new project. At least one of <a href="http://www.dbms2.com/2011/02/08/couchbase-membase-couchone-couchdb/">Couchbase</a> or <a href="http://www.dbms2.com/2011/02/24/transparent-sharding/">transparently-sharded</a> MySQL is very likely a superior alternative. Other alternatives might be better yet.</li>
<li>As noted above in the example of Facebook, the many major web businesses that are using MySQL +  memcached  + non-transparent sharding for existing projects can be presumed able to migrate away from that stack as the need arises.</li>
</ul>
</li>
</ul>
<p>Continuing with that discussion of DBMS alternatives:</p>
<ul>
<li>If you just want to write to the memcached API anyway, why not go with Couchbase?</li>
<li>If you want to go relational, why not go with MySQL? There are many alternatives for scaling or accelerating MySQL &#8212; dbShards, Schooner, Akiban, Tokutek, ScaleBase, ScaleDB, Clustrix, and Xeround come to mind quickly, so there&#8217;s a great chance that one or more will fit your use case. (And if you don&#8217;t get the choice of MySQL flavor right the first time, porting to another one shouldn&#8217;t be all THAT awful.)</li>
<li>If you really, really want to go in-memory, and don&#8217;t mind writing Java stored procedures, and don&#8217;t need to do the kinds of joins it isn&#8217;t good at, but do need to do the kinds of joins it is, VoltDB could indeed be a good alternative.</li>
</ul>
<p>And while we&#8217;re at it &#8212; going <strong>schema-free</strong> often makes a whole lot of sense. I need to write much more about the point, but for now let&#8217;s just say that I look favorably on the Big Four schema-free/NoSQL options of MongoDB, Couchbase, HBase, and Cassandra.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.dbms2.com/2011/07/15/facebook-mysql-nosql-voltdb-stonebraker/feed/</wfw:commentRss>
		<slash:comments>19</slash:comments>
		</item>
		<item>
		<title>An odd claim attributed to Mike Stonebraker</title>
		<link>http://www.dbms2.com/2011/07/14/an-odd-claim-attributed-to-mike-stonebraker/</link>
		<comments>http://www.dbms2.com/2011/07/14/an-odd-claim-attributed-to-mike-stonebraker/#comments</comments>
		<pubDate>Thu, 14 Jul 2011 11:10:34 +0000</pubDate>
		<dc:creator>Curt Monash</dc:creator>
				<category><![CDATA[Cache]]></category>
		<category><![CDATA[Clustering]]></category>
		<category><![CDATA[Couchbase]]></category>
		<category><![CDATA[Games and virtual worlds]]></category>
		<category><![CDATA[In-memory DBMS]]></category>
		<category><![CDATA[Michael Stonebraker]]></category>
		<category><![CDATA[MySQL]]></category>
		<category><![CDATA[Parallelization]]></category>
		<category><![CDATA[Theory and architecture]]></category>
		<category><![CDATA[VoltDB and H-Store]]></category>
		<category><![CDATA[memcached]]></category>

		<guid isPermaLink="false">http://www.dbms2.com/?p=4964</guid>
		<description><![CDATA[This post has a sequel. Last week, Mike Stonebraker insulted MySQL and Facebook&#8217;s use of it, by implication advocating VoltDB instead. Kerfuffle ensued. To the extent Mike was saying that non-transparently sharded MySQL isn&#8217;t an ideal way to do things, he&#8217;s surely right. That still leaves a lot of options for massive short-request databases, however, [...]]]></description>
			<content:encoded><![CDATA[<p><em>This post has a <a href="http://www.dbms2.com/2011/07/15/facebook-mysql-nosql-voltdb-stonebraker/">sequel</a>.</em></p>
<p>Last week, Mike Stonebraker <a href="http://gigaom.com/cloud/facebook-trapped-in-mysql-fate-worse-than-death/">insulted MySQL and Facebook&#8217;s use of it</a>, by implication advocating <a href="http://www.dbms2.com/2010/06/30/details-and-analysis-of-the-voltdb-argument/">VoltDB</a> instead. Kerfuffle ensued. To the extent Mike was saying that non-transparently sharded MySQL isn&#8217;t an ideal way to do things, he&#8217;s surely right. That still leaves a lot of options for massive <a href="http://www.dbms2.com/2011/03/02/short-request-processing/">short-request</a> databases, however, including <a href="http://www.dbms2.com/2011/02/24/transparent-sharding/">transparently sharded</a> RDBMS, scale-out <a href="http://www.dbms2.com/2011/05/23/databases-ram/">in-memory DBMS</a> (whether or not VoltDB*), and various NoSQL options. If nothing else, <a href="http://www.dbms2.com/2011/02/08/couchbase-membase-couchone-couchdb/">Couchbase</a> would seem superior to memcached/non-transparent MySQL if you were starting a project today.</p>
<p><em>*The big problem with VoltDB, last I checked, was its reliance on Java stored procedures to get work done.</em></p>
<p>Pleasantries continued in <em><a href="http://www.theregister.co.uk/2011/07/13/mike_stonebraker_versus_facebook/">The Register</a>,</em> which got an amazing-sounding quote from Mike. If <em>The Reg</em> is to be believed &#8212; something <a href="http://www.monashreport.com/2006/03/22/goodmail-esther-dyson-andrew-orlowski-etc/">I wouldn&#8217;t necessarily take for granted</a> &#8212; Mike claimed that he (i.e. VoltDB) knows how to solve the <strong>distributed join</strong> performance problem.  <span id="more-4964"></span></p>
<blockquote><p>So, it&#8217;s Stonebraker against the web. And the difference of option is  severe. In May, at a MongoDB developer conference in San Francisco,  Mongo creator Dwight Merriman told his audience there was &#8220;no way&#8221; to do distributed joins in a way that really scales.  &#8220;I&#8217;m not smart enough to do distributed joins that scale horizontally,  widely, and are super fast. You have to choose something else. We have  no choice but to not be relational,&#8221; he said</p>
<p>&#8220;You can do distributed transactions, but if you do them with no loss  of generality and you do them across a thousand machines, it&#8217;s not  going to be that fast.&#8221;</p>
<p>Stonebraker says precisely the opposite, and in typical fashion, he  goes right for the jugular. &#8220;I reject what Merriman says out of hand,&#8221;  he tells <em>The Register</em>. Merriman and his company, 10gen, declined  to comment for this story. But Stonebaker says words don&#8217;t matter. As  much as he likes to wield his opinions, he insists the debate will be  decided elsewhere. &#8220;Let the bake-off begin,&#8221; he crows.</p></blockquote>
<p>But when last I checked, VoltDB made nowhere near that claim. And well it shouldn&#8217;t have. In the fully general case, there&#8217;s no way to ensure super distributed join performance other than by throwing lots and lots of gear at the problem. But if you do that, many alternatives are fast. More specialized cases may be a different matter &#8212; but there are many fast alternatives for those too.</p>
<p>I imagine there will be use cases for which VoltDB sustains a lead as the truly fastest alternative, similarly-architected competitors perhaps excepted.* But what Mike supposedly said seems quite forward-leaning when compared to technical reality.</p>
<p><em>*The canonical VoltDB use case is <a href="http://www.dbms2.com/2010/05/25/voltdb-finally-launches/">e-commerce in virtual goods</a>, the point of &#8220;virtual&#8221; being that physical inventory might necessitate costlier kinds of joins.</em></p>
]]></content:encoded>
			<wfw:commentRss>http://www.dbms2.com/2011/07/14/an-odd-claim-attributed-to-mike-stonebraker/feed/</wfw:commentRss>
		<slash:comments>20</slash:comments>
		</item>
		<item>
		<title>Forthcoming Oracle appliances</title>
		<link>http://www.dbms2.com/2011/06/24/forthcoming-oracle-appliances/</link>
		<comments>http://www.dbms2.com/2011/06/24/forthcoming-oracle-appliances/#comments</comments>
		<pubDate>Fri, 24 Jun 2011 06:44:56 +0000</pubDate>
		<dc:creator>Curt Monash</dc:creator>
				<category><![CDATA[Data warehouse appliances]]></category>
		<category><![CDATA[Hadoop]]></category>
		<category><![CDATA[In-memory DBMS]]></category>
		<category><![CDATA[MapReduce]]></category>
		<category><![CDATA[Memory-centric data management]]></category>
		<category><![CDATA[Object]]></category>
		<category><![CDATA[Oracle]]></category>

		<guid isPermaLink="false">http://www.dbms2.com/?p=4822</guid>
		<description><![CDATA[Edit: I checked with Oracle, and it&#8217;s indeed TimesTen that&#8217;s supposed to be the basis of this new appliance, as per a comment below. That would be less cool, alas. Oracle seems to have said on yesterday&#8217;s conference call Oracle OpenWorld (first week in October) will feature appliances based on Tangosol and Hadoop. As I [...]]]></description>
			<content:encoded><![CDATA[<p><em>Edit: I checked with Oracle, and it&#8217;s indeed TimesTen that&#8217;s supposed to be the basis of this new appliance, as per a comment below. That would be less cool, alas.</em></p>
<p>Oracle seems to have said on yesterday&#8217;s conference call Oracle OpenWorld (first week in October) will feature appliances based on Tangosol and Hadoop. As I post this, <a href="http://seekingalpha.com/article/276425-oracle-s-ceo-discusses-q4-2011-results-earnings-call-transcript?part=qanda">the Seeking Alpha transcript of Oracle&#8217;s call</a> is riddled with typos. Bolded comments below are by me.  <span id="more-4822"></span></p>
<blockquote><p>Well, we&#8217;re planning to add a couple of appliances and announcing them this fall. One appliance, that should surprise you is a large memory addition to Exadata for analytics and memory, so we continue to invest. We thought that would &#8212; we&#8217;ve been the leader of in-memory database technology ever since we bought Tungsten. <strong>I presume that&#8217;s a typo for <a href="../../../../../2007/03/25/oracle-tangosol-objects-caching-and-disruption/">&#8220;Tangosol&#8221;</a>. And it sort of denigrates Oracle TimesTen.</strong> And that&#8217;s for both for transactions and for preprocessing. We are, as memories become cheaper and larger scale, we&#8217;ve changed as much of our algorithms and this in-memory analytics accelerator is going to be, again, coming out and we&#8217;ll be announcing it in the fall at Oracle OpenWorld.</p></blockquote>
<p>That part, especially in connection with the last sentence of the next quote, sounds almost as if Tangosol will be positioned as a  kind of memory-centric object-oriented DBMS, albeit with Oracle as its  persistence layer. Well, I favor both <a href="../../../../../2011/05/23/databases-ram/">in-memory</a> and <a href="../../../../../2011/05/21/object-oriented-database-management-systems-oodbms/">object-oriented</a> DBMS, and especially the intersection of those two categories. So in  principle this could be a very cool product. Exploiting that coolness, however, may require one heck of a missionary sell.</p>
<blockquote><p>In addition, attaching to our Exalogic box, there&#8217;s a lot of misunderstanding about what&#8217;s a dupe is, and is it a replacement for database.<strong> I presume &#8220;a dupe&#8221; is a typo for &#8220;Hadoop&#8221;</strong>. So the dupe is not a replacement for database. It&#8217;s an adjunct to the database, which we think, is very, very important. It really is a tool for Java programmers. And we&#8217;re the world leader in Java technology and we are building a big data accelerator to attach to our Exalogic box, which comes out also this fall. The big data accelerator includes some of the standard open source heavy software, HTFF, the heavy file system and a number of other pieces, but also some Oracle components that we think can dramatically speed up the entire math-produced process. <strong>I presume that&#8217;s a series of typos for &#8220;HDFS&#8221; and &#8220;MapReduce</strong>&#8220;. And will be particularly attractive to Java programmers who are the ones, who asked for &#8212; aspire to do. There are some interesting applications they do, ETL is one. Log processing is another. <strong>Those last two sentences are more evidence for the theory that this is about Hadoop. Besides, I spoke with somebody who listened to the call. <img src='http://www.dbms2.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' /> </strong> We&#8217;re going to have a lot of those features, functions and prebuilt applications in our big data accelerator. So, Oracle has always followed database technology trends, whether it&#8217;s object databases, in-memory databases and kept up with this technology and some, quite often led on innovation.</p></blockquote>
<p>And that part sounds as if Oracle will announce a Hadoop appliance, positioning it more as a Java software accelerator than a place to  store cheap data. Be the positioning as it may, my <a href="../../../../../2011/06/02/why-you-would-want-an-appliance-and-when-you-wouldnt/">objections  to the idea of a Hadoop appliance</a> still stand, although <a href="../../../../../2011/06/02/why-you-would-want-an-appliance-and-when-you-wouldnt/#comment-228238">Amr  Awadallah&#8217;s counterarguments</a> make sense as well.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.dbms2.com/2011/06/24/forthcoming-oracle-appliances/feed/</wfw:commentRss>
		<slash:comments>8</slash:comments>
		</item>
		<item>
		<title>Traditional databases will eventually wind up in RAM</title>
		<link>http://www.dbms2.com/2011/05/23/databases-ram/</link>
		<comments>http://www.dbms2.com/2011/05/23/databases-ram/#comments</comments>
		<pubDate>Mon, 23 May 2011 16:05:24 +0000</pubDate>
		<dc:creator>Curt Monash</dc:creator>
				<category><![CDATA[Analytic technologies]]></category>
		<category><![CDATA[Cache]]></category>
		<category><![CDATA[In-memory DBMS]]></category>
		<category><![CDATA[Memory-centric data management]]></category>
		<category><![CDATA[OLTP]]></category>
		<category><![CDATA[Oracle]]></category>
		<category><![CDATA[Oracle TimesTen]]></category>
		<category><![CDATA[SAP AG]]></category>
		<category><![CDATA[Storage]]></category>
		<category><![CDATA[Theory and architecture]]></category>
		<category><![CDATA[VoltDB and H-Store]]></category>
		<category><![CDATA[memcached]]></category>
		<category><![CDATA[solidDB]]></category>

		<guid isPermaLink="false">http://www.dbms2.com/?p=4520</guid>
		<description><![CDATA[In January, 2010, I posited that it might be helpful to view data as being divided into three categories: Human/Tabular data –i.e., human-generated data that fits well into relational tables or arrays. Human/Nontabular data — i.e., all other data generated by humans. Machine-Generated data. I won&#8217;t now stand by every nuance in that post, which [...]]]></description>
			<content:encoded><![CDATA[<p>In January, 2010, I posited that <a href="http://www.dbms2.com/2010/01/17/three-broad-categories-of-data/">it might be helpful to view data as being divided into three categories</a>:</p>
<ul>
<li><strong>Human/Tabular</strong> data –i.e., human-generated data that  fits well 	into relational tables or arrays.</li>
<li><strong>Human/Nontabular</strong> data — i.e., all other data  generated by humans.</li>
<li><strong>Machine-Generated</strong> data.</li>
</ul>
<p>I won&#8217;t now stand by every nuance in that post, which may differ slightly from those in my more recent posts about <a href="http://www.dbms2.com/2010/12/30/examples-and-definition-of-machine-generated-data/">machine-generated data</a> and <a href="http://www.dbms2.com/2011/05/17/poly-structured-database/">poly-structured databases</a>. But one general idea is hard to dispute:</p>
<p><strong>Traditional database data</strong> &#8212; records of human transactional activity, referred to as &#8220;Human/Tabular data above&#8221; &#8212; <strong>will not grow as fast as Moore&#8217;s Law makes computer chips cheaper.</strong></p>
<p>And that point has a straightforward corollary, namely:</p>
<p><strong>It will become ever more affordable to</strong><strong> put traditional database data entirely into RAM. </strong> <span id="more-4520"></span> </p>
<p>Actually, there are numerous ways for OLTP, other <a href="http://www.dbms2.com/2011/03/30/short-request-and-analytic-processing/">short-request</a>, and some analytic databases to wind up in RAM.</p>
<ul>
<li><a href="http://www.dbms2.com/2009/07/07/hasso-plattner-calls-for-in-memory-oltp-column-stores/">SAP has some good ideas</a> for how it could happen, banging transactions into what is essentially an in-memory analytic database. (I dispute SAP&#8217;s claims of transformational database technology leadership, but that doesn&#8217;t mean the underlying ideas aren&#8217;t good.)</li>
<li>For those who can afford the associated technology disruption, <a href="http://www.dbms2.com/2011/05/21/object-oriented-database-management-systems-oodbms/">memory-centric object-oriented DBMS</a> could be appealing.</li>
<li>Web scalability best practices commonly include keeping data in RAM (e.g., that&#8217;s pretty much the point of caching layer memcached).</li>
<li>SaaS (Software as a Service) companies &#8212; such as <a href="http://www.dbms2.com/2010/08/22/workday-technology-stack/">Workday</a> &#8212; often bring a particular tenant&#8217;s database entirely into RAM.</li>
<li><a href="http://www.dbms2.com/2010/06/12/the-underlying-technology-of-qlikview/">QlikView</a> highlights the benefits of doing business intelligence in RAM.</li>
<li><a href="http://www.dbms2.com/2011/04/21/sas-hpa-does-make-sense-after-all/">SAS HPA</a> makes the argument that even &#8220;big data analytics&#8221; should sometimes be done in RAM.</li>
<li>I don&#8217;t have particularly favorable opinions at this time about marketing strategies or momentum at <a href="http://www.dbms2.com/2008/12/29/ordinary-oltp-dbms-vs-memory-centric-processing/">Oracle TimesTen, IBM solidDB</a>, or <a href="http://www.dbms2.com/2010/06/30/details-and-analysis-of-the-voltdb-argument/">VoltDB</a>, but those examples at least serve to illustrate that memory-centric OLTP DBMS have existed for years.</li>
<li>Actually, SAP has at least two good ideas, if you count <a href="http://www.dbms2.com/2010/02/05/sybase-aleri-rap/">Sybase</a> as part of SAP.</li>
</ul>
<p>And here&#8217;s the kicker: Intel told me last year that <strong>CPUs are headed to 46-bit address spaces around mid-decade.</strong> Indeed, they hired me to help figure out if that was enough.* That multiplies out to <strong>64 terabytes of RAM on a single server,</strong> chip costs permitting. So most of what we now think of as operational databases &#8212; and many of the analytic ones too &#8212; will fit in-memory, even if they run very large businesses.</p>
<p><em>*And did so without putting the discussion under any kind of NDA.</em></p>
<p>Likely consequences of all this include:</p>
<ul>
<li><strong>Legacy apps will</strong> (eventually)<strong> be consolidated and virtualized in-memory.</strong> Their underlying databases will grow so slowly that eventually the cost of putting them in RAM will be too low to worry about.</li>
<li><strong>Expensive storage systems will </strong>(continue to)<strong> be irrelevant to database processing. </strong>Databases that don&#8217;t fit in RAM will typically be big enough to require the attention of a lot of CPUs &#8212; and in those cases the DBMS software itself will handle all the storage tasks.</li>
<li><strong>Major OLTP DBMS vendors, </strong>such as Oracle,<strong> will need alternate in-memory code lines, </strong>because disk-centric architectures are sub-optimal in-memory. Well, that&#8217;s what they have those big R&amp;D budgets for.</li>
<li><strong>SaaS vendors and web businesses may not rely on today&#8217;s major OLTP DBMS vendors.</strong> (I was going to say &#8220;won&#8217;t&#8221; rather than &#8220;may not&#8221; until I recalled the likely M&amp;A endgame.) Traditional enterprises may blanch at migrating away from their legacy DBMS environments, but the trade-offs are different for technology companies using DBMS as subsystems.</li>
</ul>
<p>Of course, the same trends that make data-storing chips cheaper will make data-generating chips cheaper too. So, just as there are huge amounts of machine-generated data that you&#8217;d never pay to store in RAM, the same will still be true 10 years from now; the data volumes involved will just be a lot bigger. And thus there will still be plenty of very large analytic databases using relatively cheap forms of storage, perhaps even disk.</p>
<p>But <strong>OLTP and other short-request processing are likely to wind up in-memory.</strong> And the same may be true for a considerable amount of <strong>analytics,</strong> especially but not only if the analytics have a low-latency requirement.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.dbms2.com/2011/05/23/databases-ram/feed/</wfw:commentRss>
		<slash:comments>18</slash:comments>
		</item>
		<item>
		<title>Object-oriented database management systems (OODBMS)</title>
		<link>http://www.dbms2.com/2011/05/21/object-oriented-database-management-systems-oodbms/</link>
		<comments>http://www.dbms2.com/2011/05/21/object-oriented-database-management-systems-oodbms/#comments</comments>
		<pubDate>Sat, 21 May 2011 10:45:49 +0000</pubDate>
		<dc:creator>Curt Monash</dc:creator>
				<category><![CDATA[Cache]]></category>
		<category><![CDATA[In-memory DBMS]]></category>
		<category><![CDATA[Intersystems and Cache']]></category>
		<category><![CDATA[Memory-centric data management]]></category>
		<category><![CDATA[OLTP]]></category>
		<category><![CDATA[Objectivity and Infinite Graph]]></category>
		<category><![CDATA[Software as a Service (SaaS)]]></category>
		<category><![CDATA[Starcounter]]></category>

		<guid isPermaLink="false">http://www.dbms2.com/?p=4512</guid>
		<description><![CDATA[There seems to be a fair amount of confusion about object-oriented database management systems (OODBMS). Let&#8217;s start with a working definition: An object-oriented database management system (OODBMS, but sometimes just called &#8220;object database&#8221;) is a DBMS that stores data in a logical model that is closely aligned with an application program&#8217;s object model. Of course, [...]]]></description>
			<content:encoded><![CDATA[<p>There seems to be a fair amount of confusion about object-oriented database management systems (OODBMS). Let&#8217;s start with a working definition:</p>
<p><strong>An object-oriented database management system</strong> (OODBMS, but sometimes just called &#8220;object database&#8221;) is a <strong>DBMS that stores data in a logical model that is closely aligned with an application program&#8217;s object model. </strong>Of course, an OODBMS will have a physical data model optimized for the kinds of logical data model it expects.</p>
<p>If you&#8217;re guessing from that definition that there can be difficulties drawing boundaries between the application, the application programming language, the data manipulation language, and/or the DBMS &#8212; you&#8217;re right. Those difficulties have been a big factor in relegating OODBMS to being a relatively niche technology to date.</p>
<p>Examples of what I would call OODBMS include:  <span id="more-4512"></span></p>
<ul>
<li>Intersystems Cache&#8217;, <a href="../../../../../2010/01/15/intersystems-cache-highlights/">the most successful OODBMS product by far</a>, with good OLTP (OnLine Transaction Processing) capabilities and a strong presence in the health care market. Although it was designed around the specialized MUMPS/M language, Cache&#8217; happily talks Java and SQL.</li>
<li><a href="../../../../../2008/02/01/dan-weinreb-on-objectstore/">ObjectStore</a>, a well-pedigreed startup a couple decades ago, which wound up focusing on complex objects in markets such as computer-aided design. ObjectStore was eventually sold to Progress Software, which is positioning ObjectStore more as a <a href="http://web.progress.com/en/objectstore/">distributed caching system</a> than anything else (<a href="../../../../../2005/10/10/the-amazoncom-bookstore-is-a-huge-modern-oltp-app-so-is-it-relational/">Amazon</a> was an impressive reference for that use case). That said, Progress&#8217; ObjectStore business is small, as is its ObjectStore level of effort. Both Cache&#8217; and ObjectStore were at some point unsuccessfully targeted at the XML database market.</li>
<li>Part of <a href="../../../../../2010/08/22/workday-technology-stack/">Workday&#8217;s technology stack</a>. Very-well-pedigreed SaaS (Software as a Service) application vendor Workday decided to go with what amounts to an in-memory OODBMS. This makes all kinds of sense, and is a lot of what rekindled my interest in object-oriented database management.</li>
<li><a href="../../../../../2010/06/19/objectivity-infinite-graph/">Objectivity</a>, also from the 20-years-ago generation, and a poster child for the &#8220;DBMS toolkit as much as a DBMS&#8221; issue.</li>
<li><a href="../../../../../2008/06/08/perst/">McObject Perst</a>, an embeddable memory-centric OODBMS.</li>
<li><a href="../../../../../2008/06/08/perst/">Versant</a>. Actually, by now the Versant company has several different OODBMS; I&#8217;m not sure whether what it&#8217;s selling has much to do with the original Versant product. Anyhow, both the original and current Versant product seem to be positioned in OLTP. Versant has recently suffered from <a href="http://sec.gov/Archives/edgar/data/865917/000104746911000392/a2201670z10-k.htm">declining revenues</a>, in license fees and maintenance alike.</li>
<li><a href="../../../../../2011/05/18/starcounter-high-speed-memory-centric-object-oriented-dbms-coming-soon/">Forthcoming technology from Starcounter</a>, in the area of high-performance memory-centric OLTP. According to my correspondents, Starcounter still needs to explain how its technology is different from what Versant and ObjectStore introduced 20 or so years ago. Interestingly, while ObjectStore shines as a distributed system, Starcounter&#8217;s developers have consigned scale-out to the &#8220;too hard to bother with&#8221; category.</li>
<li>Gemstone, which seemed to be on an ObjectStore-like caching track until it was acquired by VMware.</li>
</ul>
<p>Arguably, OODBMS have all the benefits of <a href="../../../../../2011/02/07/notes-on-document-oriented-nosql/">document-model DBMS</a>, but with different language bindings. And if you&#8217;re going to write in an object-oriented language anyway, those language bindings can seem pretty appealing. In particular, they might be preferable to fighting your way through object/relational mapping.</p>
<p>Other than the double-edged language sword, the main criticism of object-oriented DBMS is that they include a whole lot of pointers. Intersystems and others have shown that, even in a disk-centric world, OODBMS can have excellent performance in OLTP and tolerable performance in simple reporting. As RAM gets cheaper, memory-centric operation becomes ever more viable, making the pointers even less problematic.</p>
<p><strong>Bottom line: If I were starting a SaaS project today, I&#8217;d give serious consideration to memory-centric OODBMS technology. </strong></p>
]]></content:encoded>
			<wfw:commentRss>http://www.dbms2.com/2011/05/21/object-oriented-database-management-systems-oodbms/feed/</wfw:commentRss>
		<slash:comments>15</slash:comments>
		</item>
		<item>
		<title>Starcounter high-speed memory-centric object-oriented DBMS, coming soon</title>
		<link>http://www.dbms2.com/2011/05/18/starcounter-high-speed-memory-centric-object-oriented-dbms-coming-soon/</link>
		<comments>http://www.dbms2.com/2011/05/18/starcounter-high-speed-memory-centric-object-oriented-dbms-coming-soon/#comments</comments>
		<pubDate>Wed, 18 May 2011 10:14:11 +0000</pubDate>
		<dc:creator>Curt Monash</dc:creator>
				<category><![CDATA[Data models and architecture]]></category>
		<category><![CDATA[In-memory DBMS]]></category>
		<category><![CDATA[Memory-centric data management]]></category>
		<category><![CDATA[OLTP]]></category>
		<category><![CDATA[Object]]></category>
		<category><![CDATA[Starcounter]]></category>
		<category><![CDATA[Theory and architecture]]></category>

		<guid isPermaLink="false">http://www.dbms2.com/?p=4502</guid>
		<description><![CDATA[Since posting recently about Starcounter, I&#8217;ve had the chance to actually talk with the company (twice). Hence I know more than before. Starcounter: Has been around as a company since 2006. Has developed memory-centric object-oriented DBMS technology that has been OEMed by a few application software companies (especially in bricks-and-mortar retailing and in online advertising). [...]]]></description>
			<content:encoded><![CDATA[<p>Since posting recently about <a href="../../../../../2011/04/13/starcounter/">Starcounter</a>, I&#8217;ve had the chance to actually talk with the company (twice). Hence I know more than before. <img src='http://www.dbms2.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  Starcounter:</p>
<ul>
<li>Has been around as a company since 2006.</li>
<li>Has developed <strong>memory-centric object-oriented DBMS</strong> technology that has been OEMed by a few application software companies (especially in bricks-and-mortar retailing and in online advertising).</li>
<li>Is planning to actually launch an OODBMS product sometime this summer.</li>
<li>Has 14 employees (most or all of whom are in Sweden, which is also where I think Starcounter&#8217;s current customers are centered).</li>
<li>Is planning to shift emphasis soon to the US market.</li>
</ul>
<p>Starcounter&#8217;s value propositions are <strong>programming ease (no object/relational impedance mismatch) </strong>and<strong> performance. </strong>Starcounter believes its DBMS has 100X the performance of conventional DBMS at short-request transaction processing, and 10X the performance of other memory-centric and/or object-oriented DBMS (e.g. Oracle TimesTen, or Versant). That said, Starcounter has not yet tested VoltDB. Starcounter does not claim performance much beyond that of disk-based DBMS on analytic tasks such as aggregations.</p>
<p>The key technical aspect to Starcounter is <strong>integration between the DBMS and the virtual machine,</strong> so that <strong>the same copy of the data is accessed by both the DBMS and the application program,</strong> without any movement or transformation being needed. (Starcounter isn&#8217;t aware of any other object-oriented DBMS that work this way.) Transient and persistent data are handled in the same way, seamlessly.</p>
<p>Other Starcounter technical highlights include:  <span id="more-4502"></span></p>
<ul>
<li>Starcounter is focused on OLTP (OnLine Transaction Processing).</li>
<li>The Starcounter OODBMS is ACID-compliant.</li>
<li>The Starcounter DBMS is single-server, albeit multi-core. Starcounter thinks this is OK because the Starcounter DBMS can do millions of transactions per second on a server with 8 cores or less. (I neglected to ask how quickly Starcounter thinks RAM would fill up on a single server at that kind of update rate.)</li>
<li>A Starcounter database sits in RAM. Logs go to disk, and Starcounter doesn&#8217;t commit a transaction in RAM until there&#8217;s been an acknowledgement that it&#8217;s been logged to disk.*</li>
<li>Since Starcounter never wants to read from disk (except in the case of recovery), logs can be written pretty much at sequential/batch update speeds.</li>
<li>You can do SQL queries against Starcounter objects, based on T-tree indexes. Otherwise, Starcounter data manipulation is done via what effectively is a proprietary object-oriented data manipulation language. (I neglected to ask whether there was any concept of join, or whether Starcounter SQL just does SELECTs. Starcounter did say that the only schema in a Starcounter database is the object model.)</li>
<li>Naturally, Starcounter objects are compressed, so that the most possible data fits into the fastest possible tier of memory. Proxy objects also come into play here.</li>
<li>Starcounter runs on Windows and .NET, supposedly for reasons of better virtual machine performance. Ports to Linux, Java, etc. are on the drawing boards, but won&#8217;t be particularly easy.</li>
</ul>
<p><em>*I thought Starcounter said that the core that runs the operating system communicates the acknowledgement via Direct Memory Access to a core that runs the Starcounter DBMS, obviating the need for an interrupt (except to the core that runs the operating system). But upon reflection, that doesn&#8217;t really seem to make sense.</em></p>
]]></content:encoded>
			<wfw:commentRss>http://www.dbms2.com/2011/05/18/starcounter-high-speed-memory-centric-object-oriented-dbms-coming-soon/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
		<item>
		<title>What Starcounter may be up to</title>
		<link>http://www.dbms2.com/2011/04/13/starcounter/</link>
		<comments>http://www.dbms2.com/2011/04/13/starcounter/#comments</comments>
		<pubDate>Wed, 13 Apr 2011 17:55:47 +0000</pubDate>
		<dc:creator>Curt Monash</dc:creator>
				<category><![CDATA[In-memory DBMS]]></category>
		<category><![CDATA[Memory-centric data management]]></category>
		<category><![CDATA[Starcounter]]></category>

		<guid isPermaLink="false">http://www.dbms2.com/?p=4289</guid>
		<description><![CDATA[Starcounter seems to be offering an in-memory object-based/object-oriented/whatever short-request DBMS that also talks SQL. I haven&#8217;t been briefed at this point, and hence don&#8217;t have detail beyond what&#8217;s on their rather breathless web site. I&#8217;m guessing this isn&#8217;t an H-Store/VoltDB architecture, but rather something more like what Workday runs. Most of the crunch I found [...]]]></description>
			<content:encoded><![CDATA[<p>Starcounter seems to be offering an in-memory object-based/object-oriented/whatever <a href="http://www.dbms2.com/2011/03/30/short-request-and-analytic-processing/">short-request DBMS</a> that also talks SQL. I haven&#8217;t been briefed at this point, and hence don&#8217;t have detail beyond what&#8217;s on their rather breathless web site. I&#8217;m guessing this isn&#8217;t an <a href="http://www.dbms2.com/2008/02/19/h-store-architecture/">H-Store/VoltDB</a> architecture, but rather something more like what <a href="http://www.dbms2.com/2010/08/22/workday-technology-stack/">Workday</a> runs.</p>
<p>Most of the crunch I found on the <a href="http://www.starcounter.com/developer.php">Starcounter website</a> (emphasis mine) is:</p>
<blockquote><p>Let&#8217;s say that it is possible to make a database that is 10,000 times  faster than what you use today. It would then be possible for your  computer language objects to live inside the database from the very  beginning. From the first { Customer a = new Customer(); }. <strong>The objects  could live in the database, not as a copy, but as both database object  and a Java or C# object at the same time.</strong> The database would  transparently be your heap. The time it would take to save your object  to the database would be reduced to nothing.</p>
<p>If such a database existed,<strong> you could say goodbye to caches and the  duality of business objects, the database objects/rows and the  complexity that follows. </strong>The speed would be amazing. <strong>Goodbye to time  consuming scale-out solutions.</strong> Actually, you would be able to say good  bye to the databases as you know them. You only need your simple  objects.</p>
<p>Such a technology would be the ultimate NoSQL database. But what if the  ultimate NoSQL database had SQL support, ACID, checkpoints and recovery  and other enterprise features? Your pure, clean objects would then  become the fastest and most powerful database in the world.</p></blockquote>
<p>Beside that, other clues to what Starcounter is doing include references to Hibernate and to the declining cost of RAM.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.dbms2.com/2011/04/13/starcounter/feed/</wfw:commentRss>
		<slash:comments>7</slash:comments>
		</item>
	</channel>
</rss>

