<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>DBMS 2 : DataBase Management System Services &#187; MySQL</title>
	<atom:link href="http://www.dbms2.com/category/products-and-vendors/mysql/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.dbms2.com</link>
	<description>Choices in data management and analysis</description>
	<lastBuildDate>Wed, 08 Feb 2012 12:22:57 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.0.3</generator>
		<item>
		<title>Microsoft SQL Server 2012 and enterprise database choices in general</title>
		<link>http://www.dbms2.com/2012/01/24/microsoft-sql-server-2012/</link>
		<comments>http://www.dbms2.com/2012/01/24/microsoft-sql-server-2012/#comments</comments>
		<pubDate>Tue, 24 Jan 2012 14:42:34 +0000</pubDate>
		<dc:creator>Curt Monash</dc:creator>
				<category><![CDATA[Data warehousing]]></category>
		<category><![CDATA[IBM and DB2]]></category>
		<category><![CDATA[Microsoft and SQL*Server]]></category>
		<category><![CDATA[Mid-range]]></category>
		<category><![CDATA[MySQL]]></category>
		<category><![CDATA[NoSQL]]></category>
		<category><![CDATA[Oracle]]></category>

		<guid isPermaLink="false">http://www.dbms2.com/?p=5859</guid>
		<description><![CDATA[Microsoft is launching SQL Server 2012 on March 7. An IM chat with a reporter resulted, and went something like this. Reporter: [Care to comment]? CAM: SQL Server is an adequate product if you don&#8217;t mind being locked into the Microsoft stack. For example, the ColumnStore feature is very partial, given that it can&#8217;t be [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://www.sqlserverlaunch.com/ww/Home">Microsoft is launching SQL Server 2012 on March 7</a>. An IM chat with a reporter resulted, and went something like this.</p>
<p><strong>Reporter: [Care to comment]?</strong><br />
<strong>CAM:</strong> SQL Server is an adequate product if you don&#8217;t mind being locked into the Microsoft stack. For example, the ColumnStore feature is very partial, given that <a href="http://msdn.microsoft.com/en-us/library/gg492088%28v=sql.110%29.aspx#Update">it can&#8217;t be updated</a>; but Oracle doesn&#8217;t have columnar storage at all.</p>
<p><strong>Reporter: Is the lock-in overall worse than IBM DB2, Oracle?</strong><br />
<strong>CAM:</strong> Microsoft locks you into an operating system, so yes.</p>
<p><strong>Reporter: Is this release something larger Oracle or IBM shops could consider as a lower-cost alternative a co-habitation scenario, in the event they&#8217;re mulling whether to buy more Oracle or IBM licenses?</strong><br />
<strong>CAM:</strong> If they have a strong Microsoft-stack investment already, sure. Otherwise, why?</p>
<p><strong>Reporter: [How about] just cost?</strong><br />
<strong>CAM:</strong> DB2 works just as well to keep Oracle honest as SQL Server does, and without a major operating system commitment. For analytic databases you want an analytic DBMS or appliance anyway.</p>
<p>Best is to have one major vendor of OTLP/general-purpose DBMS, a web DBMS, a DBMS for disposable projects (that may be the same as one of the first two), plus however many different analytic data stores you need to get the job done.</p>
<p>By &#8220;web DBMS&#8221; I mean MySQL, NewSQL, or NoSQL. Actually, you might need more than one product in that area.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.dbms2.com/2012/01/24/microsoft-sql-server-2012/feed/</wfw:commentRss>
		<slash:comments>6</slash:comments>
		</item>
		<item>
		<title>Hope for a new PostgreSQL era?</title>
		<link>http://www.dbms2.com/2011/11/23/hope-for-a-new-postgresql-era/</link>
		<comments>http://www.dbms2.com/2011/11/23/hope-for-a-new-postgresql-era/#comments</comments>
		<pubDate>Wed, 23 Nov 2011 14:18:00 +0000</pubDate>
		<dc:creator>Curt Monash</dc:creator>
				<category><![CDATA[Aster Data]]></category>
		<category><![CDATA[EnterpriseDB and Postgres Plus]]></category>
		<category><![CDATA[Greenplum]]></category>
		<category><![CDATA[MySQL]]></category>
		<category><![CDATA[Netezza]]></category>
		<category><![CDATA[Open source]]></category>
		<category><![CDATA[Vertica Systems]]></category>
		<category><![CDATA[salesforce.com]]></category>

		<guid isPermaLink="false">http://www.dbms2.com/?p=5728</guid>
		<description><![CDATA[In a comedy of briefing errors, I&#8217;m not too clear on the details of my client salesforce.com&#8217;s new PostgreSQL-as-a-service offering, nor exactly on what my clients at VMware are bringing to the PostgreSQL virtualization/cloud party. That said: PostgreSQL is good technology. MySQL is narrowing the gap, but PostgreSQL is still ahead of MySQL in some [...]]]></description>
			<content:encoded><![CDATA[<p>In a comedy of briefing errors, I&#8217;m not too clear on the details of my client <a href="http://gigaom.com/cloud/heroku-launches-sql-database-as-a-service/">salesforce.com&#8217;s new PostgreSQL-as-a-service offering</a>, nor exactly on what my clients at VMware are bringing to the PostgreSQL virtualization/cloud party. That said:</p>
<ul>
<li>PostgreSQL is good technology.</li>
<li>MySQL is narrowing the gap, but PostgreSQL is still ahead of MySQL in some ways.  (Database extensibility if nothing else.)</li>
<li>PostgreSQL has a lot of users. (Many of them in academia and/or Russia.)</li>
<li>Neither EnterpriseDB (which now calls itself &#8220;The enterprise PostgreSQL company&#8221;) nor the PostgreSQL community leadership have covered themselves with stewardship glory.</li>
<li>A significant number of interesting DBMS products can be regarded as PostgreSQL forks (e.g. Greenplum, Aster Data nCluster, Netezza if you squint, and Vertica if you stand on your head*).</li>
<li>PostgreSQL advancement is not dead. For example, <a href="../../../../../2011/11/08/hadapt-is-moving-forward/">Hadapt beta users are running actual PostgreSQL on many nodes each</a>.</li>
<li><a href="../../../../../2009/12/14/oracle-mysql-storage-engine/">There&#8217;s no assurance that Oracle will be a benevolent MySQL steward forever</a>. (Specifically, Oracle&#8217;s &#8220;Play nicely with others&#8221; antitrust commitments expire in 2014.)</li>
</ul>
<p>So I think it would be cool if one or the other big company put significant wood behind the PostgreSQL arrow.</p>
<p><em>*While Vertica was originally released using little or no PostgreSQL code &#8212; reports varied &#8212; it featured high degrees of PostgreSQL compatibility.</em></p>
]]></content:encoded>
			<wfw:commentRss>http://www.dbms2.com/2011/11/23/hope-for-a-new-postgresql-era/feed/</wfw:commentRss>
		<slash:comments>7</slash:comments>
		</item>
		<item>
		<title>Transparent relational OLTP scale-out</title>
		<link>http://www.dbms2.com/2011/10/23/transparent-relational-oltp-scale-out/</link>
		<comments>http://www.dbms2.com/2011/10/23/transparent-relational-oltp-scale-out/#comments</comments>
		<pubDate>Mon, 24 Oct 2011 04:19:09 +0000</pubDate>
		<dc:creator>Curt Monash</dc:creator>
				<category><![CDATA[Clustering]]></category>
		<category><![CDATA[IBM and DB2]]></category>
		<category><![CDATA[MySQL]]></category>
		<category><![CDATA[NoSQL]]></category>
		<category><![CDATA[OLTP]]></category>
		<category><![CDATA[Open source]]></category>
		<category><![CDATA[Oracle]]></category>
		<category><![CDATA[Parallelization]]></category>
		<category><![CDATA[Schooner Information Technology]]></category>
		<category><![CDATA[dbShards and CodeFutures]]></category>

		<guid isPermaLink="false">http://www.dbms2.com/?p=5521</guid>
		<description><![CDATA[There’s a perception that, if you want (relatively) worry-free database scale-out, you need a non-relational/NoSQL strategy. That perception is false. In the analytic case it’s completely ridiculous, as has been demonstrated by Teradata, Vertica, Netezza, and various other MPP (Massively Parallel Processing) analytic DBMS vendors. And now it’s false for short-request/OLTP (OnLine Transaction Processing) use [...]]]></description>
			<content:encoded><![CDATA[<p>There’s a perception that, if you want (relatively) worry-free database scale-out, you need a non-relational/NoSQL strategy. That perception is false. In the analytic case it’s completely ridiculous, as has been demonstrated by <a href="../../../../../2011/09/24/confusion-about-teradatas-big-customers/">Teradata</a>, <a href="../../../../../2011/06/20/columnar-dbms-vendor-customer-metrics/">Vertica</a>, Netezza, and various other MPP (Massively Parallel Processing) analytic DBMS vendors. And now it’s false for <a href="../../../../../2011/03/02/short-request-processing/">short-request</a>/OLTP (OnLine Transaction Processing) use cases as well.</p>
<p>My favorite relational OLTP scale-out choice these days is <a href="http://www.dbms2.com/2011/10/23/schooner-pivots-further/">the SchoonerSQL/dbShards partnership</a>. Schooner Information Technology (SchoonerSQL) and Code Futures (dbShards) are young, small companies, but I’m not too concerned about that, because the APIs they want you to write to are just MySQL’s. The main scenarios in which I can see them failing are ones in which they are competitively leapfrogged, either by other small competitors – e.g. ScaleBase, Akiban, TokuDB, or ScaleDB &#8212; or by Oracle/MySQL itself. While that could suck for my clients Schooner and Code Futures, it would still provide users relying on MySQL scale-out with one or more good product alternatives.</p>
<p>Relying on non-MySQL NewSQL startups, by way of contrast, would leave me somewhat more concerned. (However, if their code is open sourced. you have at least some vendor-failure protection.) And big-vendor scale-out offerings, such as Oracle RAC or <a href="../../../../../2011/05/06/db2-oltp-scale-out-purescale/">DB2 pureScale</a>, may be more complex to deploy and administer than the MySQL and NewSQL alternatives.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.dbms2.com/2011/10/23/transparent-relational-oltp-scale-out/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>Schooner pivots further</title>
		<link>http://www.dbms2.com/2011/10/23/schooner-pivots-further/</link>
		<comments>http://www.dbms2.com/2011/10/23/schooner-pivots-further/#comments</comments>
		<pubDate>Mon, 24 Oct 2011 04:18:11 +0000</pubDate>
		<dc:creator>Curt Monash</dc:creator>
				<category><![CDATA[Clustering]]></category>
		<category><![CDATA[MySQL]]></category>
		<category><![CDATA[OLTP]]></category>
		<category><![CDATA[Oracle]]></category>
		<category><![CDATA[Parallelization]]></category>
		<category><![CDATA[Schooner Information Technology]]></category>
		<category><![CDATA[dbShards and CodeFutures]]></category>

		<guid isPermaLink="false">http://www.dbms2.com/?p=5523</guid>
		<description><![CDATA[Schooner Information Technology started out as a complete-system MySQL appliance vendor. Then Schooner went software-only, but continued to brag about great performance in configurations with solid-state drives. Now Schooner has pivoted further, and is emphasizing high availability, clustered performance, and other hardware-agnostic OLTP (OnLine Transaction Processing) features. Fortunately, Schooner has some interesting stuff in those [...]]]></description>
			<content:encoded><![CDATA[<p>Schooner Information Technology started out as a complete-system MySQL appliance vendor. Then <a href="../../../../../2011/01/28/schooner-software-onl/">Schooner went software-only, but continued to brag about great performance in configurations with solid-state drives</a>. Now Schooner has pivoted further, and is emphasizing high availability, clustered performance, and other hardware-agnostic OLTP (OnLine Transaction Processing) features. Fortunately, Schooner has some interesting stuff in those areas to talk about.</p>
<p>The short form of the SchoonerSQL (as Schooner’s product is now called) story goes roughly like this:</p>
<ul>
<li>SchoonerSQL      replicates data &#8212; synchronously if the replication target is local,      asynchronously if it is remote.</li>
<li>Local      synchronous replication provides high availability; remote asynchronous      replication provides disaster recovery.</li>
<li>SchoonerSQL’s      local synchronous replication also provides read scale-out.</li>
<li>Schooner      has a partnership with Code Futures/dbShards to provide write scale-out      via <a href="../../../../../2011/02/24/transparent-sharding/">transparent      sharding</a>.</li>
<li>SchoonerSQL      has some secret sauce in replication performance. This has the effect of      significantly increasing write performance (assuming you were going to      replicate anyway), because otherwise you might have to slow down the      master server&#8217;s write performance so that the slaves can keep up with it.</li>
<li>Schooner      believes it still has some single-server performance advantages as well.</li>
</ul>
<p><span id="more-5523"></span><em>Just to be clear here: Schooner is my client. Code Futures is my  client. I introduced them and suggested their partnership. I then  introduced them both to a user client, who was sufficiently impressed to  seriously evaluate both of them. Even so, some of the Schooner  replication story only became clear to me when I visited last Friday.</em></p>
<p>To flesh that out a bit more:</p>
<ul>
<li>Schooner      has been making performance tweaks to the MySQL/InnoDB stack for several      years. Some of them are still relevant, and can offer 50-100% performance      improvements, especially but not only if you use solid-state storage.</li>
<li>Schooner      has found a way to scale up the slave side of master-slave replication,      both in the synchronous and asynchronous cases.
<ul>
<li>The       core idea of SchoonerSQL parallel (i.e. scale-up) replication is       straightforward. The replication log streams in. A chunk is sent off to a       CPU core. The next chunk is examined for conflicts with the first chunk.       If there are none, it is sent off to a different core to be processed</li>
<li>Thus,       you can have both local replication/high availability and remote       replication/disaster recovery without slowing down writes on the master       the way you may have to with other alternatives.</li>
<li>Schooner       believes this feature alone can make SchoonerSQL 3X faster than MySQL       alternatives, at least for writes.</li>
</ul>
</li>
</ul>
<p>At least in casual conversation, Schooner synthesizes its 1.5-2X single-server figure and up-to-3X clustering figure into a single claim of frequent 2-5X speedup over generic MySQL/InnoDB. But the usual caveats about vendor-supplied performance numbers of course apply.</p>
<p>Finally, some housekeeping:</p>
<ul>
<li>At      Oracle’s polite request,* Schooner changed its product name to not mention      MySQL; hence the moniker SchoonerSQL.</li>
<li>SchoonerSQL      is being launched Monday.</li>
<li>Schooner      has determined that if its version numbers are different from MySQL’s,      confusion ensues. So the first ever version under the name SchoonerSQL is      SchoonerSQL 5.1, a factoid that Schooner is wisely omitting from the      official product launch press release.</li>
<li>The      synchronous part of SchoonerSQL’s replication story dates back to last      January’s product release.</li>
<li>Most      of the asynchronous part of SchoonerSQL story is new for Monday.</li>
<li>dbShards/SchoonerSQL      partnership engineering is still underway.</li>
</ul>
<p><em><span style="text-decoration: line-through;">*I mean that non-facetiously. </span><span style="text-decoration: line-through;">Schooner’s MySQL OEM contract was such that Oracle didn’t have a legal hammer to force the change.</span> Edit: Whoops! There turn out to have been inaccuracies in the original version of this footnote, which I now regret writing. The contract isn&#8217;t exactly OEM, and there actually were some trademark-based legal hammers.<span style="text-decoration: line-through;"><br />
</span></em></p>
]]></content:encoded>
			<wfw:commentRss>http://www.dbms2.com/2011/10/23/schooner-pivots-further/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
		<item>
		<title>Defining NoSQL</title>
		<link>http://www.dbms2.com/2011/10/02/defining-nosql/</link>
		<comments>http://www.dbms2.com/2011/10/02/defining-nosql/#comments</comments>
		<pubDate>Mon, 03 Oct 2011 00:32:02 +0000</pubDate>
		<dc:creator>Curt Monash</dc:creator>
				<category><![CDATA[Cassandra]]></category>
		<category><![CDATA[MarkLogic]]></category>
		<category><![CDATA[MySQL]]></category>
		<category><![CDATA[Object]]></category>
		<category><![CDATA[Open source]]></category>
		<category><![CDATA[Petabyte-scale data management]]></category>
		<category><![CDATA[Schooner Information Technology]]></category>
		<category><![CDATA[dbShards and CodeFutures]]></category>

		<guid isPermaLink="false">http://www.dbms2.com/?p=5394</guid>
		<description><![CDATA[A reporter tweeted:  &#8221;Is there a simple plain English definition for NoSQL?&#8221; After reminding him of my cynical yet accurate Third Law of Commercial Semantics, I gave it a serious try, and came up with the following. More precisely, I tweeted the bolded parts of what&#8217;s below; the rest is commentary added for this post. NoSQL [...]]]></description>
			<content:encoded><![CDATA[<p>A reporter tweeted:  &#8221;Is there a simple plain English definition for NoSQL?&#8221; After reminding him of my cynical yet accurate <a href="http://www.strategicmessaging.com/no-market-categorization-is-ever-precise/2011/03/01/">Third Law of Commercial Semantics</a>, I gave it a serious try, and came up with the following. More precisely, I tweeted the bolded parts of what&#8217;s below; the rest is commentary added for this post.</p>
<p><strong>NoSQL is most easily defined by what it excludes: SQL, joins, strong analytic alternatives to those, and some forms of database integrity. If you leave all four out, and you have a strong scale-out story, you&#8217;re in the NoSQL mainstream.</strong>   <span id="more-5394"></span></p>
<ul>
<li>Thus, I&#8217;d say Cassandra, HBase, Mongo DB, and Couchbase are prime examples, in no particular order. Riak as well.</li>
<li>I might have phrased that better if I&#8217;d used a different word than simply &#8220;strong&#8221; &#8212; but hey, there was a 140-character limit, and he was on deadline.</li>
</ul>
<p><strong>Using NoSQL can make sense when at least one of two things is paramount: low-cost scale-out or dynamic schemas.</strong></p>
<ul>
<li>There are some seriously sensible use cases for <a href="../../../../../2011/07/31/dynamic-fixed-schema-databases/">dynamic schemas</a>.</li>
<li>&#8220;Low-cost&#8221; generally boils down to:
<ul>
<li>Performance.</li>
<li>Open source free-like-beer.</li>
<li>Not a lot of database administration.</li>
</ul>
</li>
</ul>
<p>I&#8217;ve generally given object-oriented DBMS vendors and also MarkLogic hard times whenever they consider saying they&#8217;re &#8220;NoSQL&#8221;. Reasons include:</p>
<ul>
<li>Closed source.</li>
<li>Database administration overhead (even if you get good stuff for incurring that overhead, like MarkLogic&#8217;s comprehensive indexing).</li>
</ul>
<p>Also, NoSQL started out being ACID-unfriendly.</p>
<p><strong>What you give up are the query flexibility and the easily automatic data integrity of SQL-based systems.</strong> I should have added something about a mature ecosystem.</p>
<p>In the most recent live example, I influenced a <a href="../../../../../2011/09/19/oltp-disk-solid-state/">client</a> away from Cassandra and toward scale-out MySQL (dbShards and/or Schooner flavors, most likely). Part of the reason was the ability to do joins, which are useful in their application. Another part is that their development practices obviated any significant benefit from dynamic schemas. But perhaps the most important &#8212; or at least resonant &#8212; reason of all was that they really, really cared about .NET support.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.dbms2.com/2011/10/02/defining-nosql/feed/</wfw:commentRss>
		<slash:comments>6</slash:comments>
		</item>
		<item>
		<title>Oracle NoSQL is unlikely to be a big deal</title>
		<link>http://www.dbms2.com/2011/09/30/oracle-nosql/</link>
		<comments>http://www.dbms2.com/2011/09/30/oracle-nosql/#comments</comments>
		<pubDate>Fri, 30 Sep 2011 18:20:53 +0000</pubDate>
		<dc:creator>Curt Monash</dc:creator>
				<category><![CDATA[MySQL]]></category>
		<category><![CDATA[NoSQL]]></category>
		<category><![CDATA[OLTP]]></category>
		<category><![CDATA[Object]]></category>
		<category><![CDATA[Open source]]></category>
		<category><![CDATA[Oracle]]></category>
		<category><![CDATA[Parallelization]]></category>
		<category><![CDATA[Schooner Information Technology]]></category>
		<category><![CDATA[Structured documents]]></category>

		<guid isPermaLink="false">http://www.dbms2.com/?p=5384</guid>
		<description><![CDATA[Alex Williams noticed that there will be a NoSQL session at Oracle OpenWorld next week, and is wondering whether this will be a big deal. I think it won&#8217;t be. There really are three major points to NoSQL. Dynamic schemas. This is the only one of the three that truly depends on NoSQL. Scale-out short-request [...]]]></description>
			<content:encoded><![CDATA[<p>Alex Williams noticed that there will be a NoSQL session at Oracle OpenWorld next week, and is wondering whether this will be a big deal. I think it won&#8217;t be.</p>
<p>There really are three major points to NoSQL.</p>
<ul>
<li><strong><a href="http://www.dbms2.com/2011/07/31/dynamic-fixed-schema-databases/">Dynamic schemas</a>.</strong> This is the only one of the three that truly depends on NoSQL.</li>
<li><strong>Scale-out <a href="http://www.dbms2.com/2011/03/30/short-request-and-analytic-processing/">short-request processing</a>.</strong> If you want to scale out efficiently at high request volumes, you&#8217;re best off not using all the flexibility SQL/relational DBMS offer. (In particular, you don&#8217;t want to do cross-node joins). Not coincidentally, a number of the best scale-out offerings were built to be NoSQL.</li>
<li><strong>Open source</strong>. Doing a relational DBMS is a big project. It seems easier to build NoSQL ones.</li>
</ul>
<p>Oracle can address the latter two points as aggressively as it wishes via MySQL. It so happens I would generally recommend MySQL enhanced by dbShards, Schooner, and/or dbShards/Schooner, rather than Oracle-only MySQL &#8230; but that&#8217;s a detail. In some form or other, Oracle&#8217;s MySQL is a huge player in the scale-out, open source, short-request database management market.</p>
<p>So that leaves us with dynamic schemas. Oracle has at least four different sets of technology in that area:</p>
<ul>
<li> As <a href="http://www.dbms2.com/2010/08/22/workday-technology-stack/">Workday</a> noticed years ago, MySQL can be used as a functional, basic key-value store.</li>
<li>Oracle also has XML-based Berkeley DB/SleepyCat kicking around.*</li>
<li>The XML extensions to Oracle&#8217;s core DBMS could be alleged to have a dynamic schema/NoSQL flavor. (Blech.)</li>
<li>A dynamic schema argument could also be made for object-oriented DBMS technology. While Oracle doesn&#8217;t to my knowledge exactly sell that, it does have the <a href="http://www.dbms2.com/2007/03/25/oracle-tangosol-objects-caching-and-disruption/">Tangosol</a> Coherence line of technology, with a potentially similar programming model.</li>
</ul>
<p>If Oracle is now refreshing and rebranding one or more of these as &#8220;NoSQL&#8221;, there&#8217;s no reason to view that as a big deal at all.</p>
<p><em>*That&#8217;s Mike Olson&#8217;s former company, if you&#8217;re keeping score at home.</em></p>
]]></content:encoded>
			<wfw:commentRss>http://www.dbms2.com/2011/09/30/oracle-nosql/feed/</wfw:commentRss>
		<slash:comments>11</slash:comments>
		</item>
		<item>
		<title>Oracle Database Appliance soundbites</title>
		<link>http://www.dbms2.com/2011/09/21/oracle-database-appliance-soundbites/</link>
		<comments>http://www.dbms2.com/2011/09/21/oracle-database-appliance-soundbites/#comments</comments>
		<pubDate>Wed, 21 Sep 2011 19:59:21 +0000</pubDate>
		<dc:creator>Curt Monash</dc:creator>
				<category><![CDATA[MySQL]]></category>
		<category><![CDATA[Oracle]]></category>
		<category><![CDATA[Software as a Service (SaaS)]]></category>

		<guid isPermaLink="false">http://www.dbms2.com/?p=5286</guid>
		<description><![CDATA[It turns out that Oracle&#8217;s new small appliance isn&#8217;t really an Exadata Mini-Me. Rather, the Oracle Database Appliance is &#8212; well, it seems to be a box with an Oracle DBMS in it. (Plus Oracle RAC and so on.) The whole thing is priced for and targeted at the SMB (Small &#38; Medium Business) market, [...]]]></description>
			<content:encoded><![CDATA[<p>It turns out that Oracle&#8217;s new small appliance isn&#8217;t really an <a href="http://www.dbms2.com/2011/09/19/exadata-mini-me/">Exadata Mini-Me</a>. Rather, the Oracle Database Appliance is &#8212; well, it seems to be a box with an Oracle DBMS in it. (Plus Oracle RAC and so on.) The whole thing is priced for and targeted at the SMB (Small &amp; Medium Business) market, whatever that means to Oracle.</p>
<p>I&#8217;m not hugely optimistic about the Oracle Database Appliance. Rather, my thoughts &#8212; lightly edited from a chat with a reporter &#8212; include:</p>
<ul>
<li>This doesn&#8217;t solve Oracle&#8217;s SMB problems, which include:
<ul>
<li>Oracle software is too difficult and costly to administer. The appliance will make a modest dent in that one, but it&#8217;s not any kind of game-changer, because the issues relate to the antique design of the Oracle DBMS. (I.e., I think ongoing database administration is a bigger deal than, say, one-time system set-up.)</li>
<li>SMBs use third-party applications whenever they can, with an increasing preference for SaaS.  Application and SaaS vendors prefer non-Oracle alternatives when they are feasible.</li>
</ul>
</li>
<li>Thus, Oracle is not well positioned to thrive in the SMB market &#8230; except maybe through its MySQL subsidiary, but that has a long way to go too.</li>
<li>Clayton Christensen&#8217;s <strong><em>The Innovator&#8217;s Solution</em></strong> teaches us that Oracle should focus on selling a thick stack of technology to its highest-end customers, and that&#8217;s exactly what Oracle does focus on.</li>
</ul>
]]></content:encoded>
			<wfw:commentRss>http://www.dbms2.com/2011/09/21/oracle-database-appliance-soundbites/feed/</wfw:commentRss>
		<slash:comments>12</slash:comments>
		</item>
		<item>
		<title>Are there any remaining reasons to put new OLTP applications on disk?</title>
		<link>http://www.dbms2.com/2011/09/19/oltp-disk-solid-state/</link>
		<comments>http://www.dbms2.com/2011/09/19/oltp-disk-solid-state/#comments</comments>
		<pubDate>Mon, 19 Sep 2011 18:07:07 +0000</pubDate>
		<dc:creator>Curt Monash</dc:creator>
				<category><![CDATA[Analytic technologies]]></category>
		<category><![CDATA[Cloud computing]]></category>
		<category><![CDATA[Clustering]]></category>
		<category><![CDATA[Data warehousing]]></category>
		<category><![CDATA[Facebook]]></category>
		<category><![CDATA[Infobright]]></category>
		<category><![CDATA[MySQL]]></category>
		<category><![CDATA[OLTP]]></category>
		<category><![CDATA[Open source]]></category>
		<category><![CDATA[Parallelization]]></category>
		<category><![CDATA[Software as a Service (SaaS)]]></category>
		<category><![CDATA[Solid-state memory]]></category>
		<category><![CDATA[dbShards and CodeFutures]]></category>

		<guid isPermaLink="false">http://www.dbms2.com/?p=5257</guid>
		<description><![CDATA[Once again, I&#8217;m working with an OLTP SaaS vendor client on the architecture for their next-generation system. Parameters include: 100s of gigabytes of data at first, growing to &#62;1 terabyte over time. High peak loads. Public cloud portability (but they have private data centers they can use today). Simple database design &#8212; not a lot [...]]]></description>
			<content:encoded><![CDATA[<p>Once again, I&#8217;m working with an OLTP SaaS vendor client on the architecture for their next-generation system. Parameters include:</p>
<ul>
<li>100s of gigabytes of data at first, growing to &gt;1 terabyte over time.</li>
<li>High peak loads.</li>
<li>Public cloud portability (but they have <strong>private data centers they can use today).</strong></li>
<li>Simple database design &#8212; not a lot of tables, not a lot of columns, not a lot of joins, and everything can be distributed on the same customer_ID key.</li>
<li>Stream the data to a data warehouse, that will grow to a few terabytes. (Keeping only one year of OLTP data online actually makes sense in this application, but of course everything should go into the DW.)</li>
</ul>
<p>So I&#8217;m leaning to saying:   <span id="more-5257"></span></p>
<ul>
<li>They should go with a scalable, MySQL-based solution.
<ul>
<li>Lots of third-party software works with MySQL, in case that&#8217;s helpful.</li>
<li>Yes, any one vendor is small and not yet firmly established, but there are numerous vendors around with interesting MySQL scaling stories.</li>
<li>In a vendor emergency, just going with Oracle&#8217;s MySQL stuff would probably work &#8230;</li>
<li>&#8230; especially because there are these lovely things in the world called <strong>solid-state drives.</strong></li>
<li>There&#8217;s also good escapability if one wants to move away from MySQL, because everybody knows how to handle MySQL data.</li>
</ul>
</li>
<li>The first product to look at is dbShards, because it meets all the topology needs:
<ul>
<li>Local scale-out (<a href="http://www.dbms2.com/2011/02/24/transparent-sharding/">transparent sharding</a>).</li>
<li><a href="http://www.dbms2.com/2011/02/09/clarification-on-dbshards-shard-replication/">Local high availability</a>.</li>
<li>Remote disaster recovery (details of that are underway).</li>
</ul>
</li>
<li>The first analytic DBMS to look at is Infobright.
<ul>
<li>Yes, I know Infobright is focused more on machine-generated data these days, but this client&#8217;s analytic needs are so straightforward Infobright should pass with flying colors.</li>
<li>The MySQL-to-MySQL aspect should make ETL dead simple.</li>
<li>Again, there&#8217;s escapability.</li>
</ul>
</li>
</ul>
<p>Mainly, this is all fine. But I&#8217;m getting pushback on the solid-state aspect, for fear that it will compromise public cloud portability.</p>
<p>Am I missing something here? As far as I&#8217;m concerned, <strong>if you&#8217;re planning an OLTP system with a many-year lifespan today, </strong>of course <strong>you should assume solid-state storage.</strong> Maybe you scale out just as far as you would with disk, striping indexes or entire databases across the RAM of multiple servers. It that case, having solid-state backing reduces the risk of bottlenecks. Maybe you don&#8217;t scale out as far as you would with disk. In that case, solid-state backing saves you money.</p>
<p><strong>As for public-cloud support for solid-state storage, that&#8217;s coming fast, right? </strong>(Actually, I have data points in support of that theory, but they&#8217;re a bit tenuous.) A large fraction of web businesses with private data centers seem to be using solid-state storage &#8212; from Facebook on down &#8212; or so the NoSQL/NewSQL/<a href="http://www.dbms2.com/2011/03/02/short-request-processing/">short-request</a> DBMS guys tell me. Surely a number of public cloud vendors are close behind.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.dbms2.com/2011/09/19/oltp-disk-solid-state/feed/</wfw:commentRss>
		<slash:comments>13</slash:comments>
		</item>
		<item>
		<title>Couchbase technical update</title>
		<link>http://www.dbms2.com/2011/08/13/couchbase-technical-update/</link>
		<comments>http://www.dbms2.com/2011/08/13/couchbase-technical-update/#comments</comments>
		<pubDate>Sun, 14 Aug 2011 04:08:03 +0000</pubDate>
		<dc:creator>Curt Monash</dc:creator>
				<category><![CDATA[Cache]]></category>
		<category><![CDATA[Clustering]]></category>
		<category><![CDATA[Couchbase]]></category>
		<category><![CDATA[Memory-centric data management]]></category>
		<category><![CDATA[MySQL]]></category>
		<category><![CDATA[Parallelization]]></category>
		<category><![CDATA[Solid-state memory]]></category>
		<category><![CDATA[memcached]]></category>

		<guid isPermaLink="false">http://www.dbms2.com/?p=5081</guid>
		<description><![CDATA[My Couchbase business update with Bob Wiederhold was very interesting, but it didn&#8217;t answer much about the actual Couchbase product. For that, I talked with Dustin Sallings. We jumped around a lot, and some important parts of the Couchbase product haven&#8217;t had their designs locked down yet anyway. But here&#8217;s at least a partial explanation [...]]]></description>
			<content:encoded><![CDATA[<p>My <a href="http://www.dbms2.com/2011/08/13/couchbase-business-update/">Couchbase business update</a> with Bob Wiederhold was very interesting, but it didn&#8217;t answer much about the actual Couchbase product. For that, I talked with Dustin Sallings. We jumped around a lot, and some important parts of the Couchbase product haven&#8217;t had their designs locked down yet anyway. But here&#8217;s at least a partial explanation of what&#8217;s up.</p>
<p>memcached is a way to cache data in RAM across a cluster of servers and have it all look logically like a single memory pool, extremely popular among large internet companies. The Membase product &#8212; which is what Couchbase has been selling this year &#8212; adds persistence to memcached, an obvious improvement on requiring application developers to write both to memcached and to <a href="http://www.dbms2.com/2011/02/24/transparent-sharding/">non-transparently-sharded MySQL</a>. The main technical points in adding persistence seem to have been:</p>
<ul>
<li>A <strong>persistent backing store</strong> (duh), namely SQLite.</li>
<li>A <strong>change to the hashing algorithm,</strong> to avoid losing data when the cluster configuration is changed.</li>
</ul>
<p>Couchbase is essentially Membase improved by integrating CouchDB into it, with the main changes being:</p>
<ul>
<li><strong>Changing the backing store to CouchDB</strong> (duh). This will be in the first Couchbase release.</li>
<li><strong>Adding cross data center replication on CouchDB&#8217;s consistency model.</strong> This will not, I believe, be in the first Couchbase release.</li>
<li><strong>Offering CouchDB&#8217;s programming and query interfaces as an option.</strong> So far as I can tell, this will be implemented straightforwardly in the first Couchbase release, with elegance planned for later down the road.</li>
</ul>
<p>Let&#8217;s drill down a bit into <strong>Membase/Couchbase clustering and consistency. </strong><span id="more-5081"></span></p>
<ul>
<li>When data is written to RAM in memcached, it immediately gets copied to another server. The same is of course true in Membase/Couchbase. The terminology on all this is confusing, but I think:
<ul>
<li>The portion of data that is stored as a primary copy on any given server is called a &#8220;shard&#8221;.</li>
<li>That would seem to make sense, as that data could correspond to what goes &#8212; <a href="http://www.dbms2.com/2011/02/24/transparent-sharding/">non-transparently</a> &#8212; into an instance of MySQL in a classical memcached/MySQL set-up.</li>
</ul>
</li>
<li>Updates are of course also banged to disk ASAP &#8212; but at times of heavy load, that can take a while. A few seconds to a couple of minutes is normal operation; if it takes an hour, you really should buy more hardware. (Or solid-state storage.)</li>
<li>Similarly, the replication of data to a second machine&#8217;s RAM may not happen at times of heavy load &#8212; and that&#8217;s another sign you don&#8217;t have enough machines.</li>
<li>Each Membase/Couchbase &#8220;shard&#8221; has lots of logical sub-shards.* (1024 for now, at least as default, although Dustin finds that number excessive and is looking to lower it.)  So if you add a node, some of the sub-shards get sent over to the new node. Unlike the case for straight memcached, no data is lost from cache (and of course not also from the persistent store). Blocking of operations from such a move only happens in narrow time windows, and then only in edge cases.</li>
</ul>
<p><em>*Edit: They&#8217;re called <a href="http://dustin.github.com/2010/06/29/memcached-vbuckets.html">vbuckets</a>.</em></p>
<p><em></em>So if we consider Membase technology alone, Couchbase is CA in the CAP Theorem.  CouchDB, however, is gloriously AP in the CAP Theorem, in that it was written to assume an occasionally connected topology.* Based on that, Couchbase will allow AP operation between data centers (i.e. &#8220;stay synchronized if you can, to within the limitations of physics and so on, but don&#8217;t beat yourself up on the rare occasions that you can&#8217;t.&#8221;) I don&#8217;t know that that capability will quite be in the first release of Couchbase, but it&#8217;s coming soon.</p>
<p><em>*CouchDB also has other features friendly to occasionally-connected use cases, such as a lot of flexibility as to which parts of the database are or aren&#8217;t synced when you do reconnect. These are at the heart of the Couchbase Mobile offering.</em></p>
<p>memcached and Membase have a very simple key-value interface. CouchDB adds secondary indexes and so on. I think in the first release of Couchbase this is pretty much like having two different APIs for the same product; more elegant integration is planned down the road, and more language support as well.</p>
<p>The highest-performing way to use Couchbase will probably always be to just pretend it is Membase, which is to say memcached+. Dustin told me of Membase users who demanded 10-40 millisecond response times, and that not even for single queries but rather for sequences of several queries in succession. He further told me of customers asking for 1-200 microsecond response, and insisting on no worse than 1 millisecond. Frankly, the first requirement could be met by lots of technologies I can think of, at least if  you don&#8217;t rely on disk; the second is thoroughly impossible if you rely on disk, and pretty demanding no matter what kind of hardware and storage you have.</p>
<p>Couchbase performance against disk is a work in progress. CouchDB started out 8X slower than SQLite as a backing store, apples to apples, but Couchbase is fixing that before they roll the product out. (After all, they wouldn&#8217;t want to slow the product down in the course of an upgrade.) Beyond that, when you do exploit the indexing capability of CouchDB, performance of course slows down. Work is underway to lower the performance hit; I imagine much improvement can indeed be made, given how few resources CouchDB has been able to devote to date to <a href="http://www.dbms2.com/2009/08/21/bottleneck-whack-a-mole/">Bottleneck Whack-A-Mole</a>.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.dbms2.com/2011/08/13/couchbase-technical-update/feed/</wfw:commentRss>
		<slash:comments>6</slash:comments>
		</item>
		<item>
		<title>Couchbase business update</title>
		<link>http://www.dbms2.com/2011/08/13/couchbase-business-update/</link>
		<comments>http://www.dbms2.com/2011/08/13/couchbase-business-update/#comments</comments>
		<pubDate>Sun, 14 Aug 2011 04:02:42 +0000</pubDate>
		<dc:creator>Curt Monash</dc:creator>
				<category><![CDATA[Basho and Riak]]></category>
		<category><![CDATA[Cassandra]]></category>
		<category><![CDATA[CouchDB]]></category>
		<category><![CDATA[Couchbase]]></category>
		<category><![CDATA[Games and virtual worlds]]></category>
		<category><![CDATA[HBase]]></category>
		<category><![CDATA[Market share and customer counts]]></category>
		<category><![CDATA[Mid-range]]></category>
		<category><![CDATA[MySQL]]></category>
		<category><![CDATA[NoSQL]]></category>
		<category><![CDATA[Open source]]></category>
		<category><![CDATA[Software as a Service (SaaS)]]></category>
		<category><![CDATA[Structured documents]]></category>
		<category><![CDATA[Web analytics]]></category>
		<category><![CDATA[memcached]]></category>

		<guid isPermaLink="false">http://www.dbms2.com/?p=5080</guid>
		<description><![CDATA[I decided I needed some Couchbase drilldown, on business and technology alike, so I had solid chats with both CEO Bob Wiederhold and Chief Architect Dustin Sallings. Pretty much everything I wrote at the time Membase and CouchOne merged to form Couchbase (the company) still holds up. But I have more detail now. Context for [...]]]></description>
			<content:encoded><![CDATA[<p>I decided I needed some Couchbase drilldown, on business and technology alike, so I had solid chats with both CEO Bob Wiederhold and Chief Architect Dustin Sallings. Pretty much everything I wrote at the time <a href="../../../../../2011/02/08/couchbase-membase-couchone-couchdb/">Membase and CouchOne merged to form Couchbase</a> (the company) still holds up. But I have more detail now. <img src='http://www.dbms2.com/wp-includes/images/smilies/icon_wink.gif' alt=';)' class='wp-smiley' /> </p>
<p>Context for any comments on customer traction includes:</p>
<ul>
<li>Membase went into limited production release in October, and full release in January. Similar things are true of CouchDB.</li>
<li>Hence, most sales of Couchbase&#8217;s products have been made over the past 6 months.</li>
<li>Couchbase (the merged product) is at this point only in a pre-production developer&#8217;s release.</li>
<li>Couchbase has both a direct sales force and a classic open-source &#8220;funnel&#8221;-based online selling model. Naturally, Couchbase&#8217;s understanding of what its customers are doing is more solid with respect to the direct sales base.</li>
<li>Most of Couchbase&#8217;s revenue to date seems to have come from a limited number of big-ticket &#8220;lighthouse&#8221; accounts (as opposed to, say, the larger number of smaller deals that come in through the online funnel).</li>
</ul>
<p>That said,</p>
<ul>
<li>Most Membase purchases are for new applications, as opposed to memcached migrations. However, customers are the kinds of companies that probably also are using memcached elsewhere.</li>
<li>Most other Membase purchases are replacements for the Membase/MySQL combination. Bob says those are easy sales with short sales cycles.</li>
<li>Pure memcached support is a small but non-zero business for Couchbase, and a fine source of upsell opportunities.</li>
<li>In the pipeline but not so much yet in the customer base are SaaS vendors and the like who use and may want to replace traditional DBMS such as Oracle. Other than among those, Couchbase doesn&#8217;t compete much yet with Oracle et al.</li>
<li>Pure CouchDB isn&#8217;t all that much of a business, at least relative to community size, as CouchDB is a single-server product commonly used by people who are content not to pay for support.</li>
</ul>
<p>Membase sales are concentrated in five kinds of internet-centric companies, which in declining order are: <span id="more-5080"></span></p>
<ul>
<li>Social gaming</li>
<li>Ad platforms</li>
<li>Online retail</li>
<li>Online business, including B2B  SaaS</li>
<li>Social networking</li>
</ul>
<p>Bob said that Couchbase often sees MongoDB competitively, but never Riak, HBase, or Redis. I got the impression Couchbase sees at least a little Cassandra. That would, of course, all pertain only to direct sales, rather than download/community kinds of usage.</p>
<p>Couchbase is also excited about the potential for the CouchDB-based Couchbase Mobile occasionally-connected offering. The hottest use cases, interestingly, seem to be non-consumer; Bob rattled off military, farming, and health care, and surely could have named more besides. However, the Couchbase Mobile sales effort still seems to be in early days, as is evidenced by the fact that Couchbase has not yet competitively encountered <a href="../../../../../2010/07/17/sybase-sql-anywhere/">Sybase SQL Anywhere</a>.</p>
<p>With all that said, I&#8217;ll go now to a separate post for a <a href="http://www.dbms2.com/2011/08/13/couchbase-technical-update/">Couchbase technical update</a>.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.dbms2.com/2011/08/13/couchbase-business-update/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
	</channel>
</rss>

<!-- Dynamic page generated in 0.313 seconds. -->
<!-- Cached page generated by WP-Super-Cache on 2012-02-08 11:24:15 -->

