<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>DBMS 2 : DataBase Management System Services &#187; Tokutek</title>
	<atom:link href="http://www.dbms2.com/category/products-and-vendors/tokutek/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.dbms2.com</link>
	<description>Choices in data management and analysis</description>
	<lastBuildDate>Thu, 09 Feb 2012 09:21:51 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.0.3</generator>
		<item>
		<title>Soundbites: the Facebook/MySQL/NoSQL/VoltDB/Stonebraker flap, continued</title>
		<link>http://www.dbms2.com/2011/07/15/facebook-mysql-nosql-voltdb-stonebraker/</link>
		<comments>http://www.dbms2.com/2011/07/15/facebook-mysql-nosql-voltdb-stonebraker/#comments</comments>
		<pubDate>Fri, 15 Jul 2011 08:27:18 +0000</pubDate>
		<dc:creator>Curt Monash</dc:creator>
				<category><![CDATA[Akiban]]></category>
		<category><![CDATA[Cache]]></category>
		<category><![CDATA[Cassandra]]></category>
		<category><![CDATA[Clustrix]]></category>
		<category><![CDATA[Couchbase]]></category>
		<category><![CDATA[Data models and architecture]]></category>
		<category><![CDATA[Database diversity]]></category>
		<category><![CDATA[Facebook]]></category>
		<category><![CDATA[HBase]]></category>
		<category><![CDATA[In-memory DBMS]]></category>
		<category><![CDATA[Michael Stonebraker]]></category>
		<category><![CDATA[MongoDB and 10gen]]></category>
		<category><![CDATA[NoSQL]]></category>
		<category><![CDATA[Open source]]></category>
		<category><![CDATA[ScaleBase]]></category>
		<category><![CDATA[ScaleDB]]></category>
		<category><![CDATA[Schooner Information Technology]]></category>
		<category><![CDATA[Software as a Service (SaaS)]]></category>
		<category><![CDATA[Tokutek]]></category>
		<category><![CDATA[VoltDB and H-Store]]></category>
		<category><![CDATA[dbShards and CodeFutures]]></category>
		<category><![CDATA[memcached]]></category>

		<guid isPermaLink="false">http://www.dbms2.com/?p=4977</guid>
		<description><![CDATA[As a follow-up to the latest Stonebraker kerfuffle, Derrick Harris asked me a bunch of smart followup questions. My responses and afterthoughts include: Facebook et al. are in effect Software as a Service (SaaS) vendors, not enterprise technology users. In particular: They have the technical chops to rewrite their code as  needed. Unlike packaged software [...]]]></description>
			<content:encoded><![CDATA[<p>As a follow-up to the latest <a href="http://www.dbms2.com/2011/07/14/an-odd-claim-attributed-to-mike-stonebraker/">Stonebraker kerfuffle</a>, Derrick Harris asked me a bunch of smart followup questions. My responses and afterthoughts include:</p>
<ul>
<li>Facebook et al. are in effect Software as a Service (SaaS) vendors, not enterprise technology users. In particular:
<ul>
<li>They have the technical chops to rewrite their code as  needed.</li>
<li>Unlike packaged software vendors, they&#8217;re not answerable to anybody for keeping legacy code alive after a rewrite. That makes migration a lot easier.</li>
<li>If they want to write different parts of their system on different technical underpinnings, nobody can stop them. For example &#8230;</li>
<li>&#8230;  <a href="http://www.dbms2.com/2008/07/21/project-cassandra-facebook-open-sourced-quasi-dbms/">Facebook innovated Cassandra</a>, and is now heavily committed to HBase.</li>
</ul>
</li>
<li>It makes little sense to talk of Facebook&#8217;s use of &#8220;MySQL.&#8221; Better to talk of Facebook&#8217;s use of &#8220;MySQL +  memcached  + non-transparent sharding.&#8221; That said:
<ul>
<li>It&#8217;s hard to see why somebody today would use MySQL +  memcached  + non-transparent sharding for a new project. At least one of <a href="http://www.dbms2.com/2011/02/08/couchbase-membase-couchone-couchdb/">Couchbase</a> or <a href="http://www.dbms2.com/2011/02/24/transparent-sharding/">transparently-sharded</a> MySQL is very likely a superior alternative. Other alternatives might be better yet.</li>
<li>As noted above in the example of Facebook, the many major web businesses that are using MySQL +  memcached  + non-transparent sharding for existing projects can be presumed able to migrate away from that stack as the need arises.</li>
</ul>
</li>
</ul>
<p>Continuing with that discussion of DBMS alternatives:</p>
<ul>
<li>If you just want to write to the memcached API anyway, why not go with Couchbase?</li>
<li>If you want to go relational, why not go with MySQL? There are many alternatives for scaling or accelerating MySQL &#8212; dbShards, Schooner, Akiban, Tokutek, ScaleBase, ScaleDB, Clustrix, and Xeround come to mind quickly, so there&#8217;s a great chance that one or more will fit your use case. (And if you don&#8217;t get the choice of MySQL flavor right the first time, porting to another one shouldn&#8217;t be all THAT awful.)</li>
<li>If you really, really want to go in-memory, and don&#8217;t mind writing Java stored procedures, and don&#8217;t need to do the kinds of joins it isn&#8217;t good at, but do need to do the kinds of joins it is, VoltDB could indeed be a good alternative.</li>
</ul>
<p>And while we&#8217;re at it &#8212; going <strong>schema-free</strong> often makes a whole lot of sense. I need to write much more about the point, but for now let&#8217;s just say that I look favorably on the Big Four schema-free/NoSQL options of MongoDB, Couchbase, HBase, and Cassandra.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.dbms2.com/2011/07/15/facebook-mysql-nosql-voltdb-stonebraker/feed/</wfw:commentRss>
		<slash:comments>19</slash:comments>
		</item>
		<item>
		<title>Notes on short-request scale-out MySQL</title>
		<link>http://www.dbms2.com/2011/04/19/notes-on-short-request-scale-out-mysql/</link>
		<comments>http://www.dbms2.com/2011/04/19/notes-on-short-request-scale-out-mysql/#comments</comments>
		<pubDate>Tue, 19 Apr 2011 09:52:28 +0000</pubDate>
		<dc:creator>Curt Monash</dc:creator>
				<category><![CDATA[Akiban]]></category>
		<category><![CDATA[Investment research and trading]]></category>
		<category><![CDATA[Kaminario]]></category>
		<category><![CDATA[MySQL]]></category>
		<category><![CDATA[Oracle]]></category>
		<category><![CDATA[ScaleBase]]></category>
		<category><![CDATA[ScaleDB]]></category>
		<category><![CDATA[Schooner Information Technology]]></category>
		<category><![CDATA[Solid-state memory]]></category>
		<category><![CDATA[Tokutek]]></category>
		<category><![CDATA[Web analytics]]></category>
		<category><![CDATA[dbShards and CodeFutures]]></category>

		<guid isPermaLink="false">http://www.dbms2.com/?p=4329</guid>
		<description><![CDATA[A press person recently asked about: &#8230; start-ups that are building technologies to enable MySQL and other SQL databases to get over some of the problems they have in scaling past a certain size. &#8230; I’d like to get a sense as to whether or not the problems are as severe and wide spread as [...]]]></description>
			<content:encoded><![CDATA[<p>A press person recently asked about:</p>
<blockquote><p>&#8230; start-ups that are building technologies to enable MySQL and other SQL databases to get over some of the problems they have in scaling past a certain size. &#8230; I’d like to get a sense as to whether or not the problems are as severe and wide spread as these companies are telling me? If so, why wouldn’t a customer just move to a new database?</p></blockquote>
<p>While that sounds as if he was asking about scale-out relational DBMS in general, MySQL or otherwise, <a href="http://www.dbms2.com/2011/03/30/short-request-and-analytic-processing/">short-request or analytic</a>, it turned out that he was asking just about <strong>short-request scale-out MySQL.</strong> My thoughts and comments on that narrower subject include(d) but are not limited to:  <span id="more-4329"></span></p>
<ul>
<li>The biggest web companies had to go to non-<a href="http://www.dbms2.com/2011/02/24/transparent-sharding/">transparently sharded</a> MySQL years ago. The NoSQL movement is, in no small part, <a href="http://www.dbms2.com/2010/03/02/cassandra-nosql-scalable-oltp/">an attempt to improve upon that</a>. Ditto for scale-out short-request MySQL.</li>
<li>Some overlapping categories of companies or projects who need scale-out short-request database processing are:
<ul>
<li>The aforementioned big companies who have other applications they haven&#8217;t hand-sharded yet.</li>
<li>Other web companies whose applications are getting that big.</li>
<li>Conventional enterprises whose web efforts happen to be very big.</li>
<li>Sensor networks and other massive sources of <a href="http://www.dbms2.com/2010/12/30/examples-and-definition-of-machine-generated-data/">machine-generated data</a>.</li>
<li>Certain specialized areas (e.g., financial trading).</li>
</ul>
</li>
</ul>
<ul>
<li>Relatively few of these applications are totally impossible to do in Oracle. But the Oracle approach might be very expensive.</li>
<li>In particular, there&#8217;s a break point when companies &#8212; often SaaS vendors &#8212; <a href="http://www.dbms2.com/2011/04/01/the-client-that-was-confused-about-security/">outgrow Oracle Standard Edition</a>.</li>
<li>Yes, the alternatives usually are one of MySQL or Oracle.</li>
<li>InnoDB isn&#8217;t an alternative to these newer technologies; it&#8217;s just a piece of the puzzle and indeed of default MySQL now. Several of them &#8212; e.g. dbShards &#8212; are meant to be used in conjunction with InnoDB.</li>
<li>Merging his list and mine, the high-performance/scale-out MySQL alternatives look like <a href="http://www.dbms2.com/2011/01/25/dbshards-update/">dbShards</a>, <a href="http://www.dbms2.com/2011/01/28/schooner-software-onl/">Schooner</a>, <a href="http://www.dbms2.com/2011/01/25/scalebase-another-mpp-oltp-quasi-dbms/">ScaleBase</a>, <a href="http://www.dbms2.com/2008/04/13/scaledb-presents-the-revenge-of-the-pointer/">ScaleDB</a>, <a href="http://www.dbms2.com/2009/04/16/introduction-to-tokutek/">Tokutek</a>, <a href="http://www.dbms2.com/2010/04/03/akiban-highlights/">Akiban</a>, Xeround, and <a href="http://www.dbms2.com/2010/05/12/the-clustrix-story/">Clustrix</a>. The first two are to my knowledge more proven than the rest.</li>
<li>Proprietary hardware and the associated hardware/appliance pricing aren&#8217;t very appealing for these applications. That speaks against Oracle Exadata and Clustrix, and is the reason Schooner switched to a software-only strategy despite some initial appliance sales.</li>
<li>However, hardware band-aids such as solid-state drives or even <a href="http://www.dbms2.com/2010/10/19/introduction-to-kaminario/">RAM-based solid-state storage</a> could make more sense:
<ul>
<li>If, for performance, you&#8217;ve scaling out your database so that it fits in RAM on each box, you don&#8217;t really have a disk-based architecture anyway, now do you?</li>
<li>Even if you&#8217;re not doing that yet &#8212; if your problem is throughput rather than storage capacity, silicon-based storage could be a big help.</li>
<li>In principle, devices of that kind can be moved from one application to another, after the first one is rearchitected not to need them. (In practice, however, I don&#8217;t know of anybody who is doing that. I also don&#8217;t believe that Kaminario et al. are marketing that kind of idea, more&#8217;s the pity.)</li>
</ul>
</li>
<li>My notes on all this from <a href="http://www.dbms2.com/2010/04/05/oltp-database-management-systems-2/">April, 2010</a> are already badly outdated, but may be interesting anyway.</li>
</ul>
]]></content:encoded>
			<wfw:commentRss>http://www.dbms2.com/2011/04/19/notes-on-short-request-scale-out-mysql/feed/</wfw:commentRss>
		<slash:comments>5</slash:comments>
		</item>
		<item>
		<title>More on NoSQL and HVSP (or OLRP)</title>
		<link>http://www.dbms2.com/2010/08/26/nosql-hvsp-olrp/</link>
		<comments>http://www.dbms2.com/2010/08/26/nosql-hvsp-olrp/#comments</comments>
		<pubDate>Thu, 26 Aug 2010 09:10:31 +0000</pubDate>
		<dc:creator>Curt Monash</dc:creator>
				<category><![CDATA[Akiban]]></category>
		<category><![CDATA[Basho and Riak]]></category>
		<category><![CDATA[Cache]]></category>
		<category><![CDATA[Cassandra]]></category>
		<category><![CDATA[Cloudera]]></category>
		<category><![CDATA[Clustrix]]></category>
		<category><![CDATA[CouchDB]]></category>
		<category><![CDATA[DataStax]]></category>
		<category><![CDATA[Facebook]]></category>
		<category><![CDATA[HBase]]></category>
		<category><![CDATA[Hadoop]]></category>
		<category><![CDATA[MySQL]]></category>
		<category><![CDATA[NoSQL]]></category>
		<category><![CDATA[OLTP]]></category>
		<category><![CDATA[Object]]></category>
		<category><![CDATA[Open source]]></category>
		<category><![CDATA[Parallelization]]></category>
		<category><![CDATA[Schooner Information Technology]]></category>
		<category><![CDATA[Theory and architecture]]></category>
		<category><![CDATA[Tokutek]]></category>
		<category><![CDATA[memcached]]></category>

		<guid isPermaLink="false">http://www.dbms2.com/?p=2907</guid>
		<description><![CDATA[Since posting last Wednesday morning that I&#8217;m looking into NoSQL and HVSP, I&#8217;ve had a lot of conversations, including with (among others): Dwight Merriman of 10gen (MongoDB) Damien Katz of Couchio (CouchDB) Matt Pfeil of Riptano (Cassandra) Todd Lipcon of Cloudera (HBase committer) Tony Falco of Basho (Riak) John Busch of Schooner Ori Herrnstadt of [...]]]></description>
			<content:encoded><![CDATA[<p>Since posting last Wednesday morning that <a href="http://www.dbms2.com/2010/08/18/nosql-hvsp-adoption/">I&#8217;m looking into NoSQL and HVSP</a>, I&#8217;ve had a lot of conversations, including with (among others):</p>
<ul>
<li>Dwight Merriman of 10gen (MongoDB)</li>
<li>Damien Katz of Couchio (CouchDB)</li>
<li>Matt Pfeil of <a href="http://www.dbms2.com/2010/07/06/riptano-and-cassandra-adoption/">Riptano</a> (Cassandra)</li>
<li>Todd Lipcon of Cloudera (HBase committer)</li>
<li>Tony Falco of Basho (Riak)</li>
<li>John Busch of Schooner</li>
<li><strong><span style="font-weight: normal;">Ori Herrnstadt</span></strong> of <a href="http://www.dbms2.com/2010/04/03/akiban-highlights/">Akiban</a></li>
</ul>
<p><span id="more-2907"></span>By no means do I have time to do these conversations justice, in terms of giving them the write-ups and/or immediate follow-up that they deserve. Indeed, I&#8217;ll leave for vacation Saturday morning with my 2000-word NoSQL article still unwritten. So I&#8217;ll dump as many observations as I can into one or a few posts now, and play catch-up later as circumstances allow.</p>
<p>In no particular order:</p>
<ul>
<li>A number of NoSQL offerings have had more uptake to date than most of the scale-out SQL offerings have.</li>
<li>&#8220;Document-oriented&#8221; NoSQL projects CouchDB and MongoDB have probably had the most users get into production, but perhaps for pretty small systems.</li>
<li>Cassandra and Hbase &#8212; the column-group-architecture guys &#8212; have probably had the most bang-in-lots-of-writes <a href="http://www.dbms2.com/2010/03/13/the-naming-of-the-foo/">HVSP</a> production uptake.*</li>
<li>I didn&#8217;t talk customer count with Schooner, but the decently-stocked <a href="http://www.schoonerinfotech.com/customers">Schooner customer page</a> suggests Schooner may be something of an exception to these generalities.</li>
<li>A lot of these companies are in the low-to-mid-teens of employees.</li>
<li>The SQL-oriented companies, despite having fewer or no customers, often seem to have more money. (One reason I get the impression SQL guys have more money is, frankly, that more  of them are talking about engaging <a href="http://www.monash.com/advantage.html">my services</a>.)
<ul>
<li>Schooner cites $20 million in VC.</li>
<li><a href="http://www.dbms2.com/2010/05/12/the-clustrix-story/">Clustrix</a> cites a figure close to that.</li>
<li>Basho cites $10 million, plus <a href="http://www.masshightech.com/stories/2010/08/02/daily35-Basho-rejects-VC-takes-late-friends-and-family-round.html">a new round of $1.5 or $2 or $2.5 million</a>. The new round is at a lowered valuation.</li>
<li>That same site says <a href="http://www.dbms2.com/2009/04/16/introduction-to-tokutek/">Tokutek</a> finally was able to<a href="http://www.masshightech.com/stories/2010/08/16/daily47-Database-software-firm-Tokutek-lands-28M.html"> raise some VC</a>. Congrats!</li>
</ul>
</li>
<li>It&#8217;s only a two-company trend, but I was pleased to hear that both 10gen/MongoDB and Akiban were seeing Drupal as a major use case or potential use case. No word on rescuing WordPress from its MySQL implementation, alas, but it seems that a Drupal site typically has 40-200+ tables, while a WordPress one has 10ish.</li>
<li>Another trend I think I&#8217;m seeing is serious object-oriented apps banging things straight into a simple back end. <a href="http://www.dbms2.com/2010/08/22/workday-stan-swete-database-architecture/">Workday</a> is a huge example of that. Akiban hopes to do something similar with Hibernate.</li>
<li>Stability and maturity are still issues for many of these products. E.g., HBase isn&#8217;t even in Release 1.0 yet. Ditto Cassandra, and surely many of the others. Unsurprisingly, <a href="http://blog.mikiobraun.de/2010/08/cassandra-gc-tuning.html">making Cassandra stable is still a challenge</a>.</li>
</ul>
<p><em>*As is common for terms I suggest, the &#8220;HVSP&#8221; name is not getting any traction. What do you think of Marton Trencseni&#8217;s suggestion of <a href="http://www.dbms2.com/2010/03/13/the-naming-of-the-foo/#comment-182138">OLRP, for OnLine Request Processing</a>?</em></p>
<p>One thing that makes following this area interesting is that so many projects are open source, leading there to be a lot of information in the wild. I hardly have time to read the mailing list for each project; but the people I talk with do, and often they may sorta kinda remember something somebody else posted one or several months back. As just one example, the mailing lists are said to confirm:</p>
<ul>
<li>Contrary to rumor, <a href="http://twitter.com/eventcloudpro/status/17872687577">Facebook hasn&#8217;t moved in-box search off of Cassandra</a>.</li>
<li>Apparently, however, it&#8217;s true that <a href="http://www.dbms2.com/2008/07/21/project-cassandra-facebook-open-sourced-quasi-dbms/">Cassandra inventor Facebook</a> has stopped working on Cassandra, and Facebook&#8217;s core Cassandra developers have shifted over to HBase.</li>
</ul>
<p>Also, figuring out usage of open source software can be &#8230; interesting.</p>
<ul>
<li> People who use open source software don&#8217;t have to reveal themselves, as there&#8217;s no purchase transaction to kick things off.</li>
<li>On the other hand, if they&#8217;re serious enough in their use, they often do.
<ul>
<li>There are two main ways to get tech support for open source software &#8212; the community or a company that sells support &#8212; and both ways let the main support-selling company know that one is a user.</li>
<li>Some folks even add themselves to open lists of users, for example these rather long lists for <a href="http://wiki.apache.org/hadoop/Hbase/PoweredBy">HBase</a> and <a href="http://wiki.apache.org/couchdb/CouchDB_in_the_wild">CouchDB</a>.</li>
<li>Or they show up at conferences. For example, <a href="http://twitter.com/spyced/status/21490457839">two</a> <a href="http://twitter.com/spyced/status/21675203015">tweets</a> from Riptano founder Jonathan Ellis suggest at least 30 production Cassandra users were represented at a recent event. That&#8217;s more detail than his colleague Matt Pfeil wanted to give me when talked. <img src='http://www.dbms2.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' /> </li>
</ul>
</li>
</ul>
<p>OK. This post has gotten pretty long, even without me saying anything resembling an overview of any of the seven companies I listed up top, or of their products&#8217; adoption. So I&#8217;ll just publish this now.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.dbms2.com/2010/08/26/nosql-hvsp-olrp/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
		<item>
		<title>Some NoSQL links</title>
		<link>http://www.dbms2.com/2010/03/12/some-nosql-links/</link>
		<comments>http://www.dbms2.com/2010/03/12/some-nosql-links/#comments</comments>
		<pubDate>Fri, 12 Mar 2010 23:51:42 +0000</pubDate>
		<dc:creator>Curt Monash</dc:creator>
				<category><![CDATA[Amazon and its cloud]]></category>
		<category><![CDATA[Cassandra]]></category>
		<category><![CDATA[Continuent]]></category>
		<category><![CDATA[Google]]></category>
		<category><![CDATA[MySQL]]></category>
		<category><![CDATA[NoSQL]]></category>
		<category><![CDATA[Open source]]></category>
		<category><![CDATA[RDF and graphs]]></category>
		<category><![CDATA[Tokutek]]></category>

		<guid isPermaLink="false">http://www.dbms2.com/?p=1692</guid>
		<description><![CDATA[I plan to post a few things soon about MongoDB, Cassandra, and NoSQL in general. So I&#8217;m poking around a bit reading stuff on the subjects. Here are some links I found. A little over a year ago, Julian Browne put up a great post on Eric Brewer&#8217;s CAP conjecture/theorem, which provides much of the [...]]]></description>
			<content:encoded><![CDATA[<p>I plan to post a few things soon about MongoDB, Cassandra, and NoSQL in general. So I&#8217;m poking around a bit reading stuff on the subjects. Here are some links I found.<span id="more-1692"></span></p>
<ul>
<li>A little over a year ago, Julian Browne put up a great post on <a href="http://www.julianbrowne.com/article/viewer/brewers-cap-theorem">Eric Brewer&#8217;s CAP conjecture/theorem</a>, which provides much of the impetus to relax the traditional requirement for atomicity/consistency.</li>
<li>Even more directly inspirational to NoSQL technology development were two seminal papers: Google&#8217;s on <a href="http://labs.google.com/papers/bigtable.html">BigTable</a> and Amazon&#8217;s on <a href="http://s3.amazonaws.com/AllThingsDistributed/sosp/amazon-dynamo-sosp2007.pdf">Dynamo</a>. (That said, I&#8217;m having trouble getting myself to actually read them from start to finish, especially since they&#8217;ve been superseded by subsequent technology development.)</li>
<li>10gen (the MongoDB guys) hosted a NoSQL conference yesterday. Much blogging has ensued. The best post I&#8217;ve seen so far was by <a href="http://blog.marcua.net/post/442594842/notes-from-nosql-live-boston-2010">Adam Marcus</a>. I find the graph database notes near the bottom particularly interesting.</li>
<li>Mark Callaghan hit back against the <a href="http://mysqlha.blogspot.com/2010/03/plays-well-with-others.html">NoSQL <span style="text-decoration: line-through;">movement</span> hype</a>, and in particular against the <a href="http://www.dbms2.com/2010/03/02/cassandra-nosql-scalable-oltp/">MySQL/memcached is passe</a>&#8216; meme. On the other hand, he also bemoaned many failings of MySQL. On the third hand, he praised or at least expressed hope for a variety of MySQL-related technologies, including <a href="http://www.dbms2.com/2009/04/16/introduction-to-tokutek/">Tokutek&#8217;s TokuDB</a> and <a href="http://www.dbms2.com/2009/09/03/continuent-on-clustering/">Continuent&#8217;s Tungsten</a>.</li>
<li>In connection with that debate, Mark Rendle offered a <a href="http://blog.markrendle.net/2010/03/do-you-need-relational-database.html">funny rant</a>, mainly pro-NoSQL, in the style of a Socratic dialogue.</li>
<li>John Quinn of Digg recently described <a href="http://www.stumbleupon.com/su/5099Ti/about.digg.com/node/564">Digg&#8217;s move from MySQL to Cassandra</a>, and outlined a lot of features Digg was adding to Cassandra, all of which it is open-sourcing.</li>
<li>The NoSQL guys maintain their own long <a href="http://nosql-database.org/links.html">list of NoSQL-related links</a>.</li>
</ul>
]]></content:encoded>
			<wfw:commentRss>http://www.dbms2.com/2010/03/12/some-nosql-links/feed/</wfw:commentRss>
		<slash:comments>5</slash:comments>
		</item>
		<item>
		<title>MySQL storage engine round-up, with Oracle-related thoughts</title>
		<link>http://www.dbms2.com/2009/04/20/mysql-storage-engine-round-up-with-oracle-related-thoughts/</link>
		<comments>http://www.dbms2.com/2009/04/20/mysql-storage-engine-round-up-with-oracle-related-thoughts/#comments</comments>
		<pubDate>Mon, 20 Apr 2009 13:39:39 +0000</pubDate>
		<dc:creator>Curt Monash</dc:creator>
				<category><![CDATA[Calpont]]></category>
		<category><![CDATA[Columnar database management]]></category>
		<category><![CDATA[Data warehousing]]></category>
		<category><![CDATA[Exadata]]></category>
		<category><![CDATA[Infobright]]></category>
		<category><![CDATA[Kickfire]]></category>
		<category><![CDATA[MySQL]]></category>
		<category><![CDATA[Open source]]></category>
		<category><![CDATA[Oracle]]></category>
		<category><![CDATA[Tokutek]]></category>

		<guid isPermaLink="false">http://www.dbms2.com/?p=757</guid>
		<description><![CDATA[Here&#8217;s what I know about MySQL storage engines, more or less. MySQL with MyISAM is fast. But it&#8217;s not transactional. Except for limited purposes, MySQL with MyISAM is a pretty crummy DBMS.  Nothing can change that. MySQL with InnoDB is transactional. But it&#8217;s not particularly fast. MySQL with InnoDB is a pretty mediocre DBMS.  Oracle [...]]]></description>
			<content:encoded><![CDATA[<p>Here&#8217;s what I know about MySQL storage engines, more or less.</p>
<ul>
<li>MySQL with MyISAM is fast. But it&#8217;s not transactional. Except for limited purposes, MySQL with MyISAM is a pretty crummy DBMS.  Nothing can change that.</li>
<li>MySQL with InnoDB is transactional. But it&#8217;s not particularly fast. MySQL with InnoDB is a pretty mediocre DBMS.  Oracle could fix that, at least partially, over time.</li>
<li>I don&#8217;t know much about Falcon, Maria, and so on. With Oracle winding up owning both MySQL and InnoDB, the motivation for those engines (except as Oracle-free forks) might fade.</li>
<li><a href="http://www.dbms2.com/2009/04/20/infobright-update-3/">Infobright</a> is the most established of the rest. At the moment I&#8217;m not recommending it for most industrial-strength uses unless the user is particularly cash-constrained. But I wouldn&#8217;t be surprised if that changed soon.  A cheap, fast, simple columnar analytic DBMS has a place in the world.</li>
<li><a href="http://www.dbms2.com/2009/03/25/kickfire-update/">Kickfire</a> is next in line, offering a hardware-based growth path for users who&#8217;ve maxed out on what unaided MySQL can do.  It remains to be seen for how many users the desire to keep things simple and stay with MySQL outweighs the desire to avoid custom hardware.  Having Oracle salespeople all over those accounts surely wouldn&#8217;t help. Kickfire also has a second market, namely OEM vendors who are mainly interested in the superfast chip. That would probably be pretty unaffected by Oracle.</li>
<li><a href="http://www.dbms2.com/2009/04/16/introduction-to-tokutek/">Tokutek</a> offers a technical proposition that&#8217;s hard to match head-on without going the CEP route.  Users who care are likely to be MySQL shops.  Tokutek&#8217;s main challenge is to prove that it sufficiently outdoes competing technical strategies for sufficiently many users. Oracle ownership of MySQL seems pretty irrelevant to Tokutek&#8217;s success or failure.</li>
<li><a href="http://www.dbms2.com/2009/04/20/calpont-update-you-read-it-here-first/">Calpont</a> offers a kind of lightweight Exadata alternative.  With Calpont&#8217;s packaging and positioning perennially unclear, it&#8217;s difficult to predict the effect of a particular change &#8212; i.e., Oracle buying MySQL &#8212; in Calpont&#8217;s market environment.</li>
<li>I haven&#8217;t heard from transactionally-oriented <a href="http://www.dbms2.com/2008/04/13/scaledb-presents-the-revenge-of-the-pointer/">ScaleDB</a> since I wrote about them a year ago.  Apparently, they&#8217;re rolling out beta product this week, and their venerable techie guru sadly passed away earlier this month.</li>
</ul>
]]></content:encoded>
			<wfw:commentRss>http://www.dbms2.com/2009/04/20/mysql-storage-engine-round-up-with-oracle-related-thoughts/feed/</wfw:commentRss>
		<slash:comments>13</slash:comments>
		</item>
		<item>
		<title>Introduction to Tokutek</title>
		<link>http://www.dbms2.com/2009/04/16/introduction-to-tokutek/</link>
		<comments>http://www.dbms2.com/2009/04/16/introduction-to-tokutek/#comments</comments>
		<pubDate>Thu, 16 Apr 2009 16:10:04 +0000</pubDate>
		<dc:creator>Curt Monash</dc:creator>
				<category><![CDATA[Data warehousing]]></category>
		<category><![CDATA[MySQL]]></category>
		<category><![CDATA[Tokutek]]></category>
		<category><![CDATA[Web analytics]]></category>

		<guid isPermaLink="false">http://www.dbms2.com/?p=752</guid>
		<description><![CDATA[Tokutek has a paradoxical pitch: Tokutek writes data particularly quickly, and therefore you&#8217;re supposed to buy Tokutek for query-oriented uses. Highlights of the Tokutek story include: Tokutek is a MySQL storage engine. MySQL/Tokutek writes indexed data a lot faster than B-tree-based alternatives. (The claim is 10s of 1000s of rows per second on a single [...]]]></description>
			<content:encoded><![CDATA[<p style="margin-bottom: 0in;">Tokutek has a paradoxical pitch:  <em>Tokutek writes data particularly quickly, and therefore you&#8217;re supposed to buy Tokutek for query-oriented uses. </em>Highlights of the Tokutek story include:</p>
<ul>
<li>Tokutek is a MySQL storage engine.</li>
<li>MySQL/Tokutek writes indexed data 	a lot faster than B-tree-based alternatives. (The claim is 10s of 	1000s of rows per second on a single server.)</li>
<li>MySQL/Tokutek reads data at B-tree 	speeds.  (But not, I presume, at the speed of specialized analytic 	DBMS.)</li>
<li>Tokutek is not yet ACID-compliant. 	 They&#8217;re working on that, but we don&#8217;t know what the performance 	implications will be when they achieve it.  ACID compliance won&#8217;t 	come as soon as the May release (Tokutek Version 2.0).</li>
<li>Tokutek has made one sale. Others 	are in the pipeline.</li>
</ul>
<p style="margin-bottom: 0in;">Tokutek&#8217;s initial target market is the usual combination of clickstream/personalization/other network management.  The idea is that many data warehouse technologies have trouble getting latency below, say, 15 seconds to 5 minutes, at least at very high update volumes. So if immediacy is more important than raw complex query performance, Tokutek&#8217;s performance profile could be attractive.<span id="more-752"></span></p>
<p style="margin-bottom: 0in;">Tokutek&#8217;s core technology is something it&#8217;s named &#8220;fractal tree&#8221; indexing. The &#8220;fractal&#8221; part of the name is based on a kind of &#8220;self-similar at different size ranges&#8221; flavor to the scheme.  But if I understood correctly, it doesn&#8217;t sound like much of a tree, in that it always goes root-branch-leaf, rather than ever going root-branch-branch-branch-leaf or whatever.</p>
<p style="margin-bottom: 0in;">The basic idea is that (to a first approximation) there&#8217;s one index block each of length 1, 2, 4, 8, and so on. With each insert, blocks get emptied and filled, so that each block is either completely empty or completely full. (And each block is in a sorted state as soon as it&#8217;s filled.)  This results in a huge amount of shuffling in RAM &#8212; where the shorter blocks live &#8212; but disk only gets touched for occasional orderly pre-sorted merges.  Tricks are obviously needed to:</p>
<ul>
<li>Make Tokutek lookup speed be order 	(log(N)) rather than order (log^2(N)) &#8212; that&#8217;s done by salting each 	block with information about where values lie in the next one.</li>
<li>Make Tokutek disk merges operate in 	background.</li>
<li>Dump data quickly when it&#8217;s time 	to delete it &#8212; I didn&#8217;t ask how Tokutek does that.</li>
</ul>
<p style="margin-bottom: 0in;">I have a plane to catch, so I&#8217;ll just post this much and stop.  More later.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.dbms2.com/2009/04/16/introduction-to-tokutek/feed/</wfw:commentRss>
		<slash:comments>11</slash:comments>
		</item>
	</channel>
</rss>

