<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>DBMS2 -- DataBase Management System Services &#187; DBMS product categories</title>
	<atom:link href="http://www.dbms2.com/category/database-management-system/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.dbms2.com</link>
	<description>Choices in data management and analysis</description>
	<lastBuildDate>Fri, 30 Jul 2010 15:51:32 +0000</lastBuildDate>
	<generator>http://wordpress.org/?v=2.8.4</generator>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
			<item>
		<title>Advice for some non-clients</title>
		<link>http://www.dbms2.com/2010/07/30/advice-for-some-non-clients/</link>
		<comments>http://www.dbms2.com/2010/07/30/advice-for-some-non-clients/#comments</comments>
		<pubDate>Fri, 30 Jul 2010 14:35:52 +0000</pubDate>
		<dc:creator>Curt Monash</dc:creator>
				<category><![CDATA[Business intelligence]]></category>
		<category><![CDATA[Data warehouse appliances]]></category>
		<category><![CDATA[Data warehousing]]></category>
		<category><![CDATA[Exadata]]></category>
		<category><![CDATA[HP and Neoview]]></category>
		<category><![CDATA[Information Builders]]></category>
		<category><![CDATA[Ingres]]></category>
		<category><![CDATA[Kalido]]></category>
		<category><![CDATA[Mark Logic]]></category>
		<category><![CDATA[NoSQL]]></category>
		<category><![CDATA[Objectivity and Infinite Graph]]></category>
		<category><![CDATA[Oracle]]></category>
		<category><![CDATA[RDF and graphs]]></category>
		<category><![CDATA[SenSage]]></category>
		<category><![CDATA[Structured documents]]></category>
		<category><![CDATA[Tableau Software]]></category>

		<guid isPermaLink="false">http://www.dbms2.com/?p=2699</guid>
		<description><![CDATA[Most of what I get paid for is in some form or other consulting. (The same would be true for many other analysts.) And so I can be a bit stingy with my advice toward non-clients. But my non-clients are a distinguished and powerful group, including in their number Oracle, IBM, Microsoft, and most of [...]]]></description>
			<content:encoded><![CDATA[<p>Most of what I get paid for is in some form or other consulting. (<a href="http://www.strategicmessaging.com/blurring-analyst-consultant-line/2010/07/28/" onclick="javascript:pageTracker._trackPageview('/www.strategicmessaging.com');">The same would be true for many other analysts</a>.) And so I can be a bit stingy with my advice toward non-clients. But my non-clients are a distinguished and powerful group, including in their number Oracle, IBM, Microsoft, and most of the BI vendors. So here&#8217;s a bit of advice for them too.</p>
<p><strong>Oracle. </strong>On the plus side, you guys have been making progress against your reputation for untruthfulness. Oh, I&#8217;ve dinged you for some <a href="http://www.dbms2.com/2008/09/30/oracle-crosses-the-line-on-integrity/" >past</a> <a href="http://www.dbms2.com/2008/06/28/response-to-rita-sallam-of-oracle/" >slip-ups</a>, but on the whole they&#8217;ve been no worse than other vendors.&#8217; But recently you pulled a doozy. The <a href="http://www.oracle.com/us/corporate/analystreports/infrastructure/index.html" onclick="javascript:pageTracker._trackPageview('/www.oracle.com');">analyst reports</a> section of your website fails to distinguish between unsponsored and sponsored work.* That is a horrible ethical stumble. Fix it fast. Then put processes in place to ensure nothing that dishonest happens again for a good long time.</p>
<p><em>*Merv Adrian&#8217;s &#8220;report&#8221; listed high on that page is actually a sponsored white paper. That Merv himself screwed up by not labeling it clearly as such in no way exonerates Oracle. Besides, I&#8217;m sure Merv won&#8217;t soon repeat the error &#8212; but for Oracle, this represents a whole pattern of behavior.</em></p>
<p><strong>Oracle.</strong> And while I&#8217;m at it, outright dishonesty isn&#8217;t your only unnecessary credibility problem. <a href="http://www.strategicmessaging.com/so-what-is-an-analyst-anyway/2010/07/25/" onclick="javascript:pageTracker._trackPageview('/www.strategicmessaging.com');">You&#8217;re also playing too many games in analyst relations</a>.</p>
<p><strong>HP.</strong> Neoview will never succeed. Admit it to yourselves. Go buy something that can.  <span id="more-2699"></span></p>
<p><strong>Smaller BI vendors.</strong> Analytic DBMS evaluations commonly include BI strategy and tool selection as well. If an analytic DBMS expert tells you he needs to learn more about your product line, don&#8217;t blow him off. In fact, you should be particularly embracing anybody who&#8217;s shown a fondness for small DBMS vendors; maybe he or his clients will like small BI vendors as well. That means (among others) you, <strong>Jaspersoft, Endeca, </strong>and <strong>Tableau.</strong></p>
<p><strong>Information Builders. </strong>Is there anything about your BI products that is in any way technologically differentiated? If so, you might want to mention some examples to somebody some time.</p>
<p><strong>Kalido.</strong> I&#8217;ve said this to you before, but it bears repeating &#8212; your positioning translates to &#8220;I-CASE for analytics,&#8221; and that&#8217;s not a good thing. If your product is not as cumbersome and entrapping as that sounds, you need to do a much better job of explaining why not.</p>
<p><strong>SenSage.</strong> You are what you are. Sell out while the selling is good. You don&#8217;t have the corporate personality to make it into the analytic DBMS mainstream on your own.</p>
<p><strong>Ingres. </strong>You need to be more engaged with analysts than you are. <a href="http://www.softwarememories.com/2010/07/25/ingres-history/" onclick="javascript:pageTracker._trackPageview('/www.softwarememories.com');">Ingres navel-gazed too much 25 years ago</a>, and evidently you haven&#8217;t outgrown it yet.</p>
<p><strong>TIBCO.</strong> You probably have a lot of cool analytic technology, but I don&#8217;t know of an influencer who has much relationship with or trust in you. Rethink how you&#8217;re approaching influencer relations top to bottom.</p>
<p><strong>Tableau.</strong> You had a lot of mindshare, but it&#8217;s fading. Do something.</p>
<p><strong>MarkLogic, graph DBMS vendors, etc.</strong> You&#8217;re clinging too hard to the NoSQL label. Nobody is out there deciding among Cassandra, neo4j, and MarkLogic. They might be deciding between MongoDB and MarkLogic, I guess, but if you admit to yourself that&#8217;s all it is you&#8217;ll probably change your messaging somewhat.</p>
<p><strong>Objectivity.</strong> Get real about marketing. Infinite Graph is a cool opportunity. But I didn&#8217;t even ping you for a meeting when I&#8217;m in your area next week, because I wouldn&#8217;t have known who to reach out to.</p>
<p><strong>Everybody (especially Objectivity).</strong> &#8220;First X deployed in the cloud&#8221; is almost surely an inaccurate claim. Don&#8217;t make it. And by the way, even if it were true, it probably wouldn&#8217;t be interesting.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.dbms2.com/2010/07/30/advice-for-some-non-clients/feed/</wfw:commentRss>
		<slash:comments>4</slash:comments>
		</item>
		<item>
		<title>How should somebody teach themselves database and programming skills?</title>
		<link>http://www.dbms2.com/2010/07/29/how-should-somebody-teach-themselves-programming-skills/</link>
		<comments>http://www.dbms2.com/2010/07/29/how-should-somebody-teach-themselves-programming-skills/#comments</comments>
		<pubDate>Thu, 29 Jul 2010 07:36:57 +0000</pubDate>
		<dc:creator>Curt Monash</dc:creator>
				<category><![CDATA[Business intelligence]]></category>
		<category><![CDATA[Microstrategy]]></category>
		<category><![CDATA[MySQL]]></category>
		<category><![CDATA[Open source]]></category>

		<guid isPermaLink="false">http://www.dbms2.com/?p=2677</guid>
		<description><![CDATA[From time to time,  I get in a conversation with somebody who is:

Unemployed, underemployed, or otherwise desirous of having more commercial skills.
Not a programmer, but desirous of having some technical skills.
Astute enough to realize s/he will never be a serious techie.

I generally have two models in mind when guiding such a person:

Analytics/business intelligence/stats.
Website building.

Those are [...]]]></description>
			<content:encoded><![CDATA[<p>From time to time,  I get in a conversation with somebody who is:</p>
<ul>
<li>Unemployed, underemployed, or otherwise desirous of having more commercial skills.</li>
<li>Not a programmer, but desirous of having some technical skills.</li>
<li>Astute enough to realize s/he will never be a serious techie.</li>
</ul>
<p>I generally have two models in mind when guiding such a person:</p>
<ul>
<li>Analytics/business intelligence/stats.</li>
<li>Website building.</li>
</ul>
<p>Those are both useful skill sets for people who aren&#8217;t full-time techies, the first perhaps best for those who are more quantitative and big-company-friendly, the second perhaps better for the creative and/or rebellious types.</p>
<p>So what SPECIFICALLY should one guide them to do? My initial thoughts include:  <span id="more-2677"></span></p>
<ul>
<li>Learning Java is overkill for most of these people.</li>
<li>Learning C++ is overkill for ALL of these people. If you&#8217;re not out to be a hardcore engineer, the &#8220;advantages&#8221; of C++ over Java are pointless.</li>
<li>They all should learn some SQL.</li>
<li>MySQL is the most accessible DBMS against which to learn SQL. They should download a (free) copy and install it on their PC.</li>
<li>But I have no idea which books or websites they should go to to learn about SQL.</li>
<li>While at first blush it sounds like overkill, downloading and installing the free version of Microstrategy 9 is a good way to learn about BI and also the analytic side of SQL.</li>
<li>The first thing you learn in an app dev tool used to be and probably still is how to do a master-detail form. That would cover the other side of learning SQL. But what would be a good choice of tool? (Preferably free, as building serious OLTP apps is probably not what these people will want to do.)</li>
<li>One idea I had is that the website-oriented ones should learn how to modify WordPress, by which I really mean modifying WordPress themes. That would involve learning PHP, SQL, and HTML/CSS, which seems like a great place to start.</li>
<li>But I have no idea which books or websites they should go to to learn  about PHP.</li>
<li>I also have no idea which books or websites they should go to to learn  about CSS &#8212; or for that matter even basic HTML.</li>
<li>If they want to take the analytics route, I assume R is the way to go. Thoughts?</li>
<li>Python isn&#8217;t the ideal language for much of anything, but it&#8217;s an easily accessible &#8220;first language&#8221;. Umm, is that a good way to go, or would PHP be a better choice?</li>
<li>Any other ideas?</li>
</ul>
<p>For anybody who pitches in &#8212; thanks!! I hope to get enough useful answers so as to keep editing this post with people&#8217;s ideas.</p>
<p><em>Edit:</em> Suggestions have started to come in on Twitter. A couple of folks are saying that HTML is a good place to start. Hard to argue with that, although it&#8217;s hardly where one should finish. There also was <a href="http://twitter.com/labsji/status/19813118791" onclick="javascript:pageTracker._trackPageview('/twitter.com');">a vote for Yahoo YQL</a>, and of course for a vendor&#8217;s own product.</p>
<p>Some great points are in the comments below, including the idea that you should pick an actual, fun, small project to build to get you started. (A site built in WordPress or Mambo would be a pretty obvious choice for such a project, come to think of it.)</p>
]]></content:encoded>
			<wfw:commentRss>http://www.dbms2.com/2010/07/29/how-should-somebody-teach-themselves-programming-skills/feed/</wfw:commentRss>
		<slash:comments>21</slash:comments>
		</item>
		<item>
		<title>dbShards &#8212; a lot like an MPP OLTP DBMS based on MySQL or PostgreSQL</title>
		<link>http://www.dbms2.com/2010/07/28/dbshards/</link>
		<comments>http://www.dbms2.com/2010/07/28/dbshards/#comments</comments>
		<pubDate>Wed, 28 Jul 2010 09:39:11 +0000</pubDate>
		<dc:creator>Curt Monash</dc:creator>
				<category><![CDATA[Code Futures and dbShards]]></category>
		<category><![CDATA[Facebook]]></category>
		<category><![CDATA[MySQL]]></category>
		<category><![CDATA[OLTP]]></category>
		<category><![CDATA[Parallelization]]></category>
		<category><![CDATA[PostgreSQL]]></category>

		<guid isPermaLink="false">http://www.dbms2.com/?p=2662</guid>
		<description><![CDATA[I talked yesterday w/ Cory Isaacson, who runs CodeFutures, makers of dbShards.  dbShards is a software layer that turns an ordinary DBMS (currently MySQL or PostgreSQL) into an MPP shared-nothing ACID-compliant OLTP DBMS. Technical highlights included:  

Despite heavy emphasis on the 	word “sharding,” dbShards&#8217;s scale-out is transparent to the 	application programmer. E.g., in dbShards [...]]]></description>
			<content:encoded><![CDATA[<p style="margin-bottom: 0in;">I talked yesterday w/ Cory Isaacson, who runs CodeFutures, makers of dbShards.  dbShards is a software layer that turns an ordinary DBMS (currently MySQL or PostgreSQL) into an MPP shared-nothing ACID-compliant OLTP DBMS. Technical highlights included:  <span id="more-2662"></span></p>
<ul>
<li>Despite heavy emphasis on the 	word “sharding,” dbShards&#8217;s scale-out is transparent to the 	application programmer. E.g., in dbShards + MySQL, the APIs are more 	or less the same ones you&#8217;d expect for MySQL (JDBC, etc.)</li>
<li>If the DBMS underneath is 	ACID-compliant (e.g., MySQL + InnoDB), then the dbShards version is 	ACID-compliant too.</li>
<li>Beyond those basics, I forgot to 	check the fine details of dbShards&#8217; MySQL (or PostgreSQL) syntax 	support. <a href="http://highscalability.com/blog/2010/6/23/product-dbshards-share-nothing-shard-everything.html" onclick="javascript:pageTracker._trackPageview('/highscalability.com');">Todd 	Hoff, however, did not forget</a>.</li>
<li>dbShards keeps copies of each 	shard on two different servers, via asynchronous log-shipping. This 	allows for failover in both planned and unplanned outages.</li>
<li>dbShards wants you to distribute 	big tables among shards via a “shard key,” which is a lot like 	the distribution key in MPP analytic DBMS. You&#8217;re encouraged to 	replicate small, low-update-volume tables across each shard.</li>
<li>Cory says that dbShards has good 	join performance when – you guessed it! – everything being joined 	is co-located shard-by-shard, because the tables were distributed on 	the same shard key and/or replicated across each shard. Cory can&#8217;t 	imagine why you&#8217;d want to do an inner join under any other 	circumstances.</li>
<li>The basic dbShards query execution 	model is: A query comes in; it&#8217;s parsed; a shard key is 	automagically detected (one hopes); the “global configuration 	file” is checked to see which shard to ship the work off too. I 	forgot to ask whether lookup was done via a hash table (the obvious 	guess) or something else. The programmer can put hints in the code 	comments to direct the sharding, but Cory asserts those aren&#8217;t 	needed very often.</li>
<li>Cory says that insert performance 	with dbShards + MySQL + InnoDB is 1500-3000 inserts per shard per 	second, scaling almost linearly with the number of shards. I forgot 	to ask how many shards this had been tested for.</li>
<li>If you want blazing dbShards 	performance, Cory&#8217;s base-case figure is 25 gigabytes of data per 	node, so that the most commonly used indexes can camp out in memory. 	(I forgot to ask what kind of hardware he was assuming per node.) 	This is if you&#8217;re going to be doing joins or aggregrations. If it&#8217;s 	just single-row inserts and updates, or if your performance 	requirements are lower, you can go with 10X that figure.</li>
<li>Cory tells stories wherein going 	from an unsharded database to 4 or so shards took database 	re-indexing time down 50X or more.  Apparently, such tasks can be 	exponential or even super-exponential with database size over 	InnoDB. (That said, I&#8217;d be surprised if all large InnoDB users 	suffered from that problem to the same degree.)</li>
<li>dbShards&#8217; customer workloads are 	all &gt;= 50% reads. This is reflective of dbShards&#8217; design 	priorities.</li>
<li>As long as it can be in charge, 	dbShards is happy to interface to whatever kind of database backup 	software you want to use on a node by node basis. (dbShards wants to 	drive your backup software for you so that it can be sure the 	replicas are handled properly.)</li>
<li>It&#8217;s “fairly common” for 	dbShards to be paired with memcached. I forgot to ask whether 	memcached typically lived on its own pool of servers, or on the same 	pool that runs dbShards.</li>
<li>Future DBMS options under 	consideration for dbShards include Oracle and (unspecified) 	in-memory.</li>
</ul>
<p style="margin-bottom: 0in;">Business highlights for CodeFutures and dbShards include:</p>
<ul>
<li>dbShards&#8217; price is 	$5000/server/year, including support and OEMed MySQL, with stated 	quantity discounts up to 40%.</li>
<li>dbShards cloud pricing is 	different (on a usage basis).</li>
<li>dbShards has 6 or so customers, 	half each on-premises and in the cloud. One of them is Facebook. (Those &#8220;100s&#8221; of customers mentioned on the dbShards website are for a fairly unrelated product.)</li>
<li>CodeFutures has been at this 2 ½ 	years or so. There is no venture capital in the company.</li>
<li>Early deals dbShards deals have 	evidently involved a fair amount of professional services.</li>
<li>Counting contractors, Code Futures 	has 10-12 people, which has been as high as 15.</li>
<li>Target dbShards customers are as 	you&#8217;d expect. Cory says he&#8217;s actually been more successful getting 	early-adopted money out of Web companies than Wall Street firms.</li>
<li>There are a couple of dbShards 	PostgreSQL customers for greenfield applications. Most dbShards 	customers and prospects, however, are looking to scale out existing 	apps.</li>
<li>Despite its connection to open source DBMS, there&#8217;s nothing open source about dbShards itself.</li>
</ul>
]]></content:encoded>
			<wfw:commentRss>http://www.dbms2.com/2010/07/28/dbshards/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>Kickfire unlikely to survive</title>
		<link>http://www.dbms2.com/2010/07/27/kickfire-unlikely-to-survive/</link>
		<comments>http://www.dbms2.com/2010/07/27/kickfire-unlikely-to-survive/#comments</comments>
		<pubDate>Tue, 27 Jul 2010 18:56:48 +0000</pubDate>
		<dc:creator>Curt Monash</dc:creator>
				<category><![CDATA[Data warehouse appliances]]></category>
		<category><![CDATA[Kickfire]]></category>

		<guid isPermaLink="false">http://www.dbms2.com/?p=2658</guid>
		<description><![CDATA[Following up on a previous report of Kickfire&#8217;s troubles &#8212; a Kickfire customer tipped me off that Kickfire told him they&#8217;re selling their IP and engineers, and the Kickfire products will be discontinued.
At this time, I have no idea who the lucky buyer is.
]]></description>
			<content:encoded><![CDATA[<p>Following up on a previous report of <a href="http://www.dbms2.com/2010/06/11/kickfire-update-2/" >Kickfire&#8217;s troubles</a> &#8212; a Kickfire customer tipped me off that Kickfire told him they&#8217;re selling their IP and engineers, and the Kickfire products will be discontinued.</p>
<p>At this time, I have no idea who the lucky buyer is.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.dbms2.com/2010/07/27/kickfire-unlikely-to-survive/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Some interesting links</title>
		<link>http://www.dbms2.com/2010/07/23/some-interesting-links/</link>
		<comments>http://www.dbms2.com/2010/07/23/some-interesting-links/#comments</comments>
		<pubDate>Fri, 23 Jul 2010 09:04:48 +0000</pubDate>
		<dc:creator>Curt Monash</dc:creator>
				<category><![CDATA[Business intelligence]]></category>
		<category><![CDATA[EnterpriseDB and Postgres Plus]]></category>
		<category><![CDATA[Fun stuff]]></category>
		<category><![CDATA[Hadoop]]></category>
		<category><![CDATA[Humor]]></category>
		<category><![CDATA[In-memory DBMS]]></category>
		<category><![CDATA[MapReduce]]></category>
		<category><![CDATA[Memory-centric data management]]></category>
		<category><![CDATA[Open source]]></category>
		<category><![CDATA[Oracle]]></category>
		<category><![CDATA[SAP AG]]></category>

		<guid isPermaLink="false">http://www.dbms2.com/?p=2626</guid>
		<description><![CDATA[In no particular order:  

Neil Raden points out that business intelligence dashboards can be dangerously misleading. His reasoning (sound) is that whatever you measure is apt to be distorted by the fact people know they&#8217;re being measured. His solution (implied) is to hire a good-looking consultant like himself to do it right.
I&#8217;ve had my issues [...]]]></description>
			<content:encoded><![CDATA[<p>In no particular order:  <span id="more-2626"></span></p>
<ul>
<li>Neil Raden points out that <a href="http://www.b-eye-network.com/channels/5083/view/9618/" onclick="javascript:pageTracker._trackPageview('/www.b-eye-network.com');">business intelligence dashboards can be dangerously misleading</a>. His reasoning (sound) is that whatever you measure is apt to be distorted by the fact people know they&#8217;re being measured. His solution (implied) is to hire a <a href="http://twitter.com/NeilRaden/status/19110492482" onclick="javascript:pageTracker._trackPageview('/twitter.com');">good-looking</a> consultant like himself to do it right.</li>
<li>I&#8217;ve had my issues with Fred Holahan, who was VP of Marketing when I posted that <a href="http://www.dbms2.com/2009/04/20/first-thoughts-on-oracle-acquiring-sun/" >EnterpriseDB was not to be trusted</a>. (That said, Fred is long gone from EnterpriseDB and my opinion hasn&#8217;t changed.) But he&#8217;s put up a good series of posts on the basis of the open source &#8220;progressive engagement&#8221; marketing funnel, including this gem on <a href="http://opensourceadvisory.com/wordpress/?p=860" onclick="javascript:pageTracker._trackPageview('/opensourceadvisory.com');">why you shouldn&#8217;t count on monetizing your community/free users</a>.</li>
<li><a href="http://tech.fortune.cnn.com/2010/07/22/oracle-plans-to-double-acquisition-budget/" onclick="javascript:pageTracker._trackPageview('/tech.fortune.cnn.com');">Oracle plans to increase its acquisition budget</a>. The figure given is $70 billion over the next 5 years. <em>Edit: But see this funny <a href="http://www.theregister.co.uk/2010/07/23/oracle_acquisition_budget/" onclick="javascript:pageTracker._trackPageview('/www.theregister.co.uk');">Register</a> followup.</em></li>
<li>Clayton Christensen wrote a phenomenal article on <a href="http://hbr.org/2010/07/how-will-you-measure-your-life/ar/1" onclick="javascript:pageTracker._trackPageview('/hbr.org');">how to live a good life</a>, from a very business-y perspective. (Only in one anecdote was it too religiously-oriented for my tastes.) Takeaways include:
<ul>
<li>Your core goals probably revolve around something other than business success. (E.g., family.) Don&#8217;t lose sight of that.</li>
<li>To the extent you&#8217;re a manager or leader, you may have a huge impact on other people&#8217;s lives. Use that power in admirable ways.</li>
<li>Teach people how to fish for answers, rather than just giving them answers. They&#8217;ll probably come to better conclusions than you would have anyway. (This is a core principle in my own consulting.)</li>
<li>Take time to reflect. And by the way, the same techniques you use for strategic analysis in business can be applied to your life as well.</li>
</ul>
</li>
<li><a href="http://www.bothsidesofthetable.com/2010/07/19/life-is-10-how-you-make-it-and-90-how-you-take-it/" onclick="javascript:pageTracker._trackPageview('/www.bothsidesofthetable.com');">Mark Suster</a> has a pretty good post expanding on my first Christensen takeaway, highlighting a point too often missing from articles in that genre: It&#8217;s not just family; it&#8217;s also all the cool things around us.</li>
<li>I haven&#8217;t gone through the <a href="http://developer.yahoo.com/events/hadoopsummit2010/agenda.html" onclick="javascript:pageTracker._trackPageview('/developer.yahoo.com');">Hadoop Summit archives</a> yet, but it looks as if there&#8217;s a lot of insight there about current Hadoop application activity.</li>
<li>If you&#8217;re a cat lover and don&#8217;t hate simple/traditional music, check out <a href="http://www.marcgunn.com/poetry/labels/cat_songs.shtml" onclick="javascript:pageTracker._trackPageview('/www.marcgunn.com');">Marc Gunn&#8217;s cat filksongs</a>, especially the infectious &#8220;What Shall We Do With a Catnipped Kitty?&#8221; and &#8220;Lord of the Pounce&#8221;, both playable from the right sidebar of that page (#7 and #10 respectively). Gunn is also a chief perpetrator of the justly (in)famous <a href="http://www.thebards.net/" onclick="javascript:pageTracker._trackPageview('/www.thebards.net');">Do Virgins Taste Better?</a> cycle of filksongs.</li>
<li>Former SAP exec Dennis Moore offers a theory as to <a href="http://dbmoore.blogspot.com/2010/05/why-is-in-memory-database-important-to.html" onclick="javascript:pageTracker._trackPageview('/dbmoore.blogspot.com');">why SAP cares so much about in-memory DBMS</a>. It&#8217;s to integrate business processes, because SAP has no other software layer good at doing same. Interestingly, Dennis originated SAP&#8217;s previous attempt at meeting a similar need via its composite applications initiative. However, in Dennis&#8217; view this benefit would only be achieved by a major rewrite of SAP&#8217;s applications.</li>
</ul>
]]></content:encoded>
			<wfw:commentRss>http://www.dbms2.com/2010/07/23/some-interesting-links/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Yet more on the GPL, WordPress themes, and the implications for MySQL storage engines</title>
		<link>http://www.dbms2.com/2010/07/23/gpl-wordpress-themes-mysql-storage-engines/</link>
		<comments>http://www.dbms2.com/2010/07/23/gpl-wordpress-themes-mysql-storage-engines/#comments</comments>
		<pubDate>Fri, 23 Jul 2010 04:53:56 +0000</pubDate>
		<dc:creator>Curt Monash</dc:creator>
				<category><![CDATA[MySQL]]></category>
		<category><![CDATA[Open source]]></category>

		<guid isPermaLink="false">http://www.dbms2.com/?p=2616</guid>
		<description><![CDATA[The debate I wrote about a few days ago over whether or not the WordPress theme called Thesis needed to be GPLed has been resolved in practice &#8211; it will be. More precisely, the parts that WordPress developers and the Free Software Foundation said need to be GPLed will be GPLed, while the rest won&#8217;t [...]]]></description>
			<content:encoded><![CDATA[<p>The debate I wrote about a few days ago over <a href="http://www.dbms2.com/2010/07/17/mysql-gpl-storage-engine-wordpress-theme/" >whether or not the WordPress theme called Thesis needed to be GPLed</a> has been resolved in practice &#8211; it will be. More precisely, <a href="http://thenextweb.com/socialmedia/2010/07/22/wordpress-vs-thesis-the-battle-is-over/" onclick="javascript:pageTracker._trackPageview('/thenextweb.com');">the parts that WordPress developers and the Free Software Foundation said need to be GPLed will be GPLed</a>, while the rest won&#8217;t be, those parts being, in essence, the more &#8220;artistic&#8221; elements.</p>
<p>A consensus seems to have emerged that Thesis had actually copied beyond-fair-use amounts of WordPress code, which if true was Game Over. Beyond that, however, both sides of the strongly-viral-GPL debate scored some points.  <span id="more-2616"></span></p>
<ul>
<li>Public pressure, FUD, etc. in favor of the GPL clearly were successful.</li>
<li>Mark Aquith carefully explained  <a href="http://markjaquith.wordpress.com/2010/07/17/why-wordpress-themes-are-derivative-of-wordpress/" onclick="javascript:pageTracker._trackPageview('/markjaquith.wordpress.com');">how tightly integrated WordPress and WordPress themes are</a>. So we don&#8217;t necessarily have much of a precedent for more hands-off integration.</li>
<li>Previous precedents were dredged up on both sides. More on that below.</li>
<li>Once again, <strong>the claims about the GPL were not tested in court.</strong> Various open source developers and lawyers have stated their opinions as to what their rights are, but the courts were not given the opportunity in this case to (dis)agree on <strong>the core issue</strong> &#8230;</li>
<li>&#8230; which everybody seems to be correctly agreeing is: <strong>&#8220;When is one software program a &#8216;derivative work&#8217; of another one, in the copyright-law sense of &#8216;derivative work&#8217;?&#8221;</strong></li>
</ul>
<p>The pro-GPL argument on that last point would probably boil down, colloquially, to &#8220;Well, is it one program or two? If it&#8217;s one, then it clearly is derivative from the big part that&#8217;s copyrighted under the GPL. If it&#8217;s two, then the creator of the second one is home free &#8212; but c&#8217;mon, now, it&#8217;s really just one.&#8221; That, in turn, would be supported by Dan Weinreb&#8217;s point from <a href="http://www.dbms2.com/2009/04/21/i-dont-see-why-the-gpl-would-be-a-major-barrier-to-a-useful-mysql-fork/" >a previous MySQL storage engine/GPL comment thread</a> (which also has a lot of applicability to the WordPress theme case) &#8211;<strong> you really don&#8217;t want to make the user do two installs, so you really do want to include the GPLed code in your package.</strong> And by the way &#8212; so far the product packaging by the MySQL storage engine vendors* is in line with Dan&#8217;s observation.</p>
<p><em>*Infobright, Akiban, Tokutek, Calpont, et al.</em></p>
<p>One point I haven&#8217;t seen discussed much yet is this:</p>
<p><strong>Suppose a MySQL storage engine vendor integrated with a forked, GPLed MySQL, and then didn&#8217;t obey the GPL. Who would have standing to sue them? </strong>It&#8217;s obvious that the developers of the forked MySQL would. But it&#8217;s not at all obvious that Oracle would. A derivative of a derivative of a copyrighted work is NOT necessarily a derivative of the original. (Think about it.) Unless Oracle could prove that the MySQL storage engine really did happen to be a derivative of MySQL Classic, I don&#8217;t know why Boogeyman Oracle would have standing to sue.</p>
<p>Finally, those precedents.</p>
<ul>
<li>A couple of comments on <a href="http://www.dbms2.com/2010/07/17/mysql-gpl-storage-engine-wordpress-theme/" >my earlier post</a> point out that the Linux community tends to be pretty tolerant of proprietary code that links tightly into Linux. Oh, they may not like &#8212; but in most cases they neither do sue nor believe they successfully can.</li>
<li>Wikipedia cites <a href="http://en.wikipedia.org/wiki/GPL#The_GPL_in_court" onclick="javascript:pageTracker._trackPageview('/en.wikipedia.org');">some cases in which the GPL has been successfully enforced</a>.</li>
<li>One of the earliest GPL controversies was over what sounds like a MySQL storage engine &#8212; <a href="http://www.nusphere.com/products/library/gemini.pdf" onclick="javascript:pageTracker._trackPageview('/www.nusphere.com');">Progress Software&#8217;s NuSphere</a>, developed in connection with MySQL AB. Litigation ensued, and before the case was settled, the judge wrote &#8220;<a href="http://pacer.mad.uscourts.gov/dc/opinions/saris/pdf/progress%20software.pdf" onclick="javascript:pageTracker._trackPageview('/pacer.mad.uscourts.gov');">After hearing, MySQL seems to have the better argument here, but the matter is one of fair dispute.</a>&#8220;</li>
</ul>
]]></content:encoded>
			<wfw:commentRss>http://www.dbms2.com/2010/07/23/gpl-wordpress-themes-mysql-storage-engines/feed/</wfw:commentRss>
		<slash:comments>5</slash:comments>
		</item>
		<item>
		<title>New insights into the GPL vs. MySQL storage engine debates</title>
		<link>http://www.dbms2.com/2010/07/17/mysql-gpl-storage-engine-wordpress-theme/</link>
		<comments>http://www.dbms2.com/2010/07/17/mysql-gpl-storage-engine-wordpress-theme/#comments</comments>
		<pubDate>Sun, 18 Jul 2010 02:12:45 +0000</pubDate>
		<dc:creator>Curt Monash</dc:creator>
				<category><![CDATA[MySQL]]></category>
		<category><![CDATA[Open source]]></category>
		<category><![CDATA[Oracle]]></category>

		<guid isPermaLink="false">http://www.dbms2.com/?p=2604</guid>
		<description><![CDATA[Around the time of Oracle&#8217;s acquisition of Sun and hence MySQL, there was a lot of discussion as to whether MySQL&#8217;s GPL license could inhibit MySQL storage engine vendors from selling their products without MySQL code (e.g., with MySQL-fork front-ends).  I argued No. Most people, however, seemed to think &#8220;Yes, and even if the matter [...]]]></description>
			<content:encoded><![CDATA[<p>Around the time of Oracle&#8217;s acquisition of Sun and hence MySQL, there was a lot of discussion as to whether MySQL&#8217;s GPL license could inhibit MySQL storage engine vendors from selling their products without MySQL code (e.g., with MySQL-fork front-ends).  I <a href="http://www.dbms2.com/2009/04/21/i-dont-see-why-the-gpl-would-be-a-major-barrier-to-a-useful-mysql-fork/" >argued</a> <a href="http://www.dbms2.com/2009/05/22/yet-more-on-mysql-forks-and-storage-engines/" >No</a>. Most people, however, seemed to think &#8220;Yes, and even if the matter isn&#8217;t clear, the threat of nasty lawyers creates enough FUD to be a practical market problem for the storage engine vendors.&#8221; Based on those concerns, I eventually took the position that <a href="http://www.dbms2.com/2009/09/10/what-could-or-should-make-oraclemysql-antitrust-concerns-go-away/" >Oracle should be inhibited for antitrust reasons from invoking its real or alleged GPL rights to mess with the MySQL storage engine vendors</a>. Oracle&#8217;s agreement with the EU <a href="http://www.dbms2.com/2009/12/14/oracle-mysql-storage-engine/" >alleviated that concern</a>, except that there was an annoying time limit on the alleviation.</p>
<p>Now a related can of worms has been opened in a related technology area &#8212; <strong>WordPress</strong> and <strong>WordPress themes</strong>. Since many bloggers use WordPress, this has gotten a lot of attention, and some interesting new insights have emerged. <span id="more-2604"></span><em></em></p>
<p><em>Um, in case you didn&#8217;t know: WordPress is the software that runs blogs such as this, and it&#8217;s a GPLed open source project. However, the user interface &#8212; look, feel, and behavior alike &#8212; are determined by separate</em> themes,<em> that one usually gets from third parties (WordPress ships with a a few default choices). </em></p>
<p>It started when Matt Mullenweg went after the makers of an unfree theme Thesis, and wielding <a href="http://wordpress.org/news/2009/07/themes-are-gpl-too/" onclick="javascript:pageTracker._trackPageview('/wordpress.org');">a legal opinion from the Software Freedom Law Center</a>. The gist of the SFLC&#8217;s argument seems to be</p>
<blockquote><p>They are derivative of WordPress because every part of them is determined by the content of the WordPress functions they call. As works of authorship, they are designed only to be combined with WordPress into a larger work.</p></blockquote>
<p>And of course the point of the GPL is that if you create a derivative work of something GPLed, you have to GPL it yourself.</p>
<p>However, <em>Perpetual Beta</em> pointed out that, under the rules of copyright law as expressed in a court case known as Galoob, <a href="http://perpetualbeta.com/release/2009/11/why-the-gpl-does-not-apply-to-premium-wordpress-themes/" onclick="javascript:pageTracker._trackPageview('/perpetualbeta.com');">depending on another program does not make something a derivative work</a>. This is actually blindingly obvious, as in the example of any program that runs on top of an operating system. Or for more examples see <a href="http://scripting.com/stories/2010/07/16/areWordpressThemesNecessar.html" onclick="javascript:pageTracker._trackPageview('/scripting.com');">Dave Winer</a> on the point.</p>
<p><em>Perpetual Beta</em> further argued that, even if it were a derivative work, <a href="http://perpetualbeta.com/release/2009/11/why-the-gpl-does-not-apply-to-premium-wordpress-themes/" onclick="javascript:pageTracker._trackPageview('/perpetualbeta.com');">fair use would let one copy it anyway</a>. I.e., if you&#8217;re engaging in &#8220;fair use,&#8221; you&#8217;re entitled to do what otherwise would be a copyright violation. Good point. The GPL license says in effect &#8220;You only are allowed to use this material (in certain ways) if you do as we say about your own work,&#8221; so that is defeated if the Fair Use Doctrine lets you say &#8220;Um, actually, I&#8217;m using this without your permission, so buzz off.&#8221;</p>
<p>GPL advocates can pontificate all they want about certain uses of GPLed code violating their license terms. But if either of these arguments holds up &#8212; and it looks to me like both do &#8212; <strong>a program that invokes GPLed code is not subject to the GPL just based on those invocations. </strong>And that, in turn, would more than imply that <strong>MySQL storage engine vendors could use GPLed MySQL-compatible front-ends without being under any GPL obligation themselves.<br />
</strong></p>
]]></content:encoded>
			<wfw:commentRss>http://www.dbms2.com/2010/07/17/mysql-gpl-storage-engine-wordpress-theme/feed/</wfw:commentRss>
		<slash:comments>9</slash:comments>
		</item>
		<item>
		<title>Sybase SQL Anywhere</title>
		<link>http://www.dbms2.com/2010/07/17/sybase-sql-anywhere/</link>
		<comments>http://www.dbms2.com/2010/07/17/sybase-sql-anywhere/#comments</comments>
		<pubDate>Sun, 18 Jul 2010 00:13:20 +0000</pubDate>
		<dc:creator>Curt Monash</dc:creator>
				<category><![CDATA[Mid-range]]></category>
		<category><![CDATA[Progress, Apama, and DataDirect]]></category>
		<category><![CDATA[Specific users]]></category>
		<category><![CDATA[Sybase]]></category>

		<guid isPermaLink="false">http://www.dbms2.com/?p=2592</guid>
		<description><![CDATA[After Powersoft acquired Watcom and its famed Fortran compiler, marketing VP Tom Herring told me that the hidden jewel of the acquisition might well be a little DBMS, Watcom SQL. To put it mildly, Tom was right. Watcom SQL became SQL Anywhere; Powersoft was acquired by Sybase; Powersoft&#8217;s and Sybase&#8217;s main products both fell on [...]]]></description>
			<content:encoded><![CDATA[<p style="margin-bottom: 0in;">After Powersoft acquired Watcom and its famed Fortran compiler, marketing VP Tom Herring told me that the hidden jewel of the acquisition might well be a little DBMS, Watcom SQL. To put it mildly, Tom was right. Watcom SQL became SQL Anywhere; Powersoft was acquired by Sybase; Powersoft&#8217;s and Sybase&#8217;s main products both fell on hard times; Sybase built a whole mobile technology division around SQL Anywhere; and <a href="http://www.dbms2.com/2010/05/13/sap-sybase-reactions/" >the whole thing just got sold for billions of dollars to SAP</a>. Chris Kleisath recently briefed me on SQL Anywhere Version 12 (released to manufacturing this month), which seemed like a fine opportunity to catch up on prior developments as well.</p>
<p style="margin-bottom: 0in;">The first two things to understand about SQL Anywhere is that there actually are three products:</p>
<ul>
<li>Sybase 	<strong>SQL Anywhere,</strong> a mid-range relational DBMS.</li>
<li>Sybase 	<strong>UltraLite,</strong> a DBMS for mobile devices.</li>
<li>Sybase 	<strong>MobiLink,</strong> a replication/sync tool.</li>
</ul>
<p style="margin-bottom: 0in;">and also that there are three main deployment/use cases:</p>
<ul>
<li><strong>Generic 	desktop or server computers.</strong> This was the original market for 	SQL Anywhere.</li>
<li><strong>Laptop/handheld 	computers.</strong> This was the original growth market for SQL Anywhere. 	In particular, Siebel Systems&#8217; first growth spurt was selling sales 	force automation software on laptop computers with SQL Anywhere 	underneath.</li>
<li><strong>Specialized 	devices.</strong> Earlier this decade, Sybase thought SQL Anywhere&#8217;s big 	growth market was on specialized devices. (I recall a video 	featuring some kind of automated pill dispensing machine for 	hospitals.)</li>
</ul>
<p style="margin-bottom: 0in;"><span id="more-2592"></span>Meanwhile, terminological weirdness includes:</p>
<ul>
<li>At 	various times the product Sybase SQL Anywhere has had “Server” 	in its name, but that seems not be part of the naming convention at 	the moment. Sybase folks still talk about it that way, however.</li>
<li>Is 	SQL Anywhere a product? A group of products? A division? Again, that 	depends on whom and when you ask.</li>
<li>When 	Sybase speaks of SQL Anywhere being in the “embedded” database 	market, it&#8217;s talking of any OEM business, not just deployment on 	particular kinds of devices. (<a href="http://www.intersystems.com/cache/analysts/idc_embed.pdf" onclick="javascript:pageTracker._trackPageview('/www.intersystems.com');">IDC</a> has long used that confusing terminology, which is a regrettably 	good excuse for vendors to use it as well.)</li>
</ul>
<p style="margin-bottom: 0in;">I get the impression that there are two big markets for the Sybase SQL Anywhere family:</p>
<ul>
<li>~1000 	OEM partners for Sybase SQL Anywhere. Notable examples include:
<ul>
<li>Intuit 	(for QuickBooks)</li>
<li>Symantec 	(for NetBackup)</li>
<li>Sybase 	IQ, if you want to view it that way (a lot of Sybase IQ&#8217;s front-end 	is actually SQL Anywhere code)</li>
</ul>
</li>
<li>Direct 	enterprise customers for mobility-oriented combinations of Sybase 	UltraLite, Sybase SQL Anywhere, and Sybase MobiLink. Notable 	examples include:
<ul>
<li>Pepsi Bottling Group (repairmen, 	using handheld devices)</li>
<li>US 	2010 Census (140,000 people address checkers – every address in 	the US was checked against GPS coordinates, it seems)</li>
</ul>
</li>
</ul>
<p style="margin-bottom: 0in;">By way of contrast, the market for true embedded systems never took off as strongly for Sybase SQL Anywhere as it did for, say, pre-acquisition solidDB. Added together, this all seems to amount to ~ 10 million users overall, as per <a href="http://www.monash.com/uploads/SQL-Anywhere-12.pdf" onclick="javascript:pageTracker._trackPageview('/www.monash.com');">a slide deck Sybase graciously gave me permission to post</a>.</p>
<p style="margin-bottom: 0in;">Other market notes include:</p>
<ul>
<li>Sybase says that the main competitors for Sybase SQL Anywhere are Microsoft SQL Server, Oracle, and occasionally MySQL, with Progress coming up only rarely. I would have thought Progress OpenEdge RDBMS would be higher on the list (simple DBMS for the OEM market) – but come to think of it, it&#8217;s been quite a while since I&#8217;ve run into anybody who says they compete much with Progress OpenEdge. Perhaps Progress just isn&#8217;t selling its core product very hard these days.</li>
<li>80-90% of Sybase SQL Anywhere installations are on Windows, but the fact that SQL Anywhere also runs on Macintosh and &#8216;nix platforms is apparently pretty important to sales.</li>
<li>The main platforms for Sybase UltraLite are Blackberry and Windows Mobile, although with iPhone support newly added in SQL Anywhere Version 12, that&#8217;s a good bet to change.</li>
</ul>
<p style="margin-bottom: 0in;">Besides the above, the key points about Sybase SQL Anywhere are:</p>
<ul>
<li>Sybase 	SQL Anywhere is designed for a no-DBA, “customers may not know 	there&#8217;s a DBMS” there kind of easy administration. (That&#8217;s one 	reason I compared it to Progress.)</li>
<li>Sybase 	SQL Anywhere has a reasonable set of mid-range DBMS features, 	including:
<ul>
<li>Triggers</li>
<li>Stored 	procedures</li>
<li>Referential 	integrity</li>
<li>Geospatial 	datatype</li>
<li>Text 	datatype (with a CONTAINS clause), and you can index external 	documents</li>
<li>Multiprocessor/multicore 	operation</li>
</ul>
</li>
</ul>
<p style="margin-bottom: 0in;">Sybase SQL Anywhere Version 12 seems to have a variety of performance improvements, of which the most interesting are:</p>
<ul>
<li>If 	you&#8217;re using an ORM (Object-Relational Mapping) layer such as 	Hibernate, the SQL Anywhere optimizer now works hard to rewrite some 	of the horrific SQL that can be generated, for example by flattening 	or eliminating a lot of subselects.
<ul>
<li>There&#8217;s 	also a “Hibernate dialect” of SQL Anywhere – I&#8217;m not sure what 	that means, but it clearly indicates that SQL Anywhere wants to be 	Hibernate-friendly.</li>
</ul>
</li>
<li>When 	the SQL Anywhere optimizer decides the statistics are wrong, it 	reruns them, piggybacking on a subsequent query if it can, doing a 	background job if it must. Somewhat pretentiously, Sybase refers to 	this as “self-healing statistics”.</li>
</ul>
<p style="margin-bottom: 0in;">
<p style="margin-bottom: 0in;">Many SQL Anywhere features are of course missing in Sybase UltraLite, but geospatial datatypes are present.</p>
<p style="margin-bottom: 0in;">Finally, it&#8217;s a natural conjecture that MobiLink is related to Sybase Replication Server – but that conjecture turns out to be wrong. That said, <a href="http://www.sybaseteam.com/what-difference-between-replication-server-sql-remote-t-728.html" onclick="javascript:pageTracker._trackPageview('/www.sybaseteam.com');">keeping Sybase&#8217;s various replication offerings straight is tricky</a>.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.dbms2.com/2010/07/17/sybase-sql-anywhere/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>More on Greenplum and EMC</title>
		<link>http://www.dbms2.com/2010/07/07/more-on-greenplum-and-emc/</link>
		<comments>http://www.dbms2.com/2010/07/07/more-on-greenplum-and-emc/#comments</comments>
		<pubDate>Wed, 07 Jul 2010 19:15:21 +0000</pubDate>
		<dc:creator>Curt Monash</dc:creator>
				<category><![CDATA[Data warehouse appliances]]></category>
		<category><![CDATA[Data warehousing]]></category>
		<category><![CDATA[EMC]]></category>
		<category><![CDATA[Greenplum]]></category>
		<category><![CDATA[Market share]]></category>

		<guid isPermaLink="false">http://www.dbms2.com/?p=2534</guid>
		<description><![CDATA[I talked with Ben Werther of Greenplum for about 40 minutes, which was my first post-merger Greenplum/EMC briefing. &#8220;Historical&#8221; highlights include:

Ben says Greenplum wasn&#8217;t being shopped, by which he means Greenplum was out raising more capital and the fund-raising was going well.  Note: Half or so of Greenplum&#8217;s deals were subscription-priced, so it had weaker [...]]]></description>
			<content:encoded><![CDATA[<p>I talked with Ben Werther of Greenplum for about 40 minutes, which was my first post-merger Greenplum/EMC briefing. &#8220;Historical&#8221; highlights include:</p>
<ul>
<li>Ben says Greenplum wasn&#8217;t being shopped, by which he means Greenplum was out raising more capital and the fund-raising was going well.  <em>Note: <a href="http://www.dbms2.com/2009/10/18/greenplum-customer-notes/" >Half or so of Greenplum&#8217;s deals were subscription-priced</a>, so it had weaker cash flow than it would have if it were doing equally well selling perpetual licenses.</em></li>
<li>However, joint engineering was also going well with, e.g., Greenplum CTO Luke Lonergan spending time at EMC facilities in Cork, Ireland. And one thing led to another &#8230;</li>
<li>Greenplum has ~ 140 customers, vs. <a href="http://www.dbms2.com/2009/06/05/greenplum-update-release-3-3/" >~65 five quarters ago</a>, 100+ at year-end, and an acquisition rate of 12-15/quarter last fall.</li>
<li>A typical &#8220;small&#8221; paying customer for Greenplum starts with 10-20 TB of data.</li>
<li><a href="http://www.dbms2.com/2010/04/12/greenplumchorus/" >Greenplum Chorus</a> isn&#8217;t generally available yet, with rollout energy being focused on Greenplum 4.0. <em>Note: As important as it is for overall industry direction, Greenplum Chorus is a product which won&#8217;t be a terribly big deal in Release 1 anyway.</em></li>
</ul>
<p>Highlights looking forward include:  <span id="more-2534"></span></p>
<ul>
<li>When I challenged him, Ben sounded quite optimistic that Pat Gelsinger will immunize Greenplum against and generally counteract some of EMC&#8217;s traditionally stifling bureaucracy. (My words, of course, not his.)</li>
<li>The initial Greenplum/EMC product vision appears truly centered around &#8220;private cloud,&#8221; specifically including Greenplum, VMware, and EMC storage arrays.</li>
<li>Some other areas of potential Greenplum/EMC technical synergy I think are cool obviously haven&#8217;t been seriously addressed yet.</li>
<li>Based on what I heard from Ben about the aura around the deal and also on what I know of the individual executives at Greenplum, I think each of them is a good bet to stick around EMC for a while. (That&#8217;s on average. Of course, it would be surprising if 100% of them stayed around very long.) Basically, there&#8217;s at least a chance EMC/Greenplum will do some pretty cool stuff, and most of the guys will probably stick around to see if that actually starts to happen.*</li>
</ul>
<p><em>*Also, when they do eventually leave, they&#8217;ll surely say things to the effect &#8220;The cool stuff is well underway; my work here is done.&#8221; That party line is almost guaranteed, no matter how things unfold in reality. <img src='http://www.dbms2.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' /> </em></p>
]]></content:encoded>
			<wfw:commentRss>http://www.dbms2.com/2010/07/07/more-on-greenplum-and-emc/feed/</wfw:commentRss>
		<slash:comments>6</slash:comments>
		</item>
		<item>
		<title>Why analytic DBMS increasingly need to be storage-aware</title>
		<link>http://www.dbms2.com/2010/07/07/analytic-database-storage-aware/</link>
		<comments>http://www.dbms2.com/2010/07/07/analytic-database-storage-aware/#comments</comments>
		<pubDate>Wed, 07 Jul 2010 06:30:14 +0000</pubDate>
		<dc:creator>Curt Monash</dc:creator>
				<category><![CDATA[Data warehouse appliances]]></category>
		<category><![CDATA[Data warehousing]]></category>
		<category><![CDATA[Solid-state memory]]></category>
		<category><![CDATA[Storage]]></category>
		<category><![CDATA[Theory and architecture]]></category>

		<guid isPermaLink="false">http://www.dbms2.com/?p=2515</guid>
		<description><![CDATA[In my quick reactions to the EMC/Greenplum announcement, I opined

I think that even software-only analytic DBMS vendors should design their systems in an increasingly storage-aware manner

promising to explain what I meant later on. So here goes.  
There always have been good technical reasons to tailor hardware to analytic database software. Data moves through disk controller, [...]]]></description>
			<content:encoded><![CDATA[<p style="margin-bottom: 0in;"><span style="font-style: normal;">In <a href="../2010/07/06/emc-is-buying-greenplum/">my quick reactions to the EMC/Greenplum announcement</a>, I opined</span></p>
<blockquote>
<p style="margin-bottom: 0in;">I think that even software-only analytic DBMS vendors should design their systems in an increasingly storage-aware manner</p>
</blockquote>
<p style="margin-bottom: 0in;">promising to explain what I meant later on. So here goes.  <span id="more-2515"></span></p>
<p style="margin-bottom: 0in;">There always have been good technical reasons to tailor hardware to analytic database software. Data moves through disk controller, network, RAM, CPU and more, each with its own data rate. Getting different kinds of parts into the right bal<span style="font-style: normal;">ance doesn&#8217;t completely eliminate bottlenecks – the <a href="../2010/07/06/the-one-hoss-shay/">Wonderful One-Hoss Shay</a> is poetic fiction </span>– but it certainly can help. As a result, every analytic DBMS vendor of any size offers at least one of:</p>
<ul>
<li><a href="../2007/01/27/data-warehouse-appliance-hardware-strategies/">A 	Type 0 appliance</a></li>
<li>A Type 1 appliance</li>
<li>A “recommended hardware 	configuration”</li>
</ul>
<p style="margin-bottom: 0in;">And beyond performance, appliances and pre-specified hardware configurations offer at least the possibility of easing installation, administration, and support.</p>
<p style="margin-bottom: 0in;">There also are marketing reasons to offer an appliance or something appliance-like.</p>
<ul>
<li>To various extents, Oracle, 	Teradata, Microsoft, IBM, Netezza, and EMC are all telling the world 	that your hardware should be optimized for your analytic DBMS.</li>
<li>Smaller vendors such as Vertica 	and Aster Data also tend to cobble together some sort of appliance, 	in part so they don&#8217;t have to say they disagree.</li>
<li>Thus, a “We don&#8217;t see any point 	in special hardware assembly at all” story would leave an analytic 	DBMS vendor pretty far out on a limb.</li>
</ul>
<p style="margin-bottom: 0in;">Finally, there are three overlapping technical trends that increase the need for storage-awareness in analytic DBMS. First and foremost is the rise of <strong>solid-state memory.</strong> For starters, I believe:</p>
<ul>
<li><a href="../2010/06/25/flash-is-coming-well/">Flash 	will be important for analytic DBMS soon</a>.</li>
<li>There are good technical reasons 	for this.</li>
<li><a href="../2010/01/22/oracle-database-hardware-strategy/">Oracle&#8217;s 	marketing will make a big deal out of the Flash aspects of Exadata</a>, 	so other analytic DBMS vendors will need a response. And of course, 	if Netezza or Teradata preemptively make a big deal of their 	Flash-based offerings, that just adds to the pressure for Flash 	adoption on everybody else.</li>
<li>But it&#8217;s not just Flash – <a href="../2010/01/31/flash-pcmsolid-state-memory-disk/">Flash, 	other solid-state memory, and disk</a> will be combined in various 	ways.</li>
</ul>
<p style="margin-bottom: 0in;">But this move to Flash will require analytic DBMS vendors to be increasingly storage-aware for at least three reasons:</p>
<ul>
<li>It just adds another level of 	<strong>complexity</strong> to their hardware-balancing challenges.</li>
<li><strong>Flash overturns some of the 	fundamental assumptions of modern analytic DBMS,</strong> in particular:
<ul>
<li><a href="../2006/09/19/is-data-warehousing-now-all-about-sequential-access/">Sequential 	reads are hugely better than random</a></li>
<li>The worst bottleneck is at the 	point where data comes out of storage.</li>
</ul>
</li>
<li><strong>The Flash technology stack is 	still immature,</strong> and you have to pick your poison in how to deal 	with it. Vendors are making very different choices in this regard – 	and they do have to choose.</li>
</ul>
<p style="margin-bottom: 0in;">Another trend that could naturally lead analytic DBMS vendors to be more storage-aware is their incorporation of what could be viewed as hierarchical storage/ILM technologies.  Different data is stored in different ways and/or on different kinds of storage hardware. (Vendors pursuing – you guessed it – different approache<span style="font-style: normal;">s to this include <a href="../2009/08/04/2008/10/14/teradata-virtual-storage/">Teradata</a>, <a href="../2009/10/14/greenplum-hybrid-columnar/">Greenplum</a>, <a href="../2009/08/04/flexstore-and-the-rest-of-vertica-35/">Vertica</a>, and <a href="../2009/08/25/sybase-iq-technical-highlights/">Sybase</a>.) The m</span>ore automatic that process is, the more storage-aware the DBMS will need to be.</p>
<p style="margin-bottom: 0in;">Finally, there are reasons to th<span style="font-style: normal;">ink that <a href="../2008/09/06/sans-vs-das-in-mpp-data-warehousing/">DBMS should be split between conventional servers and smart storage</a>. This is, of course, the E</span>xadata strategy. <a href="../2010/06/21/netezza-silicon-balance/">Netezza&#8217;s two-processor approach</a>, while rather different, also somewhat validates the idea.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.dbms2.com/2010/07/07/analytic-database-storage-aware/feed/</wfw:commentRss>
		<slash:comments>6</slash:comments>
		</item>
	</channel>
</rss>
