<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>DBMS2 -- DataBase Management System Services &#187; Memory-centric data management</title>
	<atom:link href="http://www.dbms2.com/category/memory-centric-data-management/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.dbms2.com</link>
	<description>Choices in data management and analysis</description>
	<lastBuildDate>Fri, 19 Mar 2010 15:49:58 +0000</lastBuildDate>
	<generator>http://wordpress.org/?v=2.8.4</generator>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
			<item>
		<title>Memcached-based company NorthScale launches</title>
		<link>http://www.dbms2.com/2010/03/16/memcached-northscale-launc/</link>
		<comments>http://www.dbms2.com/2010/03/16/memcached-northscale-launc/#comments</comments>
		<pubDate>Tue, 16 Mar 2010 17:52:48 +0000</pubDate>
		<dc:creator>Curt Monash</dc:creator>
				<category><![CDATA[Cache]]></category>
		<category><![CDATA[Clustering]]></category>
		<category><![CDATA[NoSQL]]></category>
		<category><![CDATA[Parallelization]]></category>

		<guid isPermaLink="false">http://www.dbms2.com/?p=1717</guid>
		<description><![CDATA[NorthScale, a start-up based around memcached, has just launched, two weeks after the Todd Hoff&#8217;s post arguing the MySQL/memcached combo is passe&#8217;. NorthScale wouldn&#8217;t necessarily argue with Todd, arguing that what you really should use instead is NorthScale&#8217;s combo of memcached and MemBase, a memcached-like DBMS &#8230;
&#8230; or something like that. I don&#8217;t intend to [...]]]></description>
			<content:encoded><![CDATA[<p>NorthScale, a start-up based around memcached, has just launched, two weeks after the Todd Hoff&#8217;s post arguing <a href="http://www.dbms2.com/2010/03/02/cassandra-nosql-scalable-oltp/" >the MySQL/memcached combo is passe&#8217;</a>. NorthScale wouldn&#8217;t necessarily argue with Todd, arguing that what you really should use instead is NorthScale&#8217;s combo of memcached and MemBase, a memcached-like DBMS &#8230;</p>
<p>&#8230; or something like that. I don&#8217;t intend to write seriously about NorthScale until I have a better idea of what MemBase is.</p>
<p>In the mean time,</p>
<ul>
<li>VentureBeat put up a solid post on <a href="http://deals.venturebeat.com/2010/03/16/northscale-zynga-memcached/" onclick="javascript:pageTracker._trackPageview('/deals.venturebeat.com');">NorthScale&#8217;s company history</a> and so on</li>
<li>Om Malik bought into <a href="http://gigaom.com/2010/03/16/northscale/" onclick="javascript:pageTracker._trackPageview('/gigaom.com');">the NorthScale memcached pitch</a></li>
<li>TechCrunch has <a href="http://techcrunch.com/2010/03/16/northscales-data-management-technology-attracts-zynga-and-others/" onclick="javascript:pageTracker._trackPageview('/techcrunch.com');">a low-quality post about NorthScale</a> (although it wasn&#8217;t as error-riddled as the same author&#8217;s post about nStein, which<a href="http://intelligent-enterprise.informationweek.com/blog/archives/2010/02/open_text_buyin.html;jsessionid=T51GQFI1CCPL1QE1GHOSKHWATMY32JVN" onclick="javascript:pageTracker._trackPageview('/intelligent-enterprise.informationweek.com');"> Seth Grimes properly blasted</a>)</li>
</ul>
]]></content:encoded>
			<wfw:commentRss>http://www.dbms2.com/2010/03/16/memcached-northscale-launc/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Notes on Sybase Adaptive Server Enterprise</title>
		<link>http://www.dbms2.com/2010/02/25/sybase-adaptive-server-enterprise-as/</link>
		<comments>http://www.dbms2.com/2010/02/25/sybase-adaptive-server-enterprise-as/#comments</comments>
		<pubDate>Thu, 25 Feb 2010 13:10:48 +0000</pubDate>
		<dc:creator>Curt Monash</dc:creator>
				<category><![CDATA[Cache]]></category>
		<category><![CDATA[In-memory DBMS]]></category>
		<category><![CDATA[Memory-centric data management]]></category>
		<category><![CDATA[Sybase]]></category>

		<guid isPermaLink="false">http://www.dbms2.com/?p=1646</guid>
		<description><![CDATA[It had been a very long time since I was remotely up to speed on Sybase&#8217;s main OLTP DBMS, Adaptive Server Enterprise (ASE).  Raj Rathee, however, was kind enough to fill me in a few days ago. Highlights of our chat included:

One of the most confusing things about Sybase ASE is its version numbering. In [...]]]></description>
			<content:encoded><![CDATA[<p>It had been a very long time since I was remotely up to speed on Sybase&#8217;s main OLTP DBMS, Adaptive Server Enterprise (ASE).  Raj Rathee, however, was kind enough to fill me in a few days ago. Highlights of our chat included:<span id="more-1646"></span></p>
<ul>
<li>One of the most confusing things about Sybase ASE is its version numbering. In particular,
<ul>
<li>Sybase ASE 15.5 went GA in December, 2009. (But the clustered version is just coming out in March.)</li>
<li>The prior version of Sybase ASE was 15.03.</li>
<li>Sybase ASE 15.0 came out in September, 2005.</li>
<li>The version of Sybase ASE before that was 12.5.</li>
<li>And by the way, Sybase System 10 came out in 1994 or so.</li>
</ul>
</li>
<li><strong>Sybase ASE 15.0 was a major rewrite.</strong> In particular, Sybase ASE 15.0 had a “brand new” optimizer and query processing engine, based on the <strong>Volcano</strong> model. The main driver of the rewrite was to make Sybase ASE suitable for mixing OLTP and some level of decision-support workloads. (Not on the order of what Sybase IQ can handle, but at least operational reporting and so on.)</li>
<li>I haven&#8217;t looked up Volcano in more detail than to confirm that what I thought Raj said made sense, but as he characterized it, it&#8217;s a lot more modular than what Sybase had in ASE 12.5. For example, substantially the only join algorithm in Sybase ASE 12.5 was nested loop – no hash or sort/merge.</li>
<li>As you might imagine, a lot of things one might regard as core modern DBMS features were only added to Sybase ASE once 15.0 came out. Examples include:
<ul>
<li>Various forms of partitioning at the storage level.</li>
<li>User-defined functions (UDFs).</li>
<li>A clustering offering that competes with Oracle RAC. (100 or so customers are on that so far.) Absent clustering, Sybase ASE is limited to a single SMP (Symmetric Multiprocessing) box.</li>
<li>Shared disk. Amazingly, it seems that before 2008, every node in an SMP box running Sybase ASE had its own private partition (maybe not the right word) of data.</li>
</ul>
</li>
<li>In Sybase ASE, you have lots of databases managed by one database server. You can write SQL statements that span multiple databases, but they have to reference database names as well as table names.</li>
<li>There are several ways to get data from one place to another in Sybase&#8217;s technology and nomenclature, specifically including. Replication Server, Incremental Data Transfer, and “proxy tables.” (Other than the fact that Replication Server is a separate, chargeable product, I don&#8217;t really have these straight.) In addition, there&#8217;s a hand-coded one in <a href="http://www.dbms2.com/2010/02/05/sybase-aleri-rap/" >Sybase RAP</a>, which will get a planned 5-6X performance improvement later this year when it is replaced by Incremental Data Transfer.</li>
</ul>
<p>And in what basically sounds like a very cool approach, Sybase ASE has a lot of <strong>memory-centric</strong> aspects. That said, Sybase&#8217;s in-memory ASE story is still incomplete (wait until the next release) and confused (I think in part because of what&#8217;s missing in the current release).  Also, this is one area where the non-technical nature of the briefing got in my way. So here&#8217;s some of what I do and don&#8217;t know about Sybase&#8217;s memory-centric ASE strategy:</p>
<ul>
<li>Sybase lets you mix and match on-disk and in-memory databases under one instance of Sybase ASE. To a programmer, it all looks like ASE.</li>
<li>I don&#8217;t know exactly what the limitations are on what you can do with in-memory databases, how you can use them in tandem with on-disk databases, etc.</li>
<li>You can replicate data from disk to an in-memory Sybase ASE database today. (Hello caching, ala Oracle Times Ten or IBM DB2/solidDB.)</li>
<li>Replicating from memory to disk is a near-term future capability. (So Sybase does not yet have a hybrid memory-centric story ala <a href="http://www.dbms2.com/2007/06/22/in-memory-database-solid/" >solidDB Classic</a>.)</li>
<li>I have no clue as to what kinds of in-memory data structures Sybase ASE uses.</li>
</ul>
]]></content:encoded>
			<wfw:commentRss>http://www.dbms2.com/2010/02/25/sybase-adaptive-server-enterprise-as/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Quick thoughts on the StreamBase Component Exchange</title>
		<link>http://www.dbms2.com/2010/02/16/quick-thoughts-on-the-streambase-component-exchange/</link>
		<comments>http://www.dbms2.com/2010/02/16/quick-thoughts-on-the-streambase-component-exchange/#comments</comments>
		<pubDate>Tue, 16 Feb 2010 15:04:22 +0000</pubDate>
		<dc:creator>Curt Monash</dc:creator>
				<category><![CDATA[Complex event processing (CEP)]]></category>
		<category><![CDATA[Games and virtual worlds]]></category>
		<category><![CDATA[Investment research and trading]]></category>
		<category><![CDATA[Open source]]></category>
		<category><![CDATA[StreamBase]]></category>

		<guid isPermaLink="false">http://www.dbms2.com/?p=1591</guid>
		<description><![CDATA[Streambase is announcing something called the StreamBase Component Exchange, for developers to exchange components to be used with the StreamBase engine, presumably on an open source basis. I simultaneously think:

This is a good idea, and many software vendors should do it if they aren&#8217;t already.
It&#8217;s no big deal.

For reasons why, let me quote an email [...]]]></description>
			<content:encoded><![CDATA[<p>Streambase is announcing something called the <a href="http://streambase.com/b6409b0d-7d1f-4cf8-99b9-98b2b1858628/press-release-detail.htm" onclick="javascript:pageTracker._trackPageview('/streambase.com');">StreamBase Component Exchange</a>, for developers to exchange components to be used with the StreamBase engine, presumably on an open source basis. I simultaneously think:</p>
<ul>
<li>This is a good idea, and many software vendors should do it if they aren&#8217;t already.</li>
<li>It&#8217;s no big deal.</li>
</ul>
<p>For reasons why, let me quote an email I just sent to an inquiring reporter:</p>
<ul>
<li>StreamBase sells mainly to the financial services and intelligence community markets. Neither group will share much in the way of core algorithms.</li>
<li>But both groups are <a href="http://www.dbms2.com/2009/01/27/introduction-to-pentaho/" >pretty interested in open source software</a> even so. (I think for both the price and customizability benefits.)</li>
<li>Open source software commonly gets community contributions for connectors, adapters, and (national) language translations.</li>
<li>But useful contributions in other areas are much rarer.</li>
<li>Linden Labs is one of StreamBase&#8217;s <a href="http://www.dbms2.com/2009/03/09/independent-cep-vendors-continue-to-flounder/" >few significant customers outside its two core markets</a>.</li>
<li>All of the above are consistent with the press release (which quotes only one StreamBase customer &#8212; guess who?).</li>
</ul>
]]></content:encoded>
			<wfw:commentRss>http://www.dbms2.com/2010/02/16/quick-thoughts-on-the-streambase-component-exchange/feed/</wfw:commentRss>
		<slash:comments>5</slash:comments>
		</item>
		<item>
		<title>The Sybase Aleri RAP</title>
		<link>http://www.dbms2.com/2010/02/05/sybase-aleri-rap/</link>
		<comments>http://www.dbms2.com/2010/02/05/sybase-aleri-rap/#comments</comments>
		<pubDate>Sat, 06 Feb 2010 00:05:11 +0000</pubDate>
		<dc:creator>Curt Monash</dc:creator>
				<category><![CDATA[Aleri and Coral8]]></category>
		<category><![CDATA[Analytic technologies]]></category>
		<category><![CDATA[Complex event processing (CEP)]]></category>
		<category><![CDATA[Data warehousing]]></category>
		<category><![CDATA[In-memory DBMS]]></category>
		<category><![CDATA[Investment research and trading]]></category>
		<category><![CDATA[Market share]]></category>
		<category><![CDATA[Memory-centric data management]]></category>
		<category><![CDATA[Sybase]]></category>

		<guid isPermaLink="false">http://www.dbms2.com/?p=1545</guid>
		<description><![CDATA[Well, I got a quick Sybase/Aleri briefing, along with multiple apologies for not being prebriefed. (Main excuse: News was getting out, which accelerated the announcement.) Nothing badly contradicted my prior post on the Sybase/Aleri deal.
To understand Sybase&#8217;s plans for Aleri and CEP, it helps to understand Sybase&#8217;s current CEP-oriented offering, Sybase RAP. So far as [...]]]></description>
			<content:encoded><![CDATA[<p style="margin-bottom: 0in;">Well, I got a quick Sybase/Aleri briefing, along with multiple apologies for not being prebriefed. <em>(Main excuse: News was getting out, which accelerated the announcement.)</em> Nothing badly contradicted my prior post on <a href="http://www.dbms2.com/2010/02/04/sybase-aleri-acquisitio/" >the Sybase/Aleri deal</a>.</p>
<p style="margin-bottom: 0in;">To understand Sybase&#8217;s plans for Aleri and CEP, it helps to understand Sybase&#8217;s current CEP-oriented offering, <strong>Sybase RAP.</strong> So far as I ca<span style="font-weight: normal;">n tell, Sybase RAP has to date only been sold in the form of</span><strong> Sybase RAP: The Trading Edition.</strong> In that guise, Sybase RAP has been sold to &gt;40 outfits since its May, 2008 launch, mainly big names in the investment banking and stock exchange sectors. If I understood correctly, the next target market for Sybase RAP is telcos, for real-time network tuning and management.</p>
<p style="margin-bottom: 0in;">In addition to any domain-specific applications, Sybase RAP has three layers:</p>
<ul>
<li><strong>CEP (Complex Event Processing).</strong> Sybase RAP CEP is based on a version of the Coral8 engine Sybase 	licensed and has been subsequently developing.</li>
<li><strong>In-memory DBMS.</strong> Sybase&#8217;s 	IMDB is part of (but I guess separable from) and has the same API as 	Sybase&#8217;s OLTP DBMS Adaptive Server Enterprise (ASE, aka Sybase 	Classic).</li>
<li><strong>Sybase IQ.</strong> Actually, Sybase 	used the phrase “based on Sybase IQ,” but I&#8217;m guessing it&#8217;s just 	Sybase IQ.</li>
</ul>
<p style="margin-bottom: 0in;"><span id="more-1545"></span>In theory, there could be a DBMS other than Sybase IQ, such as Sybase ASE or even Oracle, because Sybase IMDB can talk to a variety of DBMS. I didn&#8217;t get the impression, however, that in practice there were any Sybase RAP installations whose persistent DBMS was anything other than Sybase IQ.</p>
<p style="margin-bottom: 0in;">Aleri had all along had something called Project Ohio, to merge Coral8 with Aleri Classic.  Now Sybase&#8217;s own CEP engineering team is being added to the mix, schedules are being reconsidered and haven&#8217;t been disclosed yet. <em>(If one woman can produce one baby in nine months, how long does it take nine women to produce a baby?) </em>Apparently Sybase has a dozen programmers in the CEP area, plus ~20 more on Sybase RAP, not counting QA, documentation, etc.; that represents a significant bump to the overall Aleri development team.</p>
<p style="margin-bottom: 0in;">Sybase doesn&#8217;t seem to have decided what to do yet with the various <a href="../2008/10/20/coral8-proposes-cep-as-a-bi-data-platform/">business intelligence</a>/real-time OLAP engine products and technologies it is inheriting from Aleri.</p>
<p style="margin-bottom: 0in;">And finally, some metrics:</p>
<ul>
<li>The Sybase/Aleri guys estimate 	that 1/3 of of Aleri&#8217;s customers and even less of its revenue came 	from outside the financial services sector. They did say the 	non-financial-services business was “starting to pick up,” but 	not very convincingly.</li>
<li>Sybase IQ is now up to &gt;1800 	customers, with &gt;200 new ones in 2009.</li>
<li>Sybase IQ indeed has users taking 	in market feeds up to 3 terabytes a day, so it probably  matches 	Vertica in having at least several-hundred-terabyte databases in the 	financial sector.</li>
</ul>
]]></content:encoded>
			<wfw:commentRss>http://www.dbms2.com/2010/02/05/sybase-aleri-rap/feed/</wfw:commentRss>
		<slash:comments>4</slash:comments>
		</item>
		<item>
		<title>Quick thoughts on Sybase/Aleri</title>
		<link>http://www.dbms2.com/2010/02/04/sybase-aleri-acquisitio/</link>
		<comments>http://www.dbms2.com/2010/02/04/sybase-aleri-acquisitio/#comments</comments>
		<pubDate>Thu, 04 Feb 2010 16:15:19 +0000</pubDate>
		<dc:creator>Curt Monash</dc:creator>
				<category><![CDATA[Aleri and Coral8]]></category>
		<category><![CDATA[Analytic technologies]]></category>
		<category><![CDATA[Complex event processing (CEP)]]></category>
		<category><![CDATA[Investment research and trading]]></category>
		<category><![CDATA[Sybase]]></category>

		<guid isPermaLink="false">http://www.dbms2.com/?p=1530</guid>
		<description><![CDATA[Sybase announced an asset purchase that amounts to a takeover of CEP (Complex Event Processing) Aleri. Perhaps not coincidentally, Sybase already had technology under the hood from Aleri predecessor/acquiree Coral8, for financial services uses (notwithstanding that between Aleri Classic and Coral8, Aleri Classic was the one of the two more focused on financial services). Quick [...]]]></description>
			<content:encoded><![CDATA[<p>Sybase announced an asset purchase that amounts to a takeover of CEP (Complex Event Processing) <a href="http://www.dbms2.com/2009/03/25/aleri-update/" >Aleri</a>. Perhaps not coincidentally, <a href="http://magmasystems.blogspot.com/2009/03/sybase-and-coral8.html" onclick="javascript:pageTracker._trackPageview('/magmasystems.blogspot.com');">Sybase already had technology under the hood from Aleri predecessor/acquiree Coral8</a>, for financial services uses (notwithstanding that between Aleri Classic and Coral8, Aleri Classic was the one of the two more focused on financial services). Quick reactions include:</p>
<ul>
<li>The folks at Sybase still haven&#8217;t figured out when to prebrief me. <em>(Edit: I&#8217;ve been <a href="http://www.dbms2.com/2010/02/05/sybase-aleri-rap/" >briefed</a> subsequently.)</em></li>
<li>Sybase/Aleri is a potentially powerful combination, if they can effectively address the point I just made about <a href="http://www.dbms2.com/2010/02/01/open-issues-in-database-and-analytic-technology/" >integrating disparate latencies</a>. That said, I&#8217;m not expecting a lot, because <a href="http://www.dbms2.com/2009/03/09/independent-cep-vendors-continue-to-flounder/" >the CEP industry always disappoints me</a>.</li>
<li><a href="http://www.dbms2.com/2009/05/13/microsoft-announced-cep-this-week-too/" >Microsoft</a>, <a href="http://www.dbms2.com/2009/05/13/ibm-system-s-infosphere-streams-processing/" >IBM</a>, and (somewhat less clearly) <a href="http://www.dbms2.com/2008/01/16/oracle-bea/" >Oracle</a> are all trying to do CEP inhouse. Sybase is making a good choice in having serious CEP inhouse itself</li>
<li>Surely the main focus and financial justification for the Sybase/Aleri acquisition is the financial services market.</li>
<li>Specifically, I expect the focus of technical integration between Aleri and Sybase&#8217;s DBMS products to start with Sybase IQ.</li>
<li>Coral8 had <a href="http://www.dbms2.com/2008/10/20/coral8-proposes-cep-as-a-bi-data-platform/" >some interesting ideas about how to integrate CEP with OLTP/operational BI</a>, but I&#8217;m not aware that they got much traction.</li>
<li>I bet there are use cases where Sybase tries and fails to sell <span style="text-decoration: line-through;">Adaptive Server</span> SQL Anywhere that CEP would be a better technical fit, but I don&#8217;t immediately see much practical business significance to that observation.</li>
<li>While this deal could easily strengthen the Vertica/StreamBase partnership, I don&#8217;t see any reason why it would lead those two companies to actually merge.</li>
</ul>
<p><em><strong>Related link</strong></em></p>
<ul>
<li><a href="http://www.dbms2.com/2009/09/10/analytic-speed-latency/" >Thinking about analytic latency</a></li>
</ul>
]]></content:encoded>
			<wfw:commentRss>http://www.dbms2.com/2010/02/04/sybase-aleri-acquisitio/feed/</wfw:commentRss>
		<slash:comments>7</slash:comments>
		</item>
		<item>
		<title>Interesting trends in database and analytic technology</title>
		<link>http://www.dbms2.com/2010/01/31/trends-database-aanalytic-technology/</link>
		<comments>http://www.dbms2.com/2010/01/31/trends-database-aanalytic-technology/#comments</comments>
		<pubDate>Mon, 01 Feb 2010 02:11:17 +0000</pubDate>
		<dc:creator>Curt Monash</dc:creator>
				<category><![CDATA[Analytic technologies]]></category>
		<category><![CDATA[Business intelligence]]></category>
		<category><![CDATA[Data models and architecture]]></category>
		<category><![CDATA[Data warehousing]]></category>
		<category><![CDATA[MapReduce]]></category>
		<category><![CDATA[Memory-centric data management]]></category>
		<category><![CDATA[NoSQL]]></category>
		<category><![CDATA[Parallelization]]></category>
		<category><![CDATA[Presentations]]></category>
		<category><![CDATA[Solid-state memory]]></category>
		<category><![CDATA[Storage]]></category>

		<guid isPermaLink="false">http://www.dbms2.com/?p=1492</guid>
		<description><![CDATA[My project for the day is blogging based on my “Database and analytic technology: State of the union” talk of a few days ago. (I called it that because of when it was given, because it mixed prescriptive and descriptive elements, and because I wanted to call attention to the fact that I cover the [...]]]></description>
			<content:encoded><![CDATA[<p>My project for the day is blogging based on my “<a href="http://www.dbms2.com/2009/11/25/new-england-database-summit-january-28-2010/" >Database and analytic technology: </a><a href="http://www.dbms2.com/2009/11/25/new-england-database-summit-january-28-2010/" >State of the union</a>” talk of a few days ago. (I called it that because of when it was given, because it mixed prescriptive and descriptive elements, and because I wanted to call attention to the fact that I cover the <em>union</em> of database and analytic technologies – the <em>intersection</em> of those two sectors is an area of particular focus, but is far from the whole of my coverage.)</p>
<p>One section covered recent/ongoing/near-future trends that I thought were particularly interesting, including:<span id="more-1492"></span></p>
<p><strong>Simpler database technology,</strong> by which I mean DBMS that are:</p>
<ul>
<li>Easier 	to administer than market-leading systems &#8230;</li>
<li>… even if at the cost of being special-purpose</li>
<li>E.g.,
<ul>
<li>MySQL and older mid-tier RDBMS such as Progress</li>
<li>Many analytic DBMS and appliances, most notably Netezza&#8217;s</li>
</ul>
</li>
</ul>
<p>For general purpose or OLTP uses, I&#8217;m not a big fan of MySQL (not enough progress in making it industrial-strength), PostgreSQL (no good company behind it – I&#8217;m a non-fan of EnterpriseDB), or Ingres (open source or not, it&#8217;s an antiquated system that hasn&#8217;t been invested in as much as Oracle, DB2 or SQL Server).</p>
<p>But I get the impression there are a lot of contenders among small startups, featuring very new architectures for OLTP or general-purpose database management. VoltDB comes to mind. NimbusDB is finally within range of getting funded. Dan Weinreb told me Friday he knows of a bunch of others as well. And that&#8217;s all before we even get into the <a href="http://www.dbms2.com/2009/12/12/legit-nosql-key-value-store/" >NoSQL</a> kind of alternative.</p>
<p><strong>Flexible storage architectures.</strong> That&#8217;s starting out with an emphasis on hybrid columnar, as in the examples of <a href="http://www.dbms2.com/2009/08/04/pax-analytica-row-and-column-stores-begin-to-come-together/" >Vertica</a> and <a href="http://www.dbms2.com/2009/10/14/greenplum-hybrid-columnar/" >Greenplum</a>. Oracle (to whom I&#8217;m under no NDA obligation) and other vendors (to whom I am) are going that way as well.</p>
<p><strong>Multi-tier database architectures,</strong> by which I mean at least two things:</p>
<ul>
<li>The database tier/server tier split of Exadata</li>
<li>Hybrid RAM/disk architectures, examples of which include
<ul>
<li>Vertica&#8217;s RAM-based write-optimized store</li>
<li><a href="http://www.dbms2.com/2009/10/18/introduction-to-sensage/" >Sensage&#8217;s CEP-in-the-DBMS</a></li>
<li>This in-memory analytics stuff we keep hearing about from the BI vendors</li>
<li>Any true in-memory/disk hybrid, such as the regrettably sidelined <a href="http://www.dbms2.com/2007/12/21/ibm-acquires-soliddb/" >solidDB</a></li>
<li>Smart thinking by numerous DBMS vendors about optimizing the use of RAM and/or Level 2 cache</li>
</ul>
</li>
</ul>
<p>Netezza is particularly interesting to watch in this regard because it:</p>
<ul>
<li>Had a pretty strict storage/other processing split in prior product generations and &#8230;</li>
<li>… <a href="http://www.dbms2.com/2009/07/30/netezza-new-product-family/" >ditched that in its latest generation</a> …</li>
<li>… which however is focused on optimizing the use of RAM cache</li>
</ul>
<p>Also noteworthy is Petascan, the stealth-mode –and therefore harder to watch right now <img src='http://www.dbms2.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  – company I keep teasing about, which makes a strong case for carrying the database/storage tier split into the flash/solid-state memory technology generation. <a href="../2009/04/20/calpont-update-you-read-it-here-first/">Calpont</a> also has a server/storage tier split, but that&#8217;s of mainly theoretical interest unless and until Calpont actually ships an MPP version of <a href="../2009/11/07/calponts-infinidb/">InfiniDB</a>.</p>
<p><strong>Cheaper parts,</strong> which have of course been a huge trend for decades.<a href="../2010/01/31/flash-pcmsolid-state-memory-disk/"> Solid-state memory</a> will soon conquer the world. Meanwhile, cheaper sensors drive that <a href="../2010/01/17/three-broad-categories-of-data/">machine-generated data</a> I keep talking about.</p>
<p>An ever-better understanding of <strong>scale-out technology,</strong> in several respects, including:</p>
<ul>
<li>Query, notably data movement for MPP DBMS</li>
<li>Update, especially minimalistic DBMS approaches, be they sharded MySQL or more NoSQLish</li>
<li>Number-crunching, especially via MapReduce and/or parallel analytic libraries integrated into DBMS</li>
</ul>
<p>Cool trends I touched on more briefly include:</p>
<ul>
<li>More data being available for analysis. This was a core theme of my <a href="http://www.dbms2.com/2009/07/30/netezza-enzee-universe/" >Enzee Universe keynote speeches</a>; there are also some notes on it in my 	post based on my <a href="http://www.dbms2.com/2009/11/23/boston-big-data-summit-keynote-outline/" >Boston Big Data Summit</a> talk.</li>
<li>More users being served by analytics. Ditto.</li>
<li>Data exploration/visualization, ala QlikView, Spotfire, or Tableau, and also the faceted stuff.</li>
<li>The democratization of data mining. But I&#8217;m not as sure of that one as of the others&#8230;</li>
</ul>
<p>One area I flat-out forgot to mention is <a href="http://www.dbms2.com/2009/06/08/the-future-of-data-marts/" >easy data mart spin-out</a>.</p>
<p><em><strong>Other posts based on my January, 2010 New England Database Summit keynote address</strong></em></p>
<ul>
<li><a title="Data-based snooping — a huge threat to liberty that we’re all helping make worse" href="../2010/01/31/data-based-snooping-threat-libert/">Data-based snooping — a huge threat to liberty that we’re all helping make worse</a></li>
<li><a title="Flash, other solid-state memory, and disk" href="../2010/01/31/flash-pcmsolid-state-memory-disk/">Flash, other solid-state memory, and disk</a></li>
<li><a title="Open issues in database and analytic technology" href="../2010/02/01/open-issues-in-database-and-analytic-technology/">Open issues in database and analytic technology</a></li>
</ul>
]]></content:encoded>
			<wfw:commentRss>http://www.dbms2.com/2010/01/31/trends-database-aanalytic-technology/feed/</wfw:commentRss>
		<slash:comments>8</slash:comments>
		</item>
		<item>
		<title>Two cornerstones of Oracle’s database hardware strategy</title>
		<link>http://www.dbms2.com/2010/01/22/oracle-database-hardware-strategy/</link>
		<comments>http://www.dbms2.com/2010/01/22/oracle-database-hardware-strategy/#comments</comments>
		<pubDate>Fri, 22 Jan 2010 08:59:23 +0000</pubDate>
		<dc:creator>Curt Monash</dc:creator>
				<category><![CDATA[Cache]]></category>
		<category><![CDATA[DBMS product categories]]></category>
		<category><![CDATA[Data warehouse appliances]]></category>
		<category><![CDATA[Data warehousing]]></category>
		<category><![CDATA[EAI, EII, ETL, ELT, ETLT]]></category>
		<category><![CDATA[Exadata]]></category>
		<category><![CDATA[OLTP]]></category>
		<category><![CDATA[Oracle]]></category>
		<category><![CDATA[Parallelization]]></category>
		<category><![CDATA[Solid-state memory]]></category>
		<category><![CDATA[Storage]]></category>
		<category><![CDATA[Theory and architecture]]></category>

		<guid isPermaLink="false">http://www.dbms2.com/?p=1429</guid>
		<description><![CDATA[After several months of careful optimization, Oracle managed to pick the most inconvenient* day possible for me to get an Exadata update from Juan Loaiza. But the call itself was long and fascinating, with the two main takeaways being:

Oracle      thinks flash memory is the most important hardware technology of the [...]]]></description>
			<content:encoded><![CDATA[<p>After several months of careful optimization, Oracle managed to pick the most inconvenient* day possible for me to get an Exadata update from Juan Loaiza. But the call itself was long and fascinating, with the two main takeaways being:</p>
<ul>
<li>Oracle      thinks <strong>flash memory is the most important hardware technology of the      decade,</strong> one that could lead to Oracle being “bumped off” if they don’t      get it right.</li>
<li>Juan      believes <strong>the “bulk” of Oracle’s business will move over to Exadata-like      technology over the next 5-10 years. </strong>Numbers-wise, this seems to be based more      on Exadata being a platform for consolidating an enterprise’s many Oracle databases than it is on Exadata running a few Especially Big Honking Database      management tasks.</li>
</ul>
<p>And by the way, Oracle doesn’t make its storage-tier software available to run on anything than Oracle-designed boxes.  At the moment, that means Exadata Versions 1 and 2. Since Exadata is by far Oracle’s best DBMS offering (at least in theory), that means <strong>Oracle’s best database offering only runs on specific Oracle-sold hardware platforms.<span id="more-1429"></span></strong> <em></em></p>
<p><em>*E.g., I was sitting upstairs in my parents’ apartment in </em><em>Columbus</em><em>, </em><em>OH</em><em> having the call while their doctor, who I’ve never met, was visiting downstairs. He offered to make a special trip back Saturday afternoon because he missed me Wednesday, but he’s notorious for not coming when he says he will.</em> <em>Update: He didn&#8217;t come Saturday. On Saturday he said he&#8217;d come Sunday. He didn&#8217;t do that either. </em></p>
<p>Other high- and lowlights of our conversation included:</p>
<ul>
<li>Flash      is the main new hardware element in Exadata Version 2. Otherwise, Exadata      2 is just an annual refresh of Exadata Version 1 to include updated      components (Nehalem chips, bigger disk drives, etc.)</li>
<li>Juan      thinks it’s suboptimal to use flash memory through the bottleneck of disk      controllers, favoring PCIe cards instead. (I emphatically agree.)</li>
<li>Juan      resolutely ducked questions about <a href="../../../../../2009/09/25/the-hunt-for-oracle-exadata-production-references/">actual      Exadata production deployment</a>. Literally the only fact he shared in      that regard is that there are at least 2 Exadata production systems      running that each have 2 or more racks cabled together.</li>
<li>Juan      stressed that Exadata runs apps written over Oracle DBMS unchanged.</li>
<li>When      making mixed-workload claims for Exadata 2, Juan stressed consolidation of      multiple databases, some OLTP and some analytic. He didn’t really argue      with my skepticism about <a href="../../../../../2009/09/29/integration-oltp-data-warehousing-exadata-2/">integrating      OLTP and analytics in the same database</a>, with one exception:</li>
<li>Juan      pointed out that in major OLTP apps such as ERP systems, there often is      actually more processing going on in reporting and other batch stuff than      there is in true OLTP.</li>
<li>Exadata      2’s flash memory is designed as a disk cache, smarter than LRU (Least      Recently Used). The two examples Juan gave of “smarter than LRU” are that      backups and table scans don’t flush the cache.</li>
<li>I      forget whether this is new in Exadata 2 (I think it is), but anyhow –      Exadata has a “Storage Index” that’s a lot like a <a href="../../../../../2006/09/20/netezza-vs-conventional-data-warehousing-rdbms/">Netezza      zone map</a>. I.e., for each megabyte or so of data it stores the min and      max value of every column; if a query predicate rules out those ranges,      that megabyte is never retrieved.</li>
<li>Oracle      has long offered what sounds like flexible workload management capability,      and this has now been extended to specifically include I/O resources on      the storage tier.</li>
<li>This      isn’t Exadata-specific, but Oracle has built a file system on top of its      DBMS, optimized for speed, which helps with, e.g., ELT      (Extract/Load/Transform). Evidently, it’s not at all the same thing as      Mark Benioff’s 1990s Microsoft-annoying IFS (Internet File System)      project, which seems to have morphed into a content management SDK.</li>
</ul>
<p>Highlights specifically in the area of parallelization included:</p>
<ul>
<li>Juan      stressed that all databases consolidated onto an Exadata machine      are/should be striped across all storage units.</li>
<li>On the      other hand, Juan said that different databases should be confined to      specific cores or CPUs on the database tier.</li>
<li>But on      the third hand, Juan also stressed – in what could be called a “private      cloud” pitch – that there’s great elasticity as to which databases are      matched to which server CPUs.</li>
<li>Contrary      to what <a href="../../../../../2008/09/28/exadata-oracle-database-machine-parallelization/">I      thought he and/or his colleagues told me a year ago</a>, Juan said RAC      (Real Application Clusters) is a big part of Oracle’s data warehouse      processing.</li>
<li>However,      Juan says that what I regard(ed) as a major objection to Oracle’s      database-tier parallelization &#8212; the need to manually specify “degrees of      parallelism” &#8212; has now been obviated by automation. Juan thinks that few      data warehouse DBAs will now need to manually tune parallelism, with minor      exceptions. One exception he cites is that if a nightly report really is      non-urgent, it can just be forced to run on a single core with no chance      to grab more resources. (However, Juan thinks manual tuning of parallelism      will continue to play a greater role in OLTP.)</li>
</ul>
<p>OK. That’s all I can get done tonight (see above re: inconvenience of timing). Follow-on subjects I’d like to and indeed plan to post about include:</p>
<ul>
<li>What      Juan said about hybrid columnar compression</li>
<li>Oracle’s      delightfully non-confidential slide deck, and a few comments about same</li>
</ul>
]]></content:encoded>
			<wfw:commentRss>http://www.dbms2.com/2010/01/22/oracle-database-hardware-strategy/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
		<item>
		<title>Research agenda for 2010</title>
		<link>http://www.dbms2.com/2009/12/31/research-agenda-for-2010/</link>
		<comments>http://www.dbms2.com/2009/12/31/research-agenda-for-2010/#comments</comments>
		<pubDate>Thu, 31 Dec 2009 22:02:11 +0000</pubDate>
		<dc:creator>Curt Monash</dc:creator>
				<category><![CDATA[About this blog]]></category>
		<category><![CDATA[Analytic technologies]]></category>
		<category><![CDATA[Business intelligence]]></category>
		<category><![CDATA[Data warehousing]]></category>
		<category><![CDATA[Jaspersoft]]></category>
		<category><![CDATA[Memory-centric data management]]></category>
		<category><![CDATA[QlikTech and QlikView]]></category>
		<category><![CDATA[Solid-state memory]]></category>
		<category><![CDATA[Tableau Software]]></category>

		<guid isPermaLink="false">http://www.dbms2.com/?p=1384</guid>
		<description><![CDATA[As you may have noticed, I&#8217;ve been posting less research/analysis in November and December than during some other periods. In no particular order, reasons have included:

Over a 20 week period, I had travel in 13 of them.
3 of those were vacation in November.
As travel finally wound down:

It was time to focus a bit on my [...]]]></description>
			<content:encoded><![CDATA[<p>As you may have noticed, I&#8217;ve been posting less research/analysis in November and December than during some other periods. In no particular order, reasons have included:<span id="more-1384"></span></p>
<ul>
<li>Over a 20 week period, I had travel in 13 of them.</li>
<li>3 of those were vacation in November.</li>
<li>As travel finally wound down:
<ul>
<li>It was time to focus a bit on <a href="http://www.monashreport.com/2009/12/14/our-services-for-technology-vendors/" onclick="javascript:pageTracker._trackPageview('/www.monashreport.com');">my own business</a></li>
<li>Elder care got serious; e.g., my parents went to the hospital on consecutive days, Christmas week, the first one on their 52nd wedding anniversary</li>
<li>Linda and I both got really nasty colds</li>
<li>The holidays were happening</li>
<li>I started helping out a really cool startup company (first time I&#8217;ve taken stock in a private company in years; more on that soon)</li>
<li>There was less industry news going on anyway than in some other recent months</li>
</ul>
</li>
</ul>
<p>But of course I plan to speed up the research/analysis/writing soon. Here, FYI, are a few things I have on my plate.</p>
<p>For a couple of years now, the center of what I&#8217;ve written about has been <strong>high-performance analytic data processing. </strong>You can expect me to keep pursuing that in all its aspects. But there are two specific areas I&#8217;ve identified in which I want to redouble my efforts.</p>
<p>First, almost every BI vendor has an effort in<strong> &#8220;in-memory analytics&#8221;</strong> and/or <strong>&#8220;interactive data exploration.&#8221;</strong> I suspect there&#8217;s a lot of difference in underlying technologies, but I&#8217;m having trouble getting details. QlikTech (the worst foot-dragger of the three), Microstrategy, and Jaspersoft all owe me follow-up conversations with the people who know what&#8217;s going on well enough to explain it. Tableau keeps promising me a briefing and then not delivering. <img src='http://www.dbms2.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  And I&#8217;m even further behind with the behemoth companies &#8212; Oracle, Microsoft, IBM/Cognos (arguably) et al.</p>
<p>Second, <strong>solid-state memory</strong> is coming to data warehousing. The obvious reasons are that it&#8217;s obviously close, and Moore&#8217;s Law still applies to bring it closer. More specific reasons for believing in solid-state include:</p>
<ul>
<li><a href="http://www.dbms2.com/2009/10/25/teradata-hardware-strategy-and-tactics/" >Teradata</a> has made large strides in making solid-state memory useful.</li>
<li>The stealth start-up I mentioned above is poised to make further strides.</li>
<li>(I&#8217;m not totally sure yet about this part) The in-memory analytics mentioned above might wind up working better in solid-state memory than in DRAM.</li>
</ul>
<p>I&#8217;m spending quite a few cycles thinking about this area.</p>
<p>I&#8217;d also like to look further at <strong>analytic applications </strong>and<strong> advanced analytic functionality.</strong> I foreshadowed some of that in my <a href="http://www.dbms2.com/2009/12/02/mapreduce-for-complex-analytics-webina/" >Aster webinars</a>. There&#8217;s some good stuff to talk about at Teradata I should try to write up soon. I need to have a follow-up conversation with fascinating anti-fraud guy I met at Netezza&#8217;s London event. But that&#8217;s all just scratching the surface.</p>
<p>Both the MySQL and PostgreSQL communities are in some disarray. Other non-behemoth <strong>OLTP/general-purpose DBMS </strong> seem to be, at best, thriving niche products. (I see little in the way of innovative new use for, say, Progress, Cache&#8217;, Ingres, or anything multivalue.) But it feels as if there&#8217;s more opportunity out there than is being met. And at a minimum, I&#8217;d like to learn more than the almost nothing I know about <strong>OLTP <a href="http://www.dbms2.com/2009/12/12/legit-nosql-key-value-store/" >NoSQL</a> alternatives.</strong></p>
<p>I&#8217;ve already said that I expect to give an <a href="http://www.dbms2.com/2009/11/25/new-england-database-summit-january-28-2010/" >industry-overview talk</a> at MIT on January 28. I also have an overviewy press article and overviewy white paper under discussion. If those come to fruition, I&#8217;ll of course let you know. <img src='http://www.dbms2.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' /> </p>
<p>Besides the above, I of course have a number of specific posts that I need to get around to researching and writing at some point, often on topics I&#8217;ve already written about before.  Three subjects fairly high on the priority list are scientific data management, machine-generated data, and Oracle Exadata.</p>
<p>And finally, I have some subjects queued up for a couple of my other blogs as well. If you don&#8217;t already take our <a href="http://www.monash.com/blogs.html" onclick="javascript:pageTracker._trackPageview('/www.monash.com');">multi-blog integrated feed</a>, this might be a good time to switch over.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.dbms2.com/2009/12/31/research-agenda-for-2010/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>This and that</title>
		<link>http://www.dbms2.com/2009/12/29/this-and-that/</link>
		<comments>http://www.dbms2.com/2009/12/29/this-and-that/#comments</comments>
		<pubDate>Tue, 29 Dec 2009 09:14:46 +0000</pubDate>
		<dc:creator>Curt Monash</dc:creator>
				<category><![CDATA[Analytic technologies]]></category>
		<category><![CDATA[Columnar database management]]></category>
		<category><![CDATA[Complex event processing (CEP)]]></category>
		<category><![CDATA[Mark Logic]]></category>
		<category><![CDATA[Open source]]></category>
		<category><![CDATA[Oracle]]></category>
		<category><![CDATA[Structured documents]]></category>
		<category><![CDATA[Theory and architecture]]></category>
		<category><![CDATA[Vertica Systems]]></category>

		<guid isPermaLink="false">http://www.dbms2.com/?p=1348</guid>
		<description><![CDATA[I have various subjects backed up that I don&#8217;t really want to write about at traditional blog-post length.  Here are a few of them.
Vertica offers a post on its 3.5 release, with a riff on the popular theme &#8220;We&#8217;ve fixed some weaknesses in our prior versions that we didn&#8217;t previously say we had.&#8221; More important, [...]]]></description>
			<content:encoded><![CDATA[<p>I have various subjects backed up that I don&#8217;t really want to write about at traditional blog-post length.  Here are a few of them.<span id="more-1348"></span></p>
<p><strong>Vertica</strong> offers a post on<a href="http://databasecolumn.vertica.com/database-innovation/vertica-3-5-flexstoretm-the-next-generation-of-column-stores/" onclick="javascript:pageTracker._trackPageview('/databasecolumn.vertica.com');"> its 3.5 release</a>, with a riff on the popular theme &#8220;We&#8217;ve fixed some weaknesses in our prior versions that we didn&#8217;t previously say we had.&#8221; More important, Vertica is pretty clear on the virtues of its <a href="http://www.dbms2.com/2009/08/04/pax-analytica-row-and-column-stores-begin-to-come-together/" >hybrid columnar architecture</a>.</p>
<p>Speaking of which &#8212; <strong>Oracle is going true hybrid columnar</strong> as well. I don&#8217;t have details or timing, however.</p>
<p>Dave Kellogg of <strong>Mark Logic</strong> wrote in to amusedly point out <a href="http://www.oracle.com/technology/tech/xml/xmldb/Current/marklogicserver_4.1_v1.0.pdf" onclick="javascript:pageTracker._trackPageview('/www.oracle.com');" target="_blank"><span style="color: #0000ff;"> </span>Oracle&#8217;s anti-MarkLogic collateral.</a> The very first charge Oracle levies is that MarkLogic goes beyond the emerging XQuery standard to add additional functionality. Considering Oracle&#8217;s approach to SQL standards, I tend to share Dave&#8217;s amusement.</p>
<div><span style="font-family: Calibri,sans-serif; font-size: small;"> </span></div>
<p>Bill Conniff of <a href="http://www.xponentsoftware.com/" onclick="javascript:pageTracker._trackPageview('/www.xponentsoftware.com');">Xponent LLC</a> wrote in to tell of a vastly cheaper and less functional approach to <strong>XML management,</strong> apparently geared to looking at very large XML files one at a time.</p>
<p><strong>Cayuga</strong> is a Cornell research project in complex event processing (CEP). There&#8217;s a <a href="http://www.cs.cornell.edu/bigreddata/cayuga/" onclick="javascript:pageTracker._trackPageview('/www.cs.cornell.edu');">Cayuga academic home page</a>, a Sourceforge page for some <a href="http://sourceforge.net/projects/cayuga/" onclick="javascript:pageTracker._trackPageview('/sourceforge.net');">open source Cayuga CEP code</a>, and so on. Minsheng Hong, writing from a Vertica email address, tipped me off some months ago. The basic idea seems to be to do <em>lots</em> of queries very quickly, rather than a smaller number of queries over and over again. Whether this is an advance in anything but open-sourceness over Apama or Aleri I couldn&#8217;t say, but I do think it&#8217;s a different focus than that of StreamBase or pre-Aleri Coral8.</p>
<p>And finally, editor Doug Henschen listed his <a href="http://intelligent-enterprise.informationweek.com/blog/archives/2009/12/intelligent_ent_2.html;jsessionid=0YRB5UUISPBXLQE1GHRSKH4ATMY32JVN" onclick="javascript:pageTracker._trackPageview('/intelligent-enterprise.informationweek.com');">15 favorite <em>Intelligent Enterprise</em> blog posts of 2009</a> &#8212; four each by Seth Grimes and Doug himself, three by Cindi Howson, two by me,* and one each by Mark Smith and Neil Raden.</p>
<p><em>*Doug selects up to three posts a month from here to republish.</em></p>
]]></content:encoded>
			<wfw:commentRss>http://www.dbms2.com/2009/12/29/this-and-that/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>Introduction to Gooddata</title>
		<link>http://www.dbms2.com/2009/12/27/introduction-to-gooddata/</link>
		<comments>http://www.dbms2.com/2009/12/27/introduction-to-gooddata/#comments</comments>
		<pubDate>Mon, 28 Dec 2009 03:16:30 +0000</pubDate>
		<dc:creator>Curt Monash</dc:creator>
				<category><![CDATA[Amazon and its cloud]]></category>
		<category><![CDATA[Analytic technologies]]></category>
		<category><![CDATA[Business intelligence]]></category>
		<category><![CDATA[Cloud computing]]></category>
		<category><![CDATA[Gooddata]]></category>
		<category><![CDATA[Jaspersoft]]></category>
		<category><![CDATA[Market share]]></category>
		<category><![CDATA[Memory-centric data management]]></category>
		<category><![CDATA[Pricing]]></category>
		<category><![CDATA[Software as a Service (SaaS)]]></category>

		<guid isPermaLink="false">http://www.dbms2.com/?p=1341</guid>
		<description><![CDATA[Around the end of the Cold War, Esther Dyson took it upon herself to go repeatedly to Eastern Europe and do a lot of rah-rah and catalysis, hoping to spark software and other computer entrepreneurs. I don&#8217;t know how many people&#8217;s lives she significantly affected – I&#8217;d guess it&#8217;s actually quite a few – but [...]]]></description>
			<content:encoded><![CDATA[<p style="margin-bottom: 0in;">Around the end of the Cold War, Esther Dyson took it upon herself to go repeatedly to Eastern Europe and do a lot of rah-rah and catalysis, hoping to spark software and other computer entrepreneurs. I don&#8217;t know how many people&#8217;s lives she significantly affected – I&#8217;d guess it&#8217;s actually quite a few – but in any case the number is not zero. Roman Stanek, who has built and sold a couple of software business, cites her as a key influence setting him on his path.</p>
<p style="margin-bottom: 0in;">Roman&#8217;s latest venture is business intelligence firm Gooddata. Gooddata was founded in 2007 and has been soliciting and getting attention for a while, so I was surprised to learn that Gooddata officially launched just a few weeks ago. Anyhow, some less technical highlights of the Gooddata story include:<span id="more-1341"></span></p>
<ul>
<li>Gooddata believes it makes BI easy 	to adopt, unlike every other BI vendor on the planet &#8212; not 	excluding the many other BI vendors who say the same thing about 	themselves. <img src='http://www.dbms2.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' /> </li>
<li>Gooddata is entirely cloud-based, 	specifically in the Amazon cloud.  I.e., Gooddata is selling 	SaaS-based BI.</li>
<li>Gooddata wants to sell to 	enterprises that are large enough to have more than a couple of BI 	users, and small enough not to be well served by the BI market 	leaders.
<ul>
<li>In revenue terms, this is the ever-popular $100 million &#8211; 	$1 billion market.</li>
<li>Specifically, Gooddata believes 	that those enterprises may have decent “back office” BI, but 	don&#8217;t have much in the front office. Gooddata wants to provide them 	with front office BI, which seems to basically mean CRM analytics. 	Gooddata sees this as a market in which QlikTech is the major 	player.  Generally, Gooddata wants to emulate and go after QlikTech.</li>
<li>Even more specifically, Gooddata 	wants to sell to Salesforce.com customers, who it believes are not 	well-served by what passes for built-in analytics at Salesforce. 	Partnering with NetSuite didn&#8217;t work as well, since NetSuite&#8217;s 	customers turn out to be smaller firms than are in Gooddata&#8217;s target 	market.</li>
</ul>
</li>
<li>Something I heard from both 	Jaspersoft and Gooddata is that there&#8217;s a hot market in providing 	cloud-based BI to online gaming companies. I gather these are mainly 	games running on mass communication platforms such as Facebook or 	the iPhone. Surely not coincidentally, it seems likely that:
<ul>
<li>These are small companies whose 	success – and hence data intake – can suddenly explode.</li>
<li>The data originates in cyberspace, 	with no particular need ever to come to the game companies&#8217; own 	premises.</li>
</ul>
</li>
<li>Gooddata has 50 production 	customers.</li>
<li>Gooddata had 2500 “projects” 	at the end of beta in June, and is adding 100 more per month. (Those 	numbers look weird together.) A “project” is a lot like a 	database, with associated reports, security privileges, etc.</li>
<li>Gooddata has close to 40 people, 	mainly in development.</li>
<li>I didn&#8217;t detect much of a sales 	strategy, nor much of a marketing strategy beyond the impressive 	early buzz generation. Perhaps that&#8217;s a partial explanation as to 	why the rate of Gooddata adoption fell even before the company 	officially launched.</li>
<li>I forgot to ask what those 50 	customers were actually paying, but considering Gooddata&#8217;s price 	list, it appears a typical price range for Gooddata&#8217;s stuff would be 	$500-$2,000/month.</li>
</ul>
<p style="margin-bottom: 0in;">Gooddata technical highlights include:</p>
<ul>
<li>Gooddata is building an 	entire BI stack – reporting, dashboards, ETL, in-memory database 	management, everything. I doubt Gooddata would claim that the pieces 	are best-of-breed in many ways other than BI ease of adoption and 	use.</li>
<li>So far I&#8217;ve seen three Gooddata 	ease-of-use features or feature groups that strike me as 	differentiated – <strong>reusability</strong> (of metrics and/or reports), 	<strong>collaboration,</strong> and <strong>tag clouds.</strong> More on those below. 	Gooddata is also building toward an <strong>agility</strong> pitch, but those 	features aren&#8217;t all baked yet.</li>
<li>Gooddata is MySQL-based today, but 	plans to move to a memory-centric compressed column store in 2010. 	Roman doesn&#8217;t reject analogies to SAP&#8217;s <em>BI/BW/whatever 	Accelerator. </em><span style="font-style: normal;">Yes, folks – 	Gooddata is yet another BI vendor doing some form of memory-centric 	OLAP. That&#8217;s a big trend.</span></li>
<li>I&#8217;m guessing 	that a big reason Gooddata is reinventing so many technical wheels 	is to ensure that the Gooddata stack is seamlessly multi-tenant from 	top to bottom. (Hasso Plattner of SAP&#8217;s <a href="../2009/07/07/hasso-plattner-calls-for-in-memory-oltp-column-stores/">comments 	on a similar idea</a> suggest a similar emphasis.)</li>
<li>Gooddata has 	its own multidimensional query language called MAQL (the A doesn&#8217;t 	seem to stand for anything). Today MAQL generates SQL for MySQL. The 	future columnar memory-centric data store will &#8212; I think – 	understand MAQL natively.</li>
</ul>
<p style="margin-bottom: 0in;">Now we get to the good stuff. When I wrote about <a href="../2009/05/30/reinventing-business-intelligence/">reinventing business intelligence</a> back in May, I focused on some interesting developments I see as actually underway &#8212; at least on an experimental basis and/or from small vendors – namely:</p>
<ul>
<li><strong>Text-search interfaces. </strong>Well, 	while I didn&#8217;t see true text search in the Gooddata demo, I did see 	tag clouds, which have some of the same benefits.</li>
<li><strong>Collaboration tools.</strong> Well, 	Gooddata has a nice-looking approach to BI collaboration, heavily 	reflected in its UI metaphors. (That said, I haven&#8217;t really compared 	Gooddata to Microsoft SharePoint or SAP&#8217;s Portal/Rooms/whatever.)</li>
<li><strong>Memory-centric analytics</strong> (for speed of exploration). As noted above, Gooddata has that coming 	soon.</li>
<li><strong>Data exploration that tries to 	ignore fixed relational schemas,</strong> ala Attivio or Splunk.  Roman 	says Gooddata is interested in or working on that, but offers no 	timetable.</li>
</ul>
<p style="margin-bottom: 0in;">Meanwhile, something I&#8217;ve been seeking for years, but haven&#8217;t seen much progress on since enhancement stopped on Cognos Metrics Manager, is more <a href="../2007/11/13/the-key-problem-with-dashboard-functionality/">user-friendly metrics management</a>.  Well, it doesn&#8217;t have a lot of bells and whistles, but at least Gooddata has the basics – a list of already-defined metrics, and a reasonable way of compounding them into other metrics. I think that kind of thing will be a major BI feature going forward, to the point that a few years from now we&#8217;ll be worrying about how to port them from one BI vendor&#8217;s tool from another.</p>
<p style="margin-bottom: 0in;"><strong>Bottom line: If you&#8217;re interested in BI, you should look at a Gooddata demo.</strong></p>
<p style="margin-bottom: 0in;">
<p style="margin-bottom: 0in;">
<p style="margin-bottom: 0in;">
]]></content:encoded>
			<wfw:commentRss>http://www.dbms2.com/2009/12/27/introduction-to-gooddata/feed/</wfw:commentRss>
		<slash:comments>12</slash:comments>
		</item>
	</channel>
</rss>
