<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>DBMS 2 : DataBase Management System Services &#187; Infobright</title>
	<atom:link href="http://www.dbms2.com/category/products-and-vendors/infobright-brighthouse/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.dbms2.com</link>
	<description>Choices in data management and analysis</description>
	<lastBuildDate>Thu, 02 Sep 2010 09:06:44 +0000</lastBuildDate>
	<generator>http://wordpress.org/?v=2.8.4</generator>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
			<item>
		<title>Infobright&#8217;s Release 3.4</title>
		<link>http://www.dbms2.com/2010/06/27/infobright-release-3-4/</link>
		<comments>http://www.dbms2.com/2010/06/27/infobright-release-3-4/#comments</comments>
		<pubDate>Sun, 27 Jun 2010 15:09:41 +0000</pubDate>
		<dc:creator>Curt Monash</dc:creator>
				<category><![CDATA[Data warehousing]]></category>
		<category><![CDATA[Infobright]]></category>
		<category><![CDATA[MySQL]]></category>
		<category><![CDATA[Workload management]]></category>

		<guid isPermaLink="false">http://www.dbms2.com/?p=2413</guid>
		<description><![CDATA[Infobright called a couple weeks ago to discuss, among other subjects, its subsequently-released Infobright Release 3.4. I made no effort to distinguish between community/open source and professional/chargeable editions, but leaving that aside, it seems fair to characterize Infobright 3.4 as having two overlapping primary themes:

Performance and bottleneck 	cleanup.
“Omigod, you 	mean you didn&#8217;t have that feature [...]]]></description>
			<content:encoded><![CDATA[<p style="margin-bottom: 0in;">Infobright called a couple weeks ago to discuss, among other subjects, its subsequently-released Infobright Release 3.4. I made no effort to distinguish between community/open source and professional/chargeable editions, but leaving that aside, it seems fair to characterize Infobright 3.4 as having two overlapping primary themes:</p>
<ul>
<li>Performance and <a href="http://www.dbms2.com/2009/08/21/bottleneck-whack-a-mole/" >bottleneck 	cleanup</a>.</li>
<li>“Omigod, you 	mean you didn&#8217;t have that feature before?” cleanup.</li>
</ul>
<p style="margin-bottom: 0in; font-style: normal;">That said, the traditional release for cleaning up the last huge gaps in an analytic DBMS product seems have become 4.0; recent examples include <a href="http://www.dbms2.com/2009/10/30/aster-data-application-server-ncluster/" >Aster Data</a>, <a href="http://www.dbms2.com/2010/02/22/vertica-4/" >Vertica</a> and <a href="http://www.dbms2.com/2010/04/12/greenplumchorus/" >Greenplum</a>. Infobright seems on track to be another example of that rule.</p>
<p style="margin-bottom: 0in; font-style: normal;"><em>Ack. Now that I&#8217;ve said that, other vendors are going to be tempted to accelerate their numbering so as to reach the 4.0 mark sooner &#8230;</em></p>
<p style="margin-bottom: 0in;">A lot of Infobright performance enhancements are in the vein “We used to rely on generic MySQL for that, but now we do it ourselves, and it works a lot better.” Examples include:  <span id="more-2413"></span></p>
<ul>
<li>Infobright now does DELETEs all at 	once, vs. the previous row-by-row way. This makes DELETE performance 	similar to SELECT performance, when previously there was a big 	difference.</li>
<li>Ditto, if I understood correctly, 	INSERTs and UPDATEs.</li>
<li>Each release, Infobright covers 	more SQL functionality itself and passes less through to the generic 	MySQL engine.</li>
<li>UTF-8 Unicode data can now be 	loaded via Infobright&#8217;s parallel loader. Previously, you had to use 	MySQL&#8217;s load.</li>
</ul>
<p style="margin-bottom: 0in;">Infobright has also added workload management in 3.4, and this is intertwined with multicore parallelization, apparently because the workload manager decides when a query should use multiple cores to execute. Infobright further says that multi-user INSERT performance has increased a lot more than single-user, but I have forgotten why that is.</p>
<p style="margin-bottom: 0in;">Infobright now streams data back to the client faster. E.g., unless there&#8217;s some good reason not to, partial query results are pipelined back as they become available.* Finally, loading data no longer locks tables from being read (in my notes I wasn&#8217;t sure whether that was a current or future feature, but Infobright&#8217;s marketing seems to indicate it&#8217;s current). For some reason, Infobright is positioning this as a major, innovative feature.</p>
<p style="margin-bottom: 0in;"><em>*A good reason not to do this would be an aggregate that requires full materialization of the table before Infobright can carry it out.</em></p>
<p style="margin-bottom: 0in;">
<p style="margin-bottom: 0in;">
]]></content:encoded>
			<wfw:commentRss>http://www.dbms2.com/2010/06/27/infobright-release-3-4/feed/</wfw:commentRss>
		<slash:comments>6</slash:comments>
		</item>
		<item>
		<title>Algebraix</title>
		<link>http://www.dbms2.com/2010/06/05/algebraix/</link>
		<comments>http://www.dbms2.com/2010/06/05/algebraix/#comments</comments>
		<pubDate>Sat, 05 Jun 2010 08:54:59 +0000</pubDate>
		<dc:creator>Curt Monash</dc:creator>
				<category><![CDATA[Algebraix]]></category>
		<category><![CDATA[Data warehousing]]></category>
		<category><![CDATA[Database compression]]></category>
		<category><![CDATA[Infobright]]></category>
		<category><![CDATA[Theory and architecture]]></category>

		<guid isPermaLink="false">http://www.dbms2.com/?p=2227</guid>
		<description><![CDATA[I talked Friday with Chris Piedemonte and Gary Sherman, respectively the Cofounder/CTO and Chief Mathematician of Algebraix, who hooked up together for this project back in 2003 or 2004. (Algebraix is the company formerly known as XSPRADA.) Algebraix makes an analytic DBMS, somewhat based on the ideas of extended set theory, that runs on SMP [...]]]></description>
			<content:encoded><![CDATA[<p style="margin-bottom: 0in;">I talked Friday with Chris Piedemonte and Gary Sherman, respectively the Cofounder/CTO and Chief Mathematician of Algebraix, who hooked up together for this project back in 2003 or 2004. (Algebraix is the company formerly known as XSPRADA.) Algebraix makes an analytic DBMS, somewhat based on the ideas of <a href="../2010/06/05/extended-set-theory/">extended set theory</a>, that runs on SMP (Symmetric MultiProcessing) boxes. Like all analytic DBMS vendors, Algebraix has on some occasions run some queries orders of magnitude faster than they ran on the systems users were looking to replace.</p>
<p style="margin-bottom: 0in;">Algebraix&#8217;s secret sauce is that the DBMS keeps reorganizing and recopying the data on disk, to optimize performance in response to expected query patterns (automatically inferred from queries it&#8217;s seen so far). This sounds a lot like the Infobright story, with some of the more obvious differences being:  <span id="more-2227"></span></p>
<ul>
<li>Infobright has fixed data 	structures, with what serve for indexes and precomputed aggregates 	added on the fly. Algebraix apparently reorganizes everything, 	including data partitions. (I also presume that Algebraix&#8217;s indexes 	and aggregates look rather different than Infobright&#8217;s data packs.)</li>
<li>Infobright compresses data 	aggressively. Algebraix doesn&#8217;t yet compress.</li>
<li>Infobright&#8217;s product is presumably 	much more mature.</li>
</ul>
<p style="margin-bottom: 0in;">So far as I can tell, that&#8217;s about it. Experience teaches that when a small company claims that some big mathematical breakthrough lets it have huge product superiority in DBMS or analytic tools, the claim doesn&#8217;t pan out. Maybe the company has good technology somewhat inspired by the mathematics, but the more breathless claims are always overwrought. Examples include Infobright (<a href="../2007/10/22/infobright-brighthouse-mysql/">rough set theory</a>), CrossZ/QueryObject (fractals), and KXEN (<a href="http://www.monashreport.com/2006/10/04/kxen-and-verix-try-to-disrupt-the-data-mining-market/" onclick="javascript:pageTracker._trackPageview('/www.monashreport.com');">support vector machines</a>). Algebraix (extended set theory) shows every sign of being another such case. An extended discussion about whether or not join operations are commutative – without benefit of any examples of their supposed non-commutativity – did little to convince me otherwise.</p>
<p style="margin-bottom: 0in;">Finally, as per email from diligent PR guy David King,</p>
<ul>
<li>Algebraix has 22 employees in 	Austin and at its HQ in San Diego.</li>
<li>Algebraix&#8217;s biggest customer is 	the Department of Defense (through BAE Systems).</li>
<li>Algebraix&#8217;s product just became 	commercially available on March 1.</li>
<li>Algebraix has completed one round 	of funding for $11 million.</li>
<li>Algebraix is 100% angel-backed.</li>
<li>Algebraix&#8217;s target market is “any 	mid-large sized organization that needs rapid access to analytics on 	large volumes of data.”</li>
</ul>
]]></content:encoded>
			<wfw:commentRss>http://www.dbms2.com/2010/06/05/algebraix/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
		<item>
		<title>Infobright blog update</title>
		<link>http://www.dbms2.com/2010/03/19/infobright-blog-update/</link>
		<comments>http://www.dbms2.com/2010/03/19/infobright-blog-update/#comments</comments>
		<pubDate>Fri, 19 Mar 2010 13:42:01 +0000</pubDate>
		<dc:creator>Curt Monash</dc:creator>
				<category><![CDATA[Columnar database management]]></category>
		<category><![CDATA[Data mart outsourcing]]></category>
		<category><![CDATA[Data warehousing]]></category>
		<category><![CDATA[Infobright]]></category>
		<category><![CDATA[Log analysis]]></category>
		<category><![CDATA[Market share]]></category>
		<category><![CDATA[Open source]]></category>
		<category><![CDATA[Web analytics]]></category>

		<guid isPermaLink="false">http://www.dbms2.com/?p=1733</guid>
		<description><![CDATA[I often offer that, if a company puts up a sufficiently good blog post, I&#8217;ll link to it. Well, I just noticed that Infobright CEO Mark Burton (somewhere along the way he seems to have dropped the “interim”) put up an excellent post last month.
Highlights on the market share/sector side include:

Infobright’s customer base grew 500% [...]]]></description>
			<content:encoded><![CDATA[<p style="margin-bottom: 0in;">I often offer that, if a company puts up a sufficiently good blog post, I&#8217;ll link to it. Well, I just noticed that Infobright CEO Mark Burton <span style="text-decoration: line-through;">(somewhere along the way he seems to have dropped the “interim”)</span> put up <a href="http://www.infobright.com/Blog/Entry/infobright_strategy_and_plans" onclick="javascript:pageTracker._trackPageview('/www.infobright.com');">an excellent post</a> last month.</p>
<p style="margin-bottom: 0in;">Highlights on the market share/sector side include:<span id="more-1733"></span></p>
<ul>
<li>Infobright’s customer base grew 500% over the past year, to 	120 paying customers.</li>
<li>This included end users (60%), as well as ISVs and SaaS 	providers (40%) who embed Infobright&#8217;s DBMS in their application.</li>
<li>During the same period, Infobright&#8217;s open source software was 	downloaded 35,000 times.</li>
<li>The end user applications were heavily clustered around web 	and online analytics tracking, with a focus on understanding 	customer behavior on the web.</li>
<li>Infobright also continues to see the growth of 	application-specific data marts.</li>
<li>There is also continued interest and growth in using 	Infobright technology to analyze IT logs and telecom CDR (Call 	Detail Record) data, to identify fraud or security issues, to 	understand and improve network performance, and other purposes.</li>
</ul>
<p>Product highlights include:</p>
<ul>
<li>Infobright be much more transparent in 2010 about its plans.</li>
<li>Infobright will start posting and commenting on future 	releases and themes in March of this year. (However, they haven&#8217;t 	run much of that by me yet, and we&#8217;re past the middle of March.)</li>
<li>Infobright expects to drop 3-4 interim releases for every 	major release, with at least two major releases in 2010.</li>
<li>Some of Infobright&#8217;s major improvements this year will be:
<ul>
<li>Continued SMP performance improvements “without the need 	for complex hardware configurations or administrative effort”.</li>
<li>Extending the “hit rate” of the Knowledge Grid, which is 	central to Infobright&#8217;s performance story.</li>
<li>Better international support with UTF-8 extensions.</li>
</ul>
</li>
</ul>
]]></content:encoded>
			<wfw:commentRss>http://www.dbms2.com/2010/03/19/infobright-blog-update/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Intelligent Enterprise’s Editors’/Editor’s Choice list for 2010</title>
		<link>http://www.dbms2.com/2010/02/11/intelligent-enterprise-editors-choice-201/</link>
		<comments>http://www.dbms2.com/2010/02/11/intelligent-enterprise-editors-choice-201/#comments</comments>
		<pubDate>Thu, 11 Feb 2010 23:13:42 +0000</pubDate>
		<dc:creator>Curt Monash</dc:creator>
				<category><![CDATA[Analytic technologies]]></category>
		<category><![CDATA[Aster Data]]></category>
		<category><![CDATA[Business intelligence]]></category>
		<category><![CDATA[Cloudera]]></category>
		<category><![CDATA[Data warehousing]]></category>
		<category><![CDATA[Greenplum]]></category>
		<category><![CDATA[HP and Neoview]]></category>
		<category><![CDATA[IBM and DB2]]></category>
		<category><![CDATA[Infobright]]></category>
		<category><![CDATA[Ingres]]></category>
		<category><![CDATA[Intersystems and Cache']]></category>
		<category><![CDATA[Jaspersoft]]></category>
		<category><![CDATA[Kalido]]></category>
		<category><![CDATA[MarkLogic]]></category>
		<category><![CDATA[Microsoft and SQL*Server]]></category>
		<category><![CDATA[Netezza]]></category>
		<category><![CDATA[Open source]]></category>
		<category><![CDATA[Oracle]]></category>
		<category><![CDATA[Pentaho]]></category>
		<category><![CDATA[QlikTech and QlikView]]></category>
		<category><![CDATA[SAP AG]]></category>
		<category><![CDATA[Tableau Software]]></category>
		<category><![CDATA[Talend]]></category>
		<category><![CDATA[Teradata]]></category>
		<category><![CDATA[Vertica Systems]]></category>

		<guid isPermaLink="false">http://www.dbms2.com/?p=1578</guid>
		<description><![CDATA[As he has before, Intelligent Enterprise Editor Doug Henschen

Personally selected annual lists of 12 &#8220;Most influential&#8221; companies and 36 &#8220;Companies to watch&#8221; in analytics- and database-related sectors.
Made it clear that these are his personal selections.
Nonetheless has called it an Editors&#8217; Choice list, rather than Editor&#8217;s Choice.  

(Actually, he&#8217;s really called it an &#8220;award.&#8221;)
People advising [...]]]></description>
			<content:encoded><![CDATA[<p>As he has <a href="http://www.dbms2.com/2009/01/12/intelligent-enterprises-editorseditors-choice-list/" >before</a>, <em>Intelligent Enterprise</em> Editor Doug Henschen</p>
<ul>
<li>Personally selected <a href="http://intelligent-enterprise.informationweek.com/showArticle.jhtml;jsessionid=IANLOXCT2244BQE1GHPCKH4ATMY32JVN?articleID=222900034&amp;pgno=1" onclick="javascript:pageTracker._trackPageview('/intelligent-enterprise.informationweek.com');">annual lists</a> of 12 &#8220;Most influential&#8221; companies and 36 &#8220;Companies to watch&#8221; in analytics- and database-related sectors.</li>
<li>Made it clear that these are his personal selections.</li>
<li>Nonetheless has called it an Editors&#8217; Choice list, rather than Editor&#8217;s Choice. <img src='http://www.dbms2.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' /> </li>
</ul>
<p>(Actually, he&#8217;s really called it an &#8220;award.&#8221;)</p>
<p><span id="more-1578"></span>People advising Doug &#8212; who come to think of it actually are Contributing Editors to <em>Intelligent Enterprise</em> or something like that &#8212; included Cindi Howson, Seth Grimes, three others, and me.</p>
<p>And if past is prologue, I will now get a flood of PR emails calling my attention to this award that I already have both participated in and blogged about. <img src='http://www.dbms2.com/wp-includes/images/smilies/icon_wink.gif' alt=';)' class='wp-smiley' /> </p>
<p>As usual, the sense:nonsense ratio on these lists was pleasingly high. Analytic DBMS vendors cited included IBM, Microsoft, Netezza, Oracle, Sybase, and Teradata in the &#8220;Most influential&#8221; group, with Aster, Greenplum, HP, Infobright, and Vertica among the &#8220;To watch&#8221; crowd. It&#8217;s tough to argue with those selections, whose most questionable element is probably the not-ridiculous supposition that HP could do something interesting over the coming year. Cloudera and Intersystems also made the list, deservedly.</p>
<p>All three of QlikTech, Tableau, and TIBCO made the list, which is appropriate given the potential for and interest in interactive data exploration technology.  The BI majors, independent or otherwise, were all on as well. In text mining, Doug included Attensity and Clarabridge, which I think is exactly right. (Plus OpenCalais.)  Upon reflection, I probably should have nominated Mark Logic, even though most of its business is non-enterprise; but hey, nobody&#8217;s perfect, and the same goes for lists. Open source was well represented, with Apache, Actuate, Jaspersoft, Eclipse, Infobright, Nuxeo and R all being cited (but not Ingres or Pentaho). Kalido made the list, with my endorsement, their silly I-CASE like marketing messaging notwithstanding.</p>
<p>Speaking of imperfections &#8212; there only are a few category names, and so category assignments can be pretty bizarre. (In an ideal world, middleware wouldn&#8217;t be included under &#8220;enterprise applications&#8221;.) Greenplum hasn&#8217;t really &#8220;extended&#8221; its DBMS with a &#8220;cloud&#8221; option. As much as I&#8217;d like Netezza to be more influential than SAP, that&#8217;s probably not the best way to rank them. And there are a number of &#8220;This company is on a roll!&#8221; kinds of comments that I wouldn&#8217;t necessarily endorse.</p>
<p>But those are all nitpicks. On the whole, it&#8217;s another nice job.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.dbms2.com/2010/02/11/intelligent-enterprise-editors-choice-201/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>Comments on the Gartner 2009/2010 Data Warehouse Database Management System Magic Quadrant</title>
		<link>http://www.dbms2.com/2010/02/10/gartner-magic-quadrant-data-warehouse-2009-2010/</link>
		<comments>http://www.dbms2.com/2010/02/10/gartner-magic-quadrant-data-warehouse-2009-2010/#comments</comments>
		<pubDate>Wed, 10 Feb 2010 23:28:39 +0000</pubDate>
		<dc:creator>Curt Monash</dc:creator>
				<category><![CDATA[Analytic technologies]]></category>
		<category><![CDATA[Aster Data]]></category>
		<category><![CDATA[Data warehouse appliances]]></category>
		<category><![CDATA[Data warehousing]]></category>
		<category><![CDATA[Exadata]]></category>
		<category><![CDATA[Greenplum]]></category>
		<category><![CDATA[HP and Neoview]]></category>
		<category><![CDATA[IBM and DB2]]></category>
		<category><![CDATA[Infobright]]></category>
		<category><![CDATA[Ingres]]></category>
		<category><![CDATA[Market share]]></category>
		<category><![CDATA[Netezza]]></category>
		<category><![CDATA[Open source]]></category>
		<category><![CDATA[Oracle]]></category>
		<category><![CDATA[Pricing]]></category>
		<category><![CDATA[Sybase]]></category>
		<category><![CDATA[Teradata]]></category>
		<category><![CDATA[illuminate Solutions]]></category>

		<guid isPermaLink="false">http://www.dbms2.com/?p=1553</guid>
		<description><![CDATA[At intervals of little over a year, Gartner Group publishes a Data Warehouse Database Management System Magic Quadrant. Gartner&#8217;s 2009 data warehouse DBMS Magic Quadrant &#8212; actually, January 2010 &#8212; is now out.* For many reasons, including those I noted in my comments on Gartner&#8217;s 2008 Data Warehouse DBMS Magic Quadrant, the Gartner quadrant pictures [...]]]></description>
			<content:encoded><![CDATA[<p>At intervals of little over a year, Gartner Group publishes a Data Warehouse Database Management System Magic Quadrant. <a href="http://www.gartner.com/technology/media-products/reprints/greenplum/173535.html" onclick="javascript:pageTracker._trackPageview('/www.gartner.com');">Gartner&#8217;s 2009 data warehouse DBMS Magic Quadrant</a> &#8212; actually, January 2010 &#8212; is now out.* For many reasons, including those I noted in <a href="http://www.dbms2.com/2009/01/12/gartners-2008-data-warehouse-database-management-system-magic-quadrant-is-out/" >my comments on Gartner&#8217;s 2008 Data Warehouse DBMS Magic Quadrant</a>, the Gartner quadrant pictures are a bad use of good research. Rather than rehash that this year, I&#8217;ll merely call out some points in the surrounding commentary that I find interesting or just plain strange.<span id="more-1553"></span></p>
<p><em>*Links to Gartner Magic Quadrants commonly break, but that one worked at the time of this posting.</em></p>
<ul>
<li>Gartner thinks that data warehouse appliances are on the rise, due to their simplicity.</li>
<li>Gartner correctly says that <a href="http://www.softwarememories.com/2008/09/15/database-machines/" onclick="javascript:pageTracker._trackPageview('/www.softwarememories.com');">Teradata has been a data warehouse appliance vendor from the getgo</a>.</li>
<li>Gartner characterizes IBM as being an appliance vendor as well.</li>
<li>Gartner suggests that HP is having trouble living up to its technical promises for Neoview.</li>
<li>Gartner further suggests &#8212; no surprise here &#8212; that HP Neoview has had very few new customers past its initial wave.</li>
<li>Gartner notes IBM&#8217;s difficulties in selling data warehouse installations of DB2, despite what on paper is great-sounding technology.</li>
<li>Gartner says &#8212; also no surprise &#8212; that illuminate &#8220;has seen little success in North America since opening its first office in the U.S. over two years ago.&#8221;</li>
<li>Ingres has evidently gotten a few BI-centric &#8220;appliance&#8221; deals, e.g. with Jaspersoft. But basically Ingres isn&#8217;t doing well in data warehousing.</li>
<li>Gartner does say Ingres has &#8220;the strongest open-source DBMS offering for data warehousing.&#8221; Being very literal about &#8220;open source,&#8221; that&#8217;s a defensible claim &#8212; but it&#8217;s pretty irrelevant in a world where <a href="http://www.dbms2.com/2009/10/19/greenplum-free-single-node-edition/" >Greenplum Single-Node Edition</a> can be had for free. It also waves away all the data mart use cases in which Infobright Community Edition shines.</li>
<li>Gartner says that Netezza is working out as a &#8220;complex workload&#8221; enterprise data warehouse provider, according to reference checks, in addition to its established success in data mart scenarios.</li>
<li>Gartner says Oracle&#8217;s offering has finally become &#8220;accepted&#8221; in the market for databases &gt;50 TB. I guess I can live with that fairly weak claim, but <a href="http://www.dbms2.com/2009/09/19/oracle-database-siz/" >I wouldn&#8217;t go much further than that</a>.</li>
<li>Gartner asserts that, unlike software-only Oracle, Oracle Exadata isn&#8217;t significantly harder to administer than &#8220;other mixed OLTP/OLAP DBMS vendors,&#8221; because Exadata is fast enough you don&#8217;t need to jump through all those hoops any more to get tolerable performance. The money quote is &#8220;one reference reported reducing the number of indexes by a factor of 100 to fewer than five.&#8221; Note, however, that Gartner does not seem to assert that Exadata&#8217;s ease of use rivals that of the newer analytic DBMS specialists.</li>
<li>Gartner confirms <a href="http://www.dbms2.com/2009/02/01/oracle-says-they-do-onsite-exadata-pocs-after-all/" >Oracle&#8217;s reluctance to do onsite Exadata POCs</a>, but says it is not absolute. This is roughly compatible with what I&#8217;m hearing elsewhere, and indeed with Oracle own claims to be ramping up availability of Exadata POC hardware.</li>
<li>Gartner&#8217;s criteria for inclusion include at least 10 different organizations having a product &#8220;in production.&#8221; Thus, the big surprise was ParAccel being included. The money quote there is &#8220;With approximately 20 customers in the pharmaceutical, retail, financial and media/advertising analytics sectors, ParAccel has a good reference base.&#8221; That assessment is difficult to reconcile with other information, but I&#8217;ve been told Gartner is sticking to its guns. That assessment would be even harder to believe if those 20 references were all alleged to be true production customers.</li>
<li>Gartner notes that you basically can&#8217;t run a 1 TB+ MySQL data warehouse without sharding. (Of course, Infobright has an alternative, and up to a small number of terabytes so does Kickfire.)</li>
<li>Gartner reports that at least some customers are pleased with Sybase IQ&#8217;s mixed workload/enterprise data warehouse capabilities.</li>
<li>Gartner correctly notes that <a href="http://www.dbms2.com/2009/10/05/oracle-exadata-2-capacity-pricing/" >Oracle Exadata is a price-competition challenge for Teradata</a>.</li>
<li>Gartner notes that 20% of Vertica&#8217;s customers are outside the US. While not shocking, that&#8217;s more than I realized.</li>
<li>Gartner notes something I don&#8217;t think I&#8217;ve posted yet, which is that Vertica has a customer with 300 TB of data. (The identity is a deep dark secret, but if I told you you probably wouldn&#8217;t recognize the name anyway.)</li>
</ul>
<p>As does any such piece, the Gartner Data Warehouse DBMS Magic Quadrant also has outright errors.  For example:</p>
<ul>
<li>Aster Data isn&#8217;t really &#8220;the newest entrant to the DBMS data warehouse world.&#8221;</li>
<li>Aster&#8217;s SQL/MapReduce was not new in Release 4.0.</li>
<li>Greenplum isn&#8217;t yet pushing down code to the storage tier.</li>
<li>I&#8217;m not sure what kind of database-tier parallelism Gartner is claiming is new in Oracle in 11g Release 2 &#8212; but I doubt it&#8217;s really new. Rather, what Oracle has done recently is <a href="http://www.dbms2.com/2010/01/22/oracle-database-hardware-strategy/" >make parallelism less administratively cumbersome</a>.</li>
<li>Vertica wasn&#8217;t really the first DBMS in the cloud. At most it was the first pure-play analytic DBMS to get there.</li>
</ul>
]]></content:encoded>
			<wfw:commentRss>http://www.dbms2.com/2010/02/10/gartner-magic-quadrant-data-warehouse-2009-2010/feed/</wfw:commentRss>
		<slash:comments>5</slash:comments>
		</item>
		<item>
		<title>More miscellany</title>
		<link>http://www.dbms2.com/2009/12/30/more-miscellany/</link>
		<comments>http://www.dbms2.com/2009/12/30/more-miscellany/#comments</comments>
		<pubDate>Wed, 30 Dec 2009 11:38:22 +0000</pubDate>
		<dc:creator>Curt Monash</dc:creator>
				<category><![CDATA[Continuent]]></category>
		<category><![CDATA[Infobright]]></category>
		<category><![CDATA[Rainstor]]></category>
		<category><![CDATA[Software as a Service (SaaS)]]></category>

		<guid isPermaLink="false">http://www.dbms2.com/?p=1370</guid>
		<description><![CDATA[Adding to yesterday&#8217;s varied quick comments:
Robert Hodges of Continuent offers a great outline of Continuent&#8217;s clustering story, with a lot of &#8220;Now we got right what we previously didn&#8217;t know/admit we got wrong.&#8221; Continuent now claims to have a strong clustering offering, both paid and free/open-source, for both MySQL and PostgreSQL, with Oracle support perhaps [...]]]></description>
			<content:encoded><![CDATA[<p>Adding to <a href="http://www.dbms2.com/2009/12/29/this-and-that/" >yesterday&#8217;s varied quick comments</a>:<span id="more-1370"></span></p>
<p><a href="http://www.dbms2.com/2009/09/03/continuent-on-clustering/" >Robert Hodges</a> of <strong>Continuent</strong> offers <a href="http://scale-out-blog.blogspot.com/2009/12/proving-masterslave-clusters-work-and.html" onclick="javascript:pageTracker._trackPageview('/scale-out-blog.blogspot.com');">a great outline of Continuent&#8217;s clustering story</a>, with a lot of &#8220;Now we got right what we previously didn&#8217;t know/admit we got wrong.&#8221; Continuent now claims to have a strong <strong>clustering</strong> offering, both paid and free/open-source, for both MySQL and PostgreSQL, with Oracle support perhaps coming really soon.</p>
<p>Merv Adrian, who has <a href="http://www.dbms2.com/2009/06/22/the-tpc-h-benchmark-is-a-blight-upon-the-industry/" >overrated the importance of TPC benchmarks</a> in the past, seems to have become more <a href="http://mervadrian.wordpress.com/2009/12/23/additional-caveats-obscure-oracles-tpc-benchmark/" onclick="javascript:pageTracker._trackPageview('/mervadrian.wordpress.com');">skeptical</a>.</p>
<p>Interim CEO <a href="http://www.infobright.com/Blog/ceo_blog" onclick="javascript:pageTracker._trackPageview('/www.infobright.com');">Mark Burton</a> laid out<strong> Infobright&#8217;s focus</strong> pretty clearly when he took over:</p>
<blockquote><p><span style="letter-spacing: 0px;"> &#8230; the focus must be in building products that fit market segments where ease-of-use and easily attainable performance are valued.  This doesn’t sound like the high end of Data Warehousing to me where highly complex MPP architectures and teams of DBAs spend their time.  It sounds like the realm of Departmental IT and SMB where business leaders are in a hurry to gain access to data and answers without the lead time and pain of complex architectures and high costs.</span></p></blockquote>
<p><span style="letter-spacing: 0px;">I&#8217;m hearing about a <strong>SaaS focus</strong> from a lot of companies. The Continuent link above mentions one. So does <a href="http://www.rainstor.com/news-blog/news/users-demand-saas-data-escrow-services" onclick="javascript:pageTracker._trackPageview('/www.rainstor.com');">RainStor&#8217;s latest blog post</a>. <a href="http://www.dbms2.com/2009/12/27/introduction-to-gooddata/" >Gooddata</a>, a SaaS vendor itself, seems focused on analyzing data that was originally created via SaaS. I haven&#8217;t talked with Cast Iron or Pervasive for a while, but when I did, their ETL market targeting was <a href="http://www.dbms2.com/2008/03/21/cast-iron-systems-focuses-on-saas-data-integration/" >all about SaaS</a>. And of course, I hear dumber SaaS-focus ideas as well. I think the biggest substantive reason for this trend is &#8212; i</span><span style="letter-spacing: 0px;">f you don&#8217;t have the broadest feature set, and fear large enterprises therefore won&#8217;t want your stuff, going after SMBs makes sense. And SMBs are presumed to be going SaaS. Also in the mix, of course, are a single platform to support, a small number of large SaaS vendors to sell to or partner with, and/or general trendiness.<br />
</span></p>
<p><span style="letter-spacing: 0px;"><br />
</span></p>
]]></content:encoded>
			<wfw:commentRss>http://www.dbms2.com/2009/12/30/more-miscellany/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>Calpont&#8217;s InfiniDB</title>
		<link>http://www.dbms2.com/2009/11/07/calponts-infinidb/</link>
		<comments>http://www.dbms2.com/2009/11/07/calponts-infinidb/#comments</comments>
		<pubDate>Sun, 08 Nov 2009 01:35:25 +0000</pubDate>
		<dc:creator>Curt Monash</dc:creator>
				<category><![CDATA[Analytic technologies]]></category>
		<category><![CDATA[Calpont]]></category>
		<category><![CDATA[Columnar database management]]></category>
		<category><![CDATA[Data warehousing]]></category>
		<category><![CDATA[Database compression]]></category>
		<category><![CDATA[Infobright]]></category>
		<category><![CDATA[MySQL]]></category>
		<category><![CDATA[Open source]]></category>

		<guid isPermaLink="false">http://www.dbms2.com/?p=1207</guid>
		<description><![CDATA[Since its inception, Calpont has gone through multiple management teams, strategies, and investor groups. What it hadn&#8217;t done, ever, is actually shipped a product. Last week, however, Calpont introduced a free/open source DBMS, InfiniDB, with technical details somewhat reminiscent of what Calpont was promising last April. Highlights include:

Like Infobright, Calpont&#8217;s 	InfiniDB is a columnar DBMS [...]]]></description>
			<content:encoded><![CDATA[<p style="margin-bottom: 0in;">Since its inception, Calpont has gone through multiple management teams, strategies, and investor groups. What it hadn&#8217;t done, ever, is actually shipped a product. Last week, however, Calpont introduced a free/open source DBMS, InfiniDB, with technical details somewhat reminiscent of <a href="../2009/04/20/calpont-update-you-read-it-here-first/">what Calpont was promising last April</a>. Highlights include:</p>
<ul>
<li>Like Infobright, Calpont&#8217;s 	InfiniDB is a columnar DBMS consisting of a MySQL front end and a 	columnar storage engine.</li>
<li>Community edition InfiniDB runs on 	a single server.</li>
<li>One of commercial/enterprise 	edition InfiniDB&#8217;s main claims to fame will be MPP support.</li>
<li>There&#8217;s no announced time frame 	for commercial edition InfiniDB.</li>
<li>InfiniDB&#8217;s current compression 	story is dictionary/token only, with decompression occurring  before 	joins are executed. Improvement is a roadmap item.</li>
<li>Indeed, InfiniDB has many roadmap 	items, a few of which can be found <a href="http://infinidb.org/resources/tech-articles/120-infinidb-community-edition-roadmap" onclick="javascript:pageTracker._trackPageview('/infinidb.org');">here</a>. 	Also, a great overview of InfiniDB&#8217;s current state and roadmap can 	be found in <a href="http://www.mysqlperformanceblog.com/2009/11/02/air-traffic-queries-in-infinidb-early-alpha/" onclick="javascript:pageTracker._trackPageview('/www.mysqlperformanceblog.com');">this 	MySQL Performance Blog</a> thread. (And follow the links there to 	find performance discussions of other free analytic DBMS.)</li>
<li>One thing InfiniDB already has 	that is still a roadmap item for Infobright is the ability to run a 	query across multiple cores at once.</li>
<li>One thing free InfiniDB has that 	Infobright only offers in its Enterprise Edition is ACID-compliant 	Insert/Update/Delete. <em>(Note: I wish people would stop saying that Infobright Enterprise Edition isn&#8217;t ACID-compliant, since that point was cleared up <a href="http://www.dbms2.com/2009/04/20/infobright-update-3/" >a while ago</a>.)</em></li>
<li>InfiniDB has no indexes or 	materialized views.</li>
<li>However, InfiniDB&#8217;s retrieval is 	expedited by something called “Extents,” which sounds a lot like 	Netezza&#8217;s zone maps.</li>
</ul>
<p><em>Being on vacation, I&#8217;ll stop there for now. (If it weren&#8217;t for Tropical Storm/ depression Ida, I might not even be posting this much until I get back.)</em></p>
]]></content:encoded>
			<wfw:commentRss>http://www.dbms2.com/2009/11/07/calponts-infinidb/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Greenplum Single-Node Edition &#8212; sometimes free is a real cool price</title>
		<link>http://www.dbms2.com/2009/10/19/greenplum-free-single-node-edition/</link>
		<comments>http://www.dbms2.com/2009/10/19/greenplum-free-single-node-edition/#comments</comments>
		<pubDate>Mon, 19 Oct 2009 13:25:41 +0000</pubDate>
		<dc:creator>Curt Monash</dc:creator>
				<category><![CDATA[Analytic technologies]]></category>
		<category><![CDATA[Data warehousing]]></category>
		<category><![CDATA[EnterpriseDB and Postgres Plus]]></category>
		<category><![CDATA[Greenplum]]></category>
		<category><![CDATA[Infobright]]></category>
		<category><![CDATA[Open source]]></category>
		<category><![CDATA[PostgreSQL]]></category>
		<category><![CDATA[Pricing]]></category>
		<category><![CDATA[Scientific research]]></category>

		<guid isPermaLink="false">http://www.dbms2.com/?p=1158</guid>
		<description><![CDATA[Greenplum is announcing today that you can run Greenplum software on a single 8-core commodity server, free.  First and foremost, that&#8217;s a strong statement that Greenplum wants enterprises to pay it for Greenplum&#8217;s parallelization/”private cloud” capabilities. Second, it may be an attractive gift to a variety of folks who want to extract insight from [...]]]></description>
			<content:encoded><![CDATA[<p style="margin-bottom: 0in;">Greenplum is announcing today that you can run Greenplum software on a single 8-core commodity server, free.  First and foremost, that&#8217;s a strong statement that Greenplum wants enterprises to pay it for Greenplum&#8217;s parallelization/”<a href="../2009/06/08/the-future-of-data-marts/">private cloud</a>” capabilities. Second, it may be an attractive gift to a variety of folks who want to extract insight from terabyte-scale databases of various kinds.</p>
<p style="margin-bottom: 0in;">Greenplum Single-Node Edition:</p>
<ul>
<li>Is free of charge, although you 	can buy support.</li>
<li>Has no restrictions on use, 	production or otherwise.</li>
<li>Has no restrictions on database 	size.</li>
<li>Is closed-source.</li>
</ul>
<p style="margin-bottom: 0in;">For those who want free, terabyte-scale data warehousing software, Greenplum Single-Node Edition may be quite appealing, considering that the main available alternatives are:</p>
<ul>
<li>General-purpose open-source DBMS, 	such as PostgreSQL and MySQL (lacking analytic DBMS performance and 	features)</li>
<li>Infobright Community Edition (the 	other best choice – <a href="../2009/10/14/infobright-notes/">Infobright&#8217;s 	commercial sales success</a> indicates the solidity of Infobright&#8217;s 	technology)</li>
<li>Rough research-project code and 	other other questionable open source offerings</li>
<li>Crippleware from other commercial 	analytic DBMS vendors (e.g., <a href="../2009/10/19/teradata-partners-2009/">Teradata</a>)</li>
</ul>
<p style="margin-bottom: 0in;">For example, comparing PostgreSQL-based Greenplum with PostgreSQL itself, Greenplum offers:</p>
<ul>
<li>The ability to scale out queries 	across all cores in your box (and no, pgpool is not a serious 	alternative)</li>
<li>Storage alternatives such as 	columnar (I am told that EnterpriseDB recently stopped funding a 	project for a PostgreSQL columnar option)</li>
</ul>
<p style="margin-bottom: 0in;"><span id="more-1158"></span>Greenplum would surely also argue that its software is superior to PostgreSQL in parallel load, compression, MapReduce integration, and general fit-and-finish. I imagine that in some (perhaps not all) cases it would be right. PostgreSQL&#8217;s main technical advantages over Greenplum would probably lie in the area of datatype extensibility.</p>
<p style="margin-bottom: 0in;">The main target users for Greenplum&#8217;s Single-Node Edition are obviously <strong>individual enterprise power users or very small analytic teams.</strong> I.e., it&#8217;s people with a data mart need that a central data warehouse isn&#8217;t meeting. Potential benefits to Greenplum include:</p>
<ul>
<li>Adding value to its <a href="../2009/06/08/the-future-of-data-marts/">Enterprise 	Data Cloud</a> story</li>
<li>Seeding the market for future 	enterprise sales</li>
<li>Depriving competitors of revenue, 	perhaps at enterprises too small to ever be paying Greenplum 	customers</li>
</ul>
<p style="margin-bottom: 0in;">In addition, I see free Greenplum as a charity offering that could be appealing to <a href="http://" onclick="javascript:pageTracker._trackPageview('/');">scientists</a> who face PostgreSQL performance limitations.</p>
<p style="margin-bottom: 0in;"><em><strong>Related links</strong></em></p>
<ul>
<li><a href="http://www.greenplum.com/news/252/388/Greenplum-Introduces-Free-Greenplum-Database-Edition-for-Data-Analysts/d,press-releases/" onclick="javascript:pageTracker._trackPageview('/www.greenplum.com');">Greenplum 	Free Single-Node Edition press release</a> (I&#8217;m quoted)</li>
<li><a href="http://www.mysqlperformanceblog.com/2009/10/02/analyzing-air-traffic-performance-with-infobright-and-monetdb/" onclick="javascript:pageTracker._trackPageview('/www.mysqlperformanceblog.com');">MySQL 	Performance blog on MonetDB and Infobright community edition</a></li>
<li><a href="http://archives.postgresql.org/pgsql-general/2009-03/msg01227.php" onclick="javascript:pageTracker._trackPageview('/archives.postgresql.org');">PostgreSQL&#8217;s 	restriction to one core per query</a></li>
<li><a href="http://www.infobright.org/Forums/viewthread/1141/" onclick="javascript:pageTracker._trackPageview('/www.infobright.org');">Infobright&#8217;s 	restriction to one core per query</a></li>
</ul>
]]></content:encoded>
			<wfw:commentRss>http://www.dbms2.com/2009/10/19/greenplum-free-single-node-edition/feed/</wfw:commentRss>
		<slash:comments>11</slash:comments>
		</item>
		<item>
		<title>Infobright notes</title>
		<link>http://www.dbms2.com/2009/10/14/infobright-notes/</link>
		<comments>http://www.dbms2.com/2009/10/14/infobright-notes/#comments</comments>
		<pubDate>Wed, 14 Oct 2009 19:32:36 +0000</pubDate>
		<dc:creator>Curt Monash</dc:creator>
				<category><![CDATA[Analytic technologies]]></category>
		<category><![CDATA[Data mart outsourcing]]></category>
		<category><![CDATA[Data warehousing]]></category>
		<category><![CDATA[Infobright]]></category>
		<category><![CDATA[Investment research and trading]]></category>
		<category><![CDATA[Kickfire]]></category>
		<category><![CDATA[Log analysis]]></category>
		<category><![CDATA[Market share]]></category>
		<category><![CDATA[MySQL]]></category>
		<category><![CDATA[Open source]]></category>
		<category><![CDATA[Telecommunications]]></category>
		<category><![CDATA[Web analytics]]></category>

		<guid isPermaLink="false">http://www.dbms2.com/?p=1091</guid>
		<description><![CDATA[I had lunch w/ Bob Zurek and Susan Davis of Infobright today. This wasn&#8217;t primarily a briefing, but a few takeaways are:

Infobright now has &#62;100 paying customers.
Typical database size is from the low 100s of gigabytes to the low single-digit number of terabytes.
Agile development is at or approaching two-week release cycles.
Like Kickfire, Infobright  has a [...]]]></description>
			<content:encoded><![CDATA[<p>I had lunch w/ Bob Zurek and Susan Davis of Infobright today. This wasn&#8217;t primarily a briefing, but a few takeaways are:</p>
<ul>
<li>Infobright now has &gt;100 paying customers.</li>
<li>Typical database size is from the low 100s of gigabytes to the low single-digit number of terabytes.</li>
<li>Agile development is at or approaching two-week release cycles.</li>
<li>Like Kickfire, Infobright  has a multi-year deal with MySQL that insulates it against many potential Oracle/MySQL shenanigans.</li>
<li>From an industry perspective, Infobright&#8217;s customer base sounds a lot like other vendors&#8217;:
<ul>
<li>Data mart outsourcing/online analytics</li>
<li>Log files for websites</li>
<li>Telecommunications</li>
<li>Financial services</li>
<li>OEM, especially in the markets cited above</li>
<li>&#8220;Hey, we&#8217;re beginning to see the occasional energy deal&#8221;</li>
<li>A few random others</li>
</ul>
</li>
<li>Infobright is seeing some household-name customers, who surely have big-name analytic DBMS products, but who also have a policy that open source is the default choice, and if open source can get the job done then the favorite closed-source choices aren&#8217;t used.</li>
<li>Infobright has the usual open-source community story &#8212; lots of involvement and engagement in the forums, but contributions are limited mainly to connectivity, utility scripts, etc. (Maybe some national language translation too; I&#8217;m not sure.)</li>
</ul>
]]></content:encoded>
			<wfw:commentRss>http://www.dbms2.com/2009/10/14/infobright-notes/feed/</wfw:commentRss>
		<slash:comments>7</slash:comments>
		</item>
		<item>
		<title>Infobright metrics</title>
		<link>http://www.dbms2.com/2009/07/08/infobright-metrics/</link>
		<comments>http://www.dbms2.com/2009/07/08/infobright-metrics/#comments</comments>
		<pubDate>Wed, 08 Jul 2009 17:41:50 +0000</pubDate>
		<dc:creator>Curt Monash</dc:creator>
				<category><![CDATA[Data warehousing]]></category>
		<category><![CDATA[Infobright]]></category>
		<category><![CDATA[Market share]]></category>

		<guid isPermaLink="false">http://www.dbms2.com/?p=836</guid>
		<description><![CDATA[Merv Adrian posted about Infobright, and included some company-supplied metrics. Most looked familiar from a post I did in April, but Infobright&#8217;s latest figure for # of paying customers seems to be &#8220;&#62;60&#8243;, up from &#8220;&#62;50&#8243;. Pricing aside, that&#8217;s Vertica/Greenplum territory &#8212; behind Netezza, Teradata, and the big OLTP DBMS vendors, but ahead of everybody [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://mervadrian.wordpress.com/2009/07/07/infobright-bids-to-anchor-an-open-source-dw-ecosystem/" onclick="javascript:pageTracker._trackPageview('/mervadrian.wordpress.com');">Merv Adrian posted about Infobright</a>, and included some company-supplied metrics. Most looked familiar from <a href="http://www.dbms2.com/2009/04/20/infobright-update-3/" >a post I did in April</a>, but Infobright&#8217;s latest figure for # of paying customers seems to be &#8220;&gt;60&#8243;, up from &#8220;&gt;50&#8243;. Pricing aside, that&#8217;s Vertica/Greenplum territory &#8212; behind Netezza, Teradata, and the big OLTP DBMS vendors, but ahead of everybody else I think of as a modern analytic DBMS vendor.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.dbms2.com/2009/07/08/infobright-metrics/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>
