<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>DBMS 2 : DataBase Management System Services &#187; StreamBase</title>
	<atom:link href="http://www.dbms2.com/category/products-and-vendors/streambase/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.dbms2.com</link>
	<description>Choices in data management and analysis</description>
	<lastBuildDate>Wed, 08 Feb 2012 17:17:32 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.0.3</generator>
		<item>
		<title>StreamBase LiveView &#8212; push-based real-time BI</title>
		<link>http://www.dbms2.com/2011/11/10/streambase-liveview-push-based-real-time-bi/</link>
		<comments>http://www.dbms2.com/2011/11/10/streambase-liveview-push-based-real-time-bi/#comments</comments>
		<pubDate>Fri, 11 Nov 2011 03:38:53 +0000</pubDate>
		<dc:creator>Curt Monash</dc:creator>
				<category><![CDATA[Business intelligence]]></category>
		<category><![CDATA[Complex event processing (CEP)]]></category>
		<category><![CDATA[Data warehousing]]></category>
		<category><![CDATA[Memory-centric data management]]></category>
		<category><![CDATA[StreamBase]]></category>

		<guid isPermaLink="false">http://www.dbms2.com/?p=5631</guid>
		<description><![CDATA[My clients at StreamBase are coming out with a new product line called LiveView, and I agreed they could launch it via this blog. Key points about StreamBase LiveView Version 1.0 include: LiveView is a business intelligence and alerting suite built on/in the rest of StreamBase&#8217;s technology, meant to operate on streaming data. LiveView is [...]]]></description>
			<content:encoded><![CDATA[<p>My clients at StreamBase are coming out with a new product line called LiveView, and I agreed they could launch it via this blog. Key points about StreamBase LiveView Version 1.0 include:</p>
<ul>
<li>LiveView is a business intelligence and alerting suite built on/in <a href="http://www.dbms2.com/2011/11/10/streambase-catchup/">the rest of </a><a href="http://www.dbms2.com/2011/11/10/streambase-catchup/">StreamBase&#8217;s technology</a>, meant to operate on streaming data.</li>
<li>LiveView is positioned by StreamBase as having a true push event-driven architecture rather than pull/poll.</li>
<li>StreamBase LiveView is designed to query in-memory data and then have the results change in real time as the data set changes.</li>
<li>The LiveView user interface is a rapidly changing work in progress.</li>
<li>LiveView has other Version 1 limitations as well</li>
<li>LiveView is targeted squarely at StreamBase&#8217;s financial trading core market until some of the Version 1 limitations are lifted.</li>
</ul>
<p>The basic StreamBase LiveView pipeline goes something like:   <span id="more-5631"></span></p>
<ul>
<li>Data comes into the system via multiple streams.</li>
<li>Transformations upon data arrival can include but are not limited to:
<ul>
<li>Aggregations.</li>
<li>Joins to reference data.</li>
<li>Joins to other streams.</li>
</ul>
</li>
<li>The streams (transformed or perhaps otherwise) are output to tables &#8230;</li>
<li>&#8230; which are continuously updated as more data streams through.</li>
<li> The data in the resulting table can be consumed:
<ul>
<li>Via LiveView-provided BI capabilities.</li>
<li>Via an API.</li>
</ul>
</li>
</ul>
<p>When wearing my vendor consultant hat, I warmly encourage StreamBase to emphasize the lack of a batch step anywhere in this process. As an analyst, however, I&#8217;m more restrained about a claim like &#8220;We uniquely free you from batch.&#8221; I agree that avoiding batch jobs is a Very Nice Thing. But you also are spared most batch-cycle processing if you stream updates from your short-request database to an analytic DBMS, e.g. via some kind of near-real-time replication.</p>
<p>That said, the push-versus-pull continuous filtering part of the StreamBase LiveView story seems pretty real. I think having sub-second display updates is cool in all sorts of BI use cases, and seriously useful in some number of them. While I don&#8217;t have a clear opinion as to whether the StreamBase approach offers huge performance advantages for that kind of latency over &#8220;pull&#8221; alternatives, my guess is in the direction of &#8220;yes&#8221;.</p>
<p>Version 1 limitations on StreamBase LiveView include:</p>
<ul>
<li>You consume data one table at a time, with no possibility of a join after the data has originally been put into a LiveView table.</li>
<li>While LiveView in principle offers rich alerting potential, you get at it via an API rather than much in the way of alerting-specific tools.</li>
<li>The first LiveView UI StreamBase put together looks a lot like 1980s stock quote machines. The next one it added looks a lot like Panopticon. Much cool-looking enhancement remains to be done.</li>
</ul>
<p><em>One competitive (non)-note: This all sounds something like what TIBCO has been pushing for years, but in fact I don&#8217;t have much knowledge of TIBCO&#8217;s efforts in the area. I had a meeting set up to learn about it some time ago, but it got canceled because TIBCO&#8217;s PR people:</em></p>
<ul>
<li><em>Didn&#8217;t want to let any kind of meeting happen without them, even though a serious CTO-type representative seemed happy to talk, but also &#8230;</em></li>
<li><em>&#8230; didn&#8217;t want to work at dinner time.</em></li>
</ul>
<p><em>I haven&#8217;t had substantive contact with TIBCO since.</em></p>
]]></content:encoded>
			<wfw:commentRss>http://www.dbms2.com/2011/11/10/streambase-liveview-push-based-real-time-bi/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>StreamBase catchup</title>
		<link>http://www.dbms2.com/2011/11/10/streambase-catchup/</link>
		<comments>http://www.dbms2.com/2011/11/10/streambase-catchup/#comments</comments>
		<pubDate>Fri, 11 Nov 2011 03:31:44 +0000</pubDate>
		<dc:creator>Curt Monash</dc:creator>
				<category><![CDATA[Complex event processing (CEP)]]></category>
		<category><![CDATA[Investment research and trading]]></category>
		<category><![CDATA[Parallelization]]></category>
		<category><![CDATA[StreamBase]]></category>

		<guid isPermaLink="false">http://www.dbms2.com/?p=5630</guid>
		<description><![CDATA[While I was cryptic in my general CEP/streaming catchup, I&#8217;ll say a bit more regarding StreamBase in particular. At the highest level, non-technically: StreamBase once planned to conquer the world. However, StreamBase really only sold effectively in the financial trading and intelligence markets. StreamBase retrenched, focusing almost exclusively on the financial trading market. With StreamBase [...]]]></description>
			<content:encoded><![CDATA[<p>While I was cryptic in my general <a href="http://www.dbms2.com/2011/11/10/cep-streaming-catchup/">CEP/streaming catchup</a>, I&#8217;ll say a bit more regarding StreamBase in particular. At the highest level, non-technically:</p>
<ul>
<li>StreamBase once planned to conquer the world.</li>
<li>However, StreamBase really only sold effectively in the financial trading and intelligence markets.</li>
<li>StreamBase retrenched, focusing almost exclusively on the financial trading market.</li>
<li>With <a href="http://www.dbms2.com/2011/11/10/streambase-liveview-push-based-real-time-bi/">StreamBase LiveView</a>, StreamBase is expanding from embedded <a href="../../../../../2011/11/08/terminology-operational-analytics/">operational analytics</a> to do (also operational) business intelligence as well.</li>
<li>StreamBase is hopeful that, perhaps starting with Version 2 or so, LiveView will be successful outside the financial trading market.</li>
</ul>
<p><span id="more-5630"></span><em>Not coincidental to these shifts in focus, StreamBase was our client, then stopped being one for a while, and now is a client again.</em></p>
<p>StreamBase (the product set) consists primarily of three things (LiveView aside):</p>
<ul>
<li>A development environment, whose output is in &#8230;</li>
<li>&#8230; a visual programming language called EventFlow &#8230;</li>
<li>&#8230; which is complied and executed by StreamBase&#8217;s execution layers.</li>
</ul>
<p>One important set of ancillary products are StreamBase&#8217;s connectors to various data sources &#8212; StreamBase offers about 125 of its own, a number that approaches 200 when <a href="../../../../../2010/02/16/quick-thoughts-on-the-streambase-component-exchange/">community contributions</a> are included.</p>
<p>StreamBase has a second programming language called StreamSQL, but that&#8217;s rarely used except for embedding in or connecting to third-party software. EventFlow and StreamSQL compile to nearly identical byte code. (The main difference seems to be that as a practical matter you&#8217;ll name things a bit differently in the two languages, focusing on verbs in EventFlow and nouns in StreamSQL.)</p>
<p>StreamBase says that in the financial trading market, great performance out of the box equates to better time-to-value, since you are spared time you&#8217;d otherwise have to spend tuning the system. Implicit in that is a claim &#8212; which competitors might dispute <img src='http://www.dbms2.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  &#8212; that StreamBase has great <a href="../../../../../2009/05/21/notes-on-cep-performance/">performance</a>. StreamBase fondly thinks that having a domain-specific language gives it a leg up in achieving great compiler optimization. (The same would presumably apply to StreamBase&#8217;s competitors, but only if they have optimizing compilers themselves.)</p>
<p>One point that&#8217;s a little unusual for me these days is that StreamBase favors big SMP (Symmetric MultiProcessing) boxes over blade-based scale-out. 16+ cores and 256 gigabytes of RAM are not uncommon. Clusters commonly include 4-8 machines, but rarely more; the largest StreamBase cluster evidently contains 36 machines.</p>
<p>And with that I&#8217;ll turn to StreamBase&#8217;s newest offering, <a href="http://www.dbms2.com/2011/11/10/streambase-liveview-push-based-real-time-bi/">LiveView</a>.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.dbms2.com/2011/11/10/streambase-catchup/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>Very brief CEP/streaming catchup</title>
		<link>http://www.dbms2.com/2011/11/10/cep-streaming-catchup/</link>
		<comments>http://www.dbms2.com/2011/11/10/cep-streaming-catchup/#comments</comments>
		<pubDate>Fri, 11 Nov 2011 03:29:37 +0000</pubDate>
		<dc:creator>Curt Monash</dc:creator>
				<category><![CDATA[Business intelligence]]></category>
		<category><![CDATA[Complex event processing (CEP)]]></category>
		<category><![CDATA[IBM and DB2]]></category>
		<category><![CDATA[StreamBase]]></category>
		<category><![CDATA[Sybase]]></category>
		<category><![CDATA[Truviso]]></category>

		<guid isPermaLink="false">http://www.dbms2.com/?p=5632</guid>
		<description><![CDATA[When I agreed to launch the StreamBase LiveView product via DBMS 2, I planned to catch up on the whole CEP/streaming area first. Due to the power and internet outages last week, that didn&#8217;t entirely happen. So I&#8217;ll do a bit of that now, albeit more cryptically than I hoped and intended. The upshot of [...]]]></description>
			<content:encoded><![CDATA[<p>When I agreed to launch the StreamBase LiveView product via <em>DBMS 2,</em> I planned to catch up on the whole CEP/streaming area first. Due to the power and internet outages last week, that didn&#8217;t entirely happen. So I&#8217;ll do a bit of that now, albeit more cryptically than I hoped and intended.</p>
<ul>
<li>The upshot of my <a href="../../../../../2011/08/25/renaming-cep-or-not/">what to call CEP thread</a> in August was that &#8220;streaming&#8221; and &#8220;event processing&#8221; are not the same concept, but it so happens that they have the most traction where they intersect. That said, I both observe and endorse an apparent shift from &#8220;event&#8221; to &#8220;stream&#8221; as the core of the terminology, in <a href="../../../../../2008/03/19/what-to-call-cep/">a reversal of my opinion of several years ago</a>.</li>
<li>IBM continues to throw a lot of resources at its <a href="../../../../../2009/05/13/ibm-system-s-infosphere-streams-processing/">System S/ InfoSphere Streams</a> product, but I haven&#8217;t heard yet of much marketplace success. That said, I believe IBM is still pretty serious about Streams, as one would expect from an effort whose code name so cheekily references <a href="http://www.softwarememories.com/2008/10/02/a-bit-of-db2-history-per-ibm/">System R</a>. In particular, Streams shows up prominently on IBM&#8217;s top-level analytic architecture slide.</li>
<li>Sybase recently released its ESP (Event Stream Processor) 5.0, which it says is the full merger of the Aleri and Coral8 predecessors. You can still get Sybase ESP without buying into the full <a href="../../../../../2010/02/05/sybase-aleri-rap/">Sybase RAP</a> stack, and Sybase has no plans to change that.</li>
<li>Sybase has discontinued all <a href="../../../../../2009/03/25/aleri-update/">the business intelligence types of products Aleri and Coral8 were developing</a>. Rather, Sybase is OEMing Panopticon, which it reports has been well received. Other than the discontinuation of the BI efforts, there seem to be few Aleri or Coral8 features missing from the merged Sybase ESP product.</li>
<li>Truviso continues to be <a href="../../../../../2010/05/04/truviso-evidently-reinvents-itself/">out of the picture</a>.</li>
<li>I have more to say about <a href="http://www.dbms2.com/2011/11/10/streambase-catchup/">StreamBase</a> separately.</li>
<li>I have more to say about Sybase and IBM, which I&#8217;ll get to when I can.</li>
<li>I have nothing new on Progress Apama. I also know little about any of the open source efforts.</li>
</ul>
<p>Meanwhile, if you want to see technically nitty-gritty posts about the CEP/streaming area, you may want to look at <a href="../../../../../category/memory-centric-data-management/event-stream-processing/page/4/">my CEP/streaming coverage circa 2007-9</a>, based on conversations with (among others) <a href="../../../../../2007/06/18/mike-stonebraker-on-financial-stream-processing/">Mike Stonebraker</a>, <a href="../../../../../2007/08/03/a-deeper-dive-into-apama/">John Bates</a>, and <a href="../../../../../2007/08/10/the-essence-of-cep-according-to-coral8/">Mark Tsimelzon</a>.</p>
<p><strong> </strong></p>
]]></content:encoded>
			<wfw:commentRss>http://www.dbms2.com/2011/11/10/cep-streaming-catchup/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Renaming CEP &#8230; or not</title>
		<link>http://www.dbms2.com/2011/08/25/renaming-cep-or-not/</link>
		<comments>http://www.dbms2.com/2011/08/25/renaming-cep-or-not/#comments</comments>
		<pubDate>Fri, 26 Aug 2011 02:58:22 +0000</pubDate>
		<dc:creator>Curt Monash</dc:creator>
				<category><![CDATA[Complex event processing (CEP)]]></category>
		<category><![CDATA[StreamBase]]></category>

		<guid isPermaLink="false">http://www.dbms2.com/?p=5127</guid>
		<description><![CDATA[One of the less popular category names I deal with is &#8220;Complex Event Processing (CEP)&#8221;. The word &#8220;complex&#8221; looks weird, and many are unsure about the &#8220;event processing&#8221; part as well. CEP does have one virtue as a name, however &#8212; it&#8217;s concise. The other main alternative is to base the name on &#8220;stream processing&#8221; [...]]]></description>
			<content:encoded><![CDATA[<p>One of the less popular category names I deal with is &#8220;Complex Event Processing (CEP)&#8221;. The word &#8220;complex&#8221; looks weird, and many are unsure about the &#8220;event processing&#8221; part as well. CEP does have one virtue as a name, however &#8212; it&#8217;s concise.</p>
<p>The other main alternative is to base the name on &#8220;stream processing&#8221; instead.* The CEP-or-whatever industry is split between these choices, with <a href="http://www.streambase.com/about-home.htm">StreamBase</a> currently favoring &#8220;CEP&#8221; (despite its company name), <a href="../../../../../2009/05/13/ibm-system-s-infosphere-streams-processing/">IBM emphatically favoring &#8220;stream&#8221;</a>, and Sybase seemingly trying to have things both ways.</p>
<p><em>*And then, of course, there is &#8220;event stream processing&#8221;, regarding which please see below.</em></p>
<p><span id="more-5127"></span>I&#8217;ve been juggling this terminological divide myself, referring to <a href="../../../../../2007/08/12/applications-for-not-so-low-latency-cep/">complex event/stream processing</a> as long as four years ago. But enough is enough. I&#8217;d like to write more about the category without repeatedly apologizing for its name. And so, always bearing in mind <a href="http://www.strategicmessaging.com/no-market-categorization-is-ever-precise/2011/03/01/">Monash&#8217;s Third Law of Commercial Semantics</a>, here&#8217;s where I&#8217;m coming down.</p>
<p>The more I think about it, the less I like the term &#8220;event processing&#8221;. Here&#8217;s why. Events happen; data is produced; CEP systems most commonly try to identify and categorize the events based on the data. The CEP systems may then do significant further processing, but more often they just pass the information on to another system (most commonly either persistent DBMS or &#8220;real-time&#8221; business intelligence). How much of that is really &#8220;event processing&#8221;? Relatively little, I&#8217;d say. And referring specifically to &#8220;complex&#8221; events doesn&#8217;t address my complaints at all.</p>
<p>So I&#8217;d like to go with some version of &#8220;stream&#8221;. But &#8220;<a href="http://en.wikipedia.org/wiki/Stream_processing">stream processing</a>&#8221; has other computer-related uses, while &#8220;Stream management&#8221; commonly describes care and planning for small waterways. So &#8220;stream&#8221; might do best with a modifier, such as &#8220;event&#8221; or &#8220;data&#8221;. Of the two, I prefer &#8220;data stream&#8221; (or &#8220;datastream&#8221;) to &#8220;event stream&#8221;; the events aren&#8217;t really streaming, but the data is.</p>
<p>So should it be &#8220;data stream processing&#8221; or &#8220;data stream management&#8221;? Well, the only one of numerous Wikipedia definitions I&#8217;ve actually liked while researching this post is the one for &#8220;<a href="http://en.wikipedia.org/wiki/Data_Stream_Management_System">Data Stream Management System</a>&#8220;:</p>
<blockquote><p>A <strong>Data Stream Management System</strong> (<strong>DSMS</strong>) is a set of computer programs that controls the maintenance and querying of data in data streams. The use of a DSMS to manage a data stream is roughly analogous to the use of a Database Management System (DBMS) to manage a conventional database.</p>
<p>A key feature of a DSMS is the ability to execute a <em>continuous query</em> against a data stream. A conventional database query executes once and returns a set of results for a given point in time. In contrast, a continuous query continues to execute over time, as new data enters the stream. The results of the continuous query are updated as new data appears.</p></blockquote>
<p>I think the data stream/database management analogy is spot on. Your queries work a little differently, but otherwise you&#8217;re doing pretty much the same things. Indeed, you&#8217;re probably even going to persistently store some of the data, and ideally that DBMS capability would be tightly integrated into your CEP system. (In practice they&#8217;re apt to be more loosely coupled; for most purposes that works well enough.) Query execution, data ingestion, performance monitoring/tuning, workload prioritization &#8212; it&#8217;s very DBMS-like stuff. And by the way, &#8220;data stream management system&#8221; is the term that was used by the researchers &#8212; Mike Stonebreaker, Stan Zdonik, Dan Abadi, et al. &#8212; who wrote a paper describing <a href="http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.67.8671&amp;rep=rep1&amp;type=pdf">the project on which StreamBase was based</a> &#8230; although some might question whether that particular observation is a strong signal of accuracy. <img src='http://www.dbms2.com/wp-includes/images/smilies/icon_wink.gif' alt=';)' class='wp-smiley' /> </p>
<p>This reasoning suggests <strong>Data Stream Management System</strong> is what it should be. The usual kinds of abbreviation &#8212; datastream (product), datastream manager, DSMS, etc. would no doubt follow. So should it be &#8220;Data Stream&#8221;, &#8220;Datastream&#8221;, or &#8220;Data-stream&#8221;? At that level of detail, I don&#8217;t yet have an opinion.</p>
<p>The only thing is &#8212; that&#8217;s all pretty wordy compared to <strong>CEP. </strong>So after all this, I&#8217;m still not sure which term(s) I prefer.</p>
<p>What are your thoughts?</p>
]]></content:encoded>
			<wfw:commentRss>http://www.dbms2.com/2011/08/25/renaming-cep-or-not/feed/</wfw:commentRss>
		<slash:comments>24</slash:comments>
		</item>
		<item>
		<title>Some quick notes on HP-Vertica</title>
		<link>http://www.dbms2.com/2011/02/14/some-quick-notes-on-hp-vertica/</link>
		<comments>http://www.dbms2.com/2011/02/14/some-quick-notes-on-hp-vertica/#comments</comments>
		<pubDate>Mon, 14 Feb 2011 17:19:57 +0000</pubDate>
		<dc:creator>Curt Monash</dc:creator>
				<category><![CDATA[Complex event processing (CEP)]]></category>
		<category><![CDATA[In-memory DBMS]]></category>
		<category><![CDATA[Investment research and trading]]></category>
		<category><![CDATA[Memory-centric data management]]></category>
		<category><![CDATA[StreamBase]]></category>
		<category><![CDATA[VoltDB and H-Store]]></category>

		<guid isPermaLink="false">http://www.dbms2.com/?p=3862</guid>
		<description><![CDATA[HP is acquiring Vertica.  Now we know (at least in part) why Vertica went oddly silent for a while. As per that same link, Vertica has &#62;250 ordinary customers, and &#62;70 more OEM sell-through ones. This is a setback for speculation about any kind of upcoming Aster/HP tie-up. Edit:  Forgot this one briefly &#8212; HP [...]]]></description>
			<content:encoded><![CDATA[<p>HP is acquiring Vertica.  <span id="more-3862"></span></p>
<ul>
<li>Now we know (at least in part) why <a href="http://www.dbms2.com/2011/02/14/now-we-know-why-vertica-has-been-so-weirdly-evasive/">Vertica went oddly silent</a> for a while.</li>
<li>As per that same link, Vertica has &gt;250 ordinary customers, and &gt;70 more OEM sell-through ones.</li>
<li>This is a setback for speculation about any kind of upcoming <a href="http://www.dbms2.com/2011/01/19/sound-bites-on-hpmicrosoft-and-neoview/">Aster/HP</a> tie-up.</li>
<li><em>Edit:  Forgot this one briefly &#8212; HP chairman Ray Lane was previously involved with Vertica.</em></li>
<li>Vertica arguably is the most mature of the modern <a href="http://www.dbms2.com/2011/02/06/columnar-compression-database-storage/">column-store DBMS</a> &#8212; i.e., the ones that don&#8217;t have their roots in bitmaps the way Sybase and SAND do.</li>
<li><a href="http://www.dbms2.com/2010/09/07/soundbites-about-mark-hurd-joining-oracle/">HP executed really badly in data warehouse DBMS and appliances</a> under former CEO Mark Hurd.</li>
<li>Unfortunately, if you&#8217;re quickly researching Vertica, neither <a href="http://www.dbms2.com/2011/02/05/gartner-magic-quadrant-data-warehouse-database-management-2010/">Gartner</a> nor <a href="http://www.dbms2.com/2011/02/11/comments-on-the-2011-forrester-wave-for-enterprise-data-warehouse-platforms/">Forrester</a> is a reliable source of detailed information.</li>
<li>It would make sense for HP to acquire <a href="http://www.dbms2.com/category/products-and-vendors/streambase/">StreamBase</a> too, and fold StreamBase into Vertica. Reasons include:
<ul>
<li>StreamBase and Vertica are aligned with each other. Both were founded by Mike Stonebraker, with overlapping groups of academic contributors. Both are in the Boston area. StreamBase and Vertica have worked together on various joint customer accounts, especially in the financial services sector.</li>
<li>Like other <a href="http://www.dbms2.com/2009/03/09/independent-cep-vendors-continue-to-flounder/">independent CEP vendors</a>, StreamBase can&#8217;t or won&#8217;t accomplish much outside certain niches (mainly financial services).</li>
<li>StreamBase reports, rather credibly, that it&#8217;s doing well in its niches. While StreamBase&#8217;s success seems to include a heavy dose of professional services, that hardly would be a deal-breaker for HP.</li>
</ul>
</li>
<li>It would make partial sense for HP to acquire <a href="http://www.dbms2.com/2010/05/25/voltdb-finally-launches/">VoltDB</a>, and fold VoltDB into Vertica.
<ul>
<li>VoltDB was actually spun out of Vertica, and incubated in Vertica offices. A lot of thinking has already been done about how to integrate VoltDB and Vertica.</li>
<li>VoltDB needs the help, as its strategy is not attuned to the needs of succeeding in a highly competitive, rapidly innovative marketplace.</li>
<li>VoltDB doesn&#8217;t have the kind of traction on which a big company like HP could hang an acquisition case or acquisition strategy.</li>
</ul>
</li>
</ul>
]]></content:encoded>
			<wfw:commentRss>http://www.dbms2.com/2011/02/14/some-quick-notes-on-hp-vertica/feed/</wfw:commentRss>
		<slash:comments>12</slash:comments>
		</item>
		<item>
		<title>Quick thoughts on the StreamBase Component Exchange</title>
		<link>http://www.dbms2.com/2010/02/16/quick-thoughts-on-the-streambase-component-exchange/</link>
		<comments>http://www.dbms2.com/2010/02/16/quick-thoughts-on-the-streambase-component-exchange/#comments</comments>
		<pubDate>Tue, 16 Feb 2010 15:04:22 +0000</pubDate>
		<dc:creator>Curt Monash</dc:creator>
				<category><![CDATA[Complex event processing (CEP)]]></category>
		<category><![CDATA[Games and virtual worlds]]></category>
		<category><![CDATA[Investment research and trading]]></category>
		<category><![CDATA[Open source]]></category>
		<category><![CDATA[StreamBase]]></category>

		<guid isPermaLink="false">http://www.dbms2.com/?p=1591</guid>
		<description><![CDATA[Streambase is announcing something called the StreamBase Component Exchange, for developers to exchange components to be used with the StreamBase engine, presumably on an open source basis. I simultaneously think: This is a good idea, and many software vendors should do it if they aren&#8217;t already. It&#8217;s no big deal. For reasons why, let me [...]]]></description>
			<content:encoded><![CDATA[<p>Streambase is announcing something called the <a href="http://streambase.com/b6409b0d-7d1f-4cf8-99b9-98b2b1858628/press-release-detail.htm">StreamBase Component Exchange</a>, for developers to exchange components to be used with the StreamBase engine, presumably on an open source basis. I simultaneously think:</p>
<ul>
<li>This is a good idea, and many software vendors should do it if they aren&#8217;t already.</li>
<li>It&#8217;s no big deal.</li>
</ul>
<p>For reasons why, let me quote an email I just sent to an inquiring reporter:</p>
<ul>
<li>StreamBase sells mainly to the financial services and intelligence community markets. Neither group will share much in the way of core algorithms.</li>
<li>But both groups are <a href="http://www.dbms2.com/2009/01/27/introduction-to-pentaho/">pretty interested in open source software</a> even so. (I think for both the price and customizability benefits.)</li>
<li>Open source software commonly gets community contributions for connectors, adapters, and (national) language translations.</li>
<li>But useful contributions in other areas are much rarer.</li>
<li>Linden Labs is one of StreamBase&#8217;s <a href="http://www.dbms2.com/2009/03/09/independent-cep-vendors-continue-to-flounder/">few significant customers outside its two core markets</a>.</li>
<li>All of the above are consistent with the press release (which quotes only one StreamBase customer &#8212; guess who?).</li>
</ul>
]]></content:encoded>
			<wfw:commentRss>http://www.dbms2.com/2010/02/16/quick-thoughts-on-the-streambase-component-exchange/feed/</wfw:commentRss>
		<slash:comments>5</slash:comments>
		</item>
		<item>
		<title>Notes on CEP application development</title>
		<link>http://www.dbms2.com/2009/05/21/notes-on-cep-application-development/</link>
		<comments>http://www.dbms2.com/2009/05/21/notes-on-cep-application-development/#comments</comments>
		<pubDate>Thu, 21 May 2009 08:21:49 +0000</pubDate>
		<dc:creator>Curt Monash</dc:creator>
				<category><![CDATA[Aleri and Coral8]]></category>
		<category><![CDATA[Business intelligence]]></category>
		<category><![CDATA[Complex event processing (CEP)]]></category>
		<category><![CDATA[Microsoft and SQL*Server]]></category>
		<category><![CDATA[Progress, Apama, and DataDirect]]></category>
		<category><![CDATA[StreamBase]]></category>

		<guid isPermaLink="false">http://www.dbms2.com/?p=788</guid>
		<description><![CDATA[While performance may not be all that great a source of CEP competitive differentiation, event processing vendors find plenty of other bases for technological competition, including application development, analytics, packaged applications, and data integration. In particular: Most independent CEP vendors have some kind of application story in the capital markets vertical, such as packaged applications, [...]]]></description>
			<content:encoded><![CDATA[<p style="margin-bottom: 0in;">While <a href="http://www.dbms2.com/2009/05/21/notes-on-cep-performance/">performance may not be all that great a source of CEP competitive differentiation</a>, event processing vendors find plenty of other bases for technological competition, including application development, analytics, packaged applications, and data integration.  In particular:</p>
<ul>
<li>Most independent CEP vendors have some 	kind of application story in the capital markets vertical, such as 	packaged applications, ISV partners with packaged applications, 	application frameworks, and so on.</li>
<li>CEP vendors offer lots of connectors 	to specific financial industry price/quote/trade feeds, as well as 	the usual other kinds of database connectivity (SQL, XML, etc.)</li>
<li>Aleri/Coral8 (separately and now <a href="http://www.dbms2.com/2009/03/25/aleri-update/"> together</a>) like to call attention to their business 	intelligence/analytics offerings. Analytics is front-and-center on 	Truviso&#8217;s web site too, not that Truviso does much to call attention 	to itself, period.  (Roman Bukary once said he&#8217;d outline Truviso&#8217;s 	new strategy to me in 6-8 weeks or so &#8230; it&#8217;s now 14 months and 	counting.)</li>
</ul>
<p style="margin-bottom: 0in;">So far as I can tell, the areas of applications and analytics are fairly uncontroversial. Different CEP vendors have implemented different kinds of things, no doubt focusing on those they thought they would find easiest to build and then sell.  But these seem to be choices in business execution, not in core technical philosophy.</p>
<p style="margin-bottom: 0in;">In CEP application development, however, real philosophical differences do seem to arise.  There are at least three different CEP application development paradigms:<span id="more-788"></span></p>
<ul>
<li><strong>DML (Data Manipulation 	Language) extensions</strong> to handle time windows, etc., which are 	then embodied into a conventional application development stack.  	<a href="http://www.dbms2.com/2009/05/13/microsoft-announced-cep-this-week-too/">Microsoft</a> exemplifies an extreme form of this strategy, but 	most vendors offer it at least as an option.</li>
<li><strong>Visual event-oriented 	programming language.</strong> This is StreamBase&#8217;s claim to fame; 	StreamBase says that 95% of its customers do 100% of their 	development in its Eclipse-based visual tool, which truly creates 	executable programs without any kind of code generation step.  	(Other &#8220;this isn&#8217;t just a toy&#8221; buzzwords StreamBase 	offered were &#8220;modularity, &#8221; &#8220;parametrizability,&#8221; 	and, best of all, &#8220;visual debugger.&#8221;)</li>
<li><strong>Rules engine.</strong> When Progress 	Apama first told me how its preferred tool (I think a precursor of what 	is now Apama Event Modeler) worked, it sounded a lot like a 	RETE-based expert system shell. The Apama folks assured me that this 	really wasn&#8217;t RETE, and I see that there&#8217;s definitely more than 	just a rules language in Apama MonitorScript.  Still, the paradigm 	seems to basically be that you write a bunch of rules, which are 	then executed by a suitable engine.</li>
</ul>
<p style="margin-bottom: 0in;">Even more fundamental, however, is the question of <em>event-driven programming.</em> That&#8217;s a term which first was popular in the 1990s, signifying programs that &#8212; rather than being wholly <em>procedural</em> &#8212; listened for and responded to <em>events,</em> often in user interfaces.  More generally, event-driven programming is inherent in almost any kind of loosely-coupled architecture, be it client-server, service-oriented, or whatever. But to hear some CEP proponents tell it, all that isn&#8217;t event-driven enough.</p>
<p style="margin-bottom: 0in;">In particular, StreamBase recommended to me Gregor Hohpe&#8217;s paper <a href="http://www.enterpriseintegrationpatterns.com/docs/EDA.pdf">Programming Without a Call Stack – Event-driven Architectures</a>, which argues that if there&#8217;s any concept of procedure or method invocation at all, the whole thing is too &#8220;command-and-control&#8221; (Boo!!) and  insufficiently event-driven to be well-suited for CEP.   Rather, everything should be done on a fine-grained publish-subscribe basis.  It&#8217;s an interesting argument. Usually, the problem with extremist programming paradigms is that the thing you write has to interface with the rest of the world, and by the time it accommodates itself to them, the paradigm is violated anyway. But if the essence of the paradigm is loose coupling to begin with, maybe that pitfall can for once be avoided.</p>
<p style="margin-bottom: 0in;">
]]></content:encoded>
			<wfw:commentRss>http://www.dbms2.com/2009/05/21/notes-on-cep-application-development/feed/</wfw:commentRss>
		<slash:comments>5</slash:comments>
		</item>
		<item>
		<title>Notes on CEP performance</title>
		<link>http://www.dbms2.com/2009/05/21/notes-on-cep-performance/</link>
		<comments>http://www.dbms2.com/2009/05/21/notes-on-cep-performance/#comments</comments>
		<pubDate>Thu, 21 May 2009 08:20:36 +0000</pubDate>
		<dc:creator>Curt Monash</dc:creator>
				<category><![CDATA[Aleri and Coral8]]></category>
		<category><![CDATA[Complex event processing (CEP)]]></category>
		<category><![CDATA[IBM and DB2]]></category>
		<category><![CDATA[Memory-centric data management]]></category>
		<category><![CDATA[Progress, Apama, and DataDirect]]></category>
		<category><![CDATA[StreamBase]]></category>

		<guid isPermaLink="false">http://www.dbms2.com/?p=787</guid>
		<description><![CDATA[I&#8217;ve been talking to CEP vendors on and off for a few years. So what I hear about performance is fairly patchwork. On the other hand, maybe 1-2+ year-old figures of per-core performance are still meaningful today. After all, Moore&#8217;s Law is being reflected more in core count than per-core performance, and it seems CEP [...]]]></description>
			<content:encoded><![CDATA[<p>I&#8217;ve been talking to CEP vendors on and off for a few years.  So what I hear about performance is fairly patchwork. On the other hand, maybe 1-2+ year-old figures of per-core performance are still meaningful today.  After all, Moore&#8217;s Law is being reflected more in core count than per-core performance, and it seems CEP vendors&#8217; development efforts haven&#8217;t necessarily been concentrated on raw engine speed.</p>
<p style="margin-bottom: 0in;">So anyway, what do you guys have to add to the following observations?</p>
<ul>
<li>Super-low-latency financial 	services industry tasks are often &#8220;embarrassingly parallel.&#8221; 	Thus, near-linear scale-out is common.</li>
<li>That said, good parallelism seems 	fairly new in CEP engines (of course, CEP engines are fairly new 	themselves &#8212; for all I know, some have been parallel since 	inception).</li>
<li>I&#8217;ve heard claims of up to 400,000 	messages/second/core for simple queries or patterns.</li>
<li>I&#8217;ve heard claims of 70,000 	messages/core for not-so-simple queries or patterns, and probably 	higher than that depending on what the meaning of &#8220;simple&#8221; 	is.</li>
<li>IBM just disclosed <a href="http://www.dbms2.com/2009/05/18/followup-on-ibm-system-sinfosphere-streams/">&gt;15,000 	messages/core on a pretty low-powered processor</a>.</li>
<li>I&#8217;ve heard that Coral8, Apama, and 	StreamBase rarely lost deals due to performance or throughput 	problems. I&#8217;ve heard that the same is not as true of Aleri.</li>
<li>StreamBase proudly says it&#8217;s been fully multithreaded since academic research-project days.  For Apama multithreading is evidently a more recent feature. But does it matter much?</li>
</ul>
]]></content:encoded>
			<wfw:commentRss>http://www.dbms2.com/2009/05/21/notes-on-cep-performance/feed/</wfw:commentRss>
		<slash:comments>13</slash:comments>
		</item>
		<item>
		<title>Independent CEP vendors continue to flounder</title>
		<link>http://www.dbms2.com/2009/03/09/independent-cep-vendors-continue-to-flounder/</link>
		<comments>http://www.dbms2.com/2009/03/09/independent-cep-vendors-continue-to-flounder/#comments</comments>
		<pubDate>Mon, 09 Mar 2009 15:46:57 +0000</pubDate>
		<dc:creator>Curt Monash</dc:creator>
				<category><![CDATA[Aleri and Coral8]]></category>
		<category><![CDATA[Analytic technologies]]></category>
		<category><![CDATA[Business intelligence]]></category>
		<category><![CDATA[Complex event processing (CEP)]]></category>
		<category><![CDATA[Progress, Apama, and DataDirect]]></category>
		<category><![CDATA[StreamBase]]></category>
		<category><![CDATA[Truviso]]></category>

		<guid isPermaLink="false">http://www.dbms2.com/?p=720</guid>
		<description><![CDATA[Independent CEP (Complex/Event Processing) vendors continue to flounder, at least outside the financial services and national intelligence markets. StreamBase once planned to conquer the world, making an impact as big as database management&#8217;s. Now it has retreated into niche markets. Progress Software, a decent-sized company, put a large fraction of its energy into Apama. Little [...]]]></description>
			<content:encoded><![CDATA[<p style="margin-bottom: 0in;">Independent CEP (Complex/Event Processing) vendors continue to flounder, at least outside the financial services  and national intelligence markets.</p>
<ul>
<li>StreamBase once planned to conquer 	the world, making an impact as big as database management&#8217;s. Now it 	has retreated into niche markets.</li>
<li>Progress Software, a decent-sized 	company, put a large fraction of its energy into Apama. Little has 	happened outside the financial service sector.</li>
<li>Coral8 has some great-sounding 	ideas. But <a href="http://www.aleri.com/news/press-releases/aleri-and-coral8-merge">Coral8 	now has merged into Aleri</a>, basically a financial-markets 	specialist.</li>
<li>Mike Franklin says some ambitious 	things on behalf of Truviso, but I haven&#8217;t noticed much traction 	there either.</li>
</ul>
<p style="margin-bottom: 0in;">CEP&#8217;s penetration outside of its classical markets isn&#8217;t quite zero.  Customers include several transportation companies (various vendors), Sallie Mae (Coral8), a game vendor or two (StreamBase, if I recall correctly), Verizon (Aleri, I think), and more.  But I just wrote that list from memory &#8212; based mainly on not-so-recent deals &#8212; and a quick tour of the vendors&#8217; web sites hasn&#8217;t turned up much I overlooked.  (Truviso does have a recent deal with Technorati, but that&#8217;s not exactly a blue chip customer these days.)</p>
<p style="margin-bottom: 0in;">So far as I can tell, this is a new version of a repeated story.<span id="more-720"></span> A clever alternative to relational DBMS was invented. It proved superior in some specific applications and vertical markets. It failed to achieve much broader adoption.  Initial high hopes got dashed, companies failed to grow rapidly, and shareholders grew tired.</p>
<p style="margin-bottom: 0in;">So where will things go from here? My best guesses include:</p>
<ul>
<li>The financial trading market isn&#8217;t 	going away for CEP.  Super-low-latency is really needed there.</li>
<li>As much as I love the idea of 	<a href="../2008/10/20/coral8-proposes-cep-as-a-bi-data-platform/">CEP-infused 	BI</a>, it will be adopted only at the rate broader-based BI vendors 	can support.</li>
<li>A few niches will generate some 	business for CEP in data reduction. Leading candidates are the ones 	where there&#8217;s been a little traction to date &#8212; national 	intelligence, transportation, web analytics, and so on.</li>
<li>Sadly, that&#8217;s about it.</li>
</ul>
<p style="margin-bottom: 0in;">
<p style="margin-bottom: 0in;">
]]></content:encoded>
			<wfw:commentRss>http://www.dbms2.com/2009/03/09/independent-cep-vendors-continue-to-flounder/feed/</wfw:commentRss>
		<slash:comments>9</slash:comments>
		</item>
		<item>
		<title>Fixing Twitter in three letters:  CEP</title>
		<link>http://www.dbms2.com/2008/01/16/twitter-could-easily-be-made-reliable/</link>
		<comments>http://www.dbms2.com/2008/01/16/twitter-could-easily-be-made-reliable/#comments</comments>
		<pubDate>Wed, 16 Jan 2008 22:52:22 +0000</pubDate>
		<dc:creator>Curt Monash</dc:creator>
				<category><![CDATA[Aleri and Coral8]]></category>
		<category><![CDATA[Complex event processing (CEP)]]></category>
		<category><![CDATA[Memory-centric data management]]></category>
		<category><![CDATA[StreamBase]]></category>

		<guid isPermaLink="false">http://www.dbms2.com/2008/01/16/twitter-could-easily-be-made-reliable/</guid>
		<description><![CDATA[There&#8217;s a lot of agitation today because Twitter broke under the message volume generated during Steve Jobs&#8217; Macworld keynote. I don&#8217;t know what that volume was, but I just checked the lower volume of tweets (i.e., updates) going through the “public timeline” (i.e., everything) twice, and both times it was under 200 messages per minute. [...]]]></description>
			<content:encoded><![CDATA[<p>There&#8217;s a lot of agitation today because Twitter broke under the message volume generated during Steve Jobs&#8217; Macworld keynote.  I don&#8217;t know what that volume was, but I just checked the lower volume of tweets (i.e., updates) going through the “public timeline” (i.e., everything) twice, and both times it was under 200 messages per minute.   So, let&#8217;s say there&#8217;s a much higher volume at peak times, and also hypothesize that Twitter would like to grow a lot, and say that Twitter would like to handle 10-100,000 messages/minute – i.e., 1000+/second &#8212; as soon as possible.</p>
<p>That&#8217;s easy using <a href="http://www.dbms2.com/2007/08/10/the-essence-of-cep-according-to-coral8/">CEP</a> (<a href="http://www.dbms2.com/category/memory-centric-data-management/event-stream-processing/">Complex Event Processing</a>).  A Twitter update is just a string of 140 or fewer characters.  It is associated with three pieces of metadata – author, time, and mode of posting.   It should be visible in real time to any of the author&#8217;s “followers,” as well as in a single public timeline; perhaps there will be other kinds of Twitter channels in the future.  In most cases, these updates are only visible to a user upon page refresh.  Almost n<del datetime="2008-01-17T20:36:00+00:00">N</del>o Twitter user seems to have more than about 7,000 followers, even Robert Scoble or Evan Williams.*  The average number of followers, at least among active updaters, is probably in the low hundreds now.  So basically, this is all a heckuva lot easier than the tick-monitoring systems Wall Street firms are using today.</p>
<p><em>*<del datetime="2008-01-17T20:36:00+00:00">I believe there&#8217;s a hard cap of 7,500, but nobody seems to have bumped against it yet.</del><a href="http://www.twitterholic.com/">Twitterholic</a> gives a different figure than Twitter does for Scoble.  And it correctly shows Dave Troy with a little over 10,000.</em></p>
<p>Here&#8217;s how to implement that. <span id="more-320"></span> You start with a complex event/stream processing engine like <a href="http://www.dbms2.com/2007/08/10/coral8-versus-streambase/">StreamBase or Coral8</a>, both of which can handle a large multiple of that volume without working up a sweat.  And you basically do everything in RAM.  6 million messages per hour of under 200 bytes each?  Not a lot of RAM needed for that.  A set of 25 or so recent messages cached for each of, say, the 100,000 or 1 million most recent users?  Also not a whole lot of RAM.  And if you push out aged messages and replace them with more recent message IDs, the cache gets smaller.  Banging things to disk for persistence is an exercise left to the reader.</p>
<p>Meanwhile, <a href="http://www.scripting.com/stories/2008/01/16/aDecentralizedTwitter.html">Dave Winer views Twitter as a distributed computing challenge</a> and Larry Dignan <a href="http://blogs.zdnet.com/BTL/?p=7614">asks whether Twitter needs to be reliable at all</a> (he leans to “Yes” because of the new apps that would open up).</p>
]]></content:encoded>
			<wfw:commentRss>http://www.dbms2.com/2008/01/16/twitter-could-easily-be-made-reliable/feed/</wfw:commentRss>
		<slash:comments>11</slash:comments>
		</item>
	</channel>
</rss>

