May 13, 2009

Microsoft announced CEP this week too

Microsoft still hasn’t worked out all the kinks regarding when and how intensely to brief me. So most of what I know about their announcement earlier this week of a CEP/stream processing product* is what I garnered on a consulting call in March. That said, I sent Microsoft my notes from that call, they responded quickly and clearly to my question as to what remained under NDA, and for good measure they included a couple of clarifying comments that I’ll copy below.

*”in the SQL Server 2008 R2 timeframe,” about which Microsoft wrote “the first Community Technology Preview (CTP) of SQL Server 2008 R2 will be available for download in the second half of 2009 and the release is on track to ship in the first half of calendar year 2010. “

Perhaps it is more than coincidence that IBM rushed out its own announcement of an immature CEP technology — due to be more mature in a 2010 release — immediately after Microsoft revealed its plans. Anyhow, taken together, these announcements support my theory that the small independent CEP/stream processing vendors are more or less ceding broad parts of the potential stream processing market.

The main use cases Microsoft talks about for CEP are in the area of sensor data. For example, Microsoft has prospects or customers who have many pieces of manufacturing or resource extraction equipment each. Each may spawn only a few messages per second, but overall there can be 1000s of messages/second, or indeed terabytes of data/day. (The orders of magnitude don’t quite match up there, but we were speaking pretty vaguely anyway.)

Microsoft called out four reasons to me why CEP might be needed in addition to ordinary database processing. Two are the standard reasons for data reduction:

1. Without CEP, you can’t bang the data into the database fast enough.

2. You don’t want to keep most of the data past a short time window anyway.

The other two are also fairly standard reasons for using CEP:

3. Standard SQL isn’t all that great for time series anyway.

4. CEP use cases often call for incremental processing and/or parameterization of queries, something CEP engines are commonly better designed for than are DBMS.

However, Microsoft seems to be taking a somewhat different approach to time-based SQL extensions than some other vendors. To quote email Microsoft sent today:

Microsoft Research (MSR) introduced the temporal extensions to relational algebra based upon a notion of application time that is independent of system time. It matters when the event originated instead of when they arrived at the processing system. Further it treats each event as being associated with an interval of time as opposed to a point in time. This helps in modeling certain real life phenomenon naturally. [StreamBase et al.] also reason about multiple streams. Both the approaches are extensions to relational algebra. The MSR approach took the algebra as the starting point while StreamBase took an existing language over the algebra – SQL as the starting point. The MSR approach consequently avoids having to rework other elements of the SQL surface. The primary language extensions through which this algebra will be exposed initially is LINQ.

Microsoft’s CEP capability is “fully integrated” with Visual Studio. There will be lots of adapters, both for inputs and outputs, with perhaps the most interesting non-obvious one being Excel charts. I definitely like the idea of CEP engines doing a good job of integrating both with dashboards/BI and with operational apps, because if you get value from one of those integrations, you’re apt to quickly want the other as well.

Microsoft told me that its CEP is written in a combination of “managed” and “native” code, where “managed” code, in Microsoft lingo, is more an issue of memory management than code. Noticing that I was confused on this point, Microsoft elucidated by email:

The implementation is built around getting the technologies available in the CLR and native code together to build the best implementation possible. We use the ability to do JIT code generation to efficiently evaluate expressions and back it up with very effective native memory management techniques.

Comments

8 Responses to “Microsoft announced CEP this week too”

  1. Microsoft announced CEP, real time BI on the horizon ? | Kasper de Jonge BI Blog on May 14th, 2009 7:01 am

    [...] announced earlier this week that a CEP/stream processing product will be included in SQL 2008 R2.  Complex Event Processing, or CEP, is primarily an event processing concept that deals with the [...]

  2. Ashish Thusoo on May 18th, 2009 1:04 am

    I have always wondered how real time BI needs to be. I can understand the need for CEP for applications like trading or sensor alerts where sub second response is absolutely desired, but my guess is that for most of the BI applications out there people can withstand an hours delay to get to the charts and reports and I think most of the current DW technologies would be able to support that. How mainstream is the business case of real time BI vs near real time BI? Thoughts?

  3. Curt Monash on May 18th, 2009 2:44 am

    Ashish,

    Use cases for sub-hour BI include:

    • Customer interaction — both websites and call centers, and in a few cases bricks-and-mortar.
    • Network operations.
    • Other equipment monitoring.
    • Other sensor networks.
    • In a few cases supply chain and/or logistics.
    • Trading
    • .

    Those are the main ones I can quickly think of.

  4. CEP: The Tech That Dare Not Speak Its Name « Market Strategies for IT Suppliers on May 18th, 2009 10:11 pm

    [...] because Sam Palmisano wanted to talk about it at a recent analyst event. Or (Curt Monash theorizes here) because Microsoft announced that their CEP offering will be in SQL Server Real Soon Now. IBM [...]

  5. Ashish Thusoo on May 20th, 2009 10:11 am

    Thanks Curt. So basically monitoring or optimization problems that fit the bill here, while general reporting is still ok with traditional batch oriented system.

  6. Curt Monash on May 20th, 2009 3:13 pm

    Ashish,

    Ideally, I’d like to get away from nightly batch jobs for ANYTHING. But there are certainly cases where they don’t hurt much.

  7. Gary Mintchell's Feed Forward on August 27th, 2009 4:01 pm

    bunch of thoughts…

  8. Bunch of thoughts | The Manufacturing Connection on June 27th, 2013 6:25 pm

    [...] Is Microsoft releasing a streaming database product that could compete with historians? [...]

Leave a Reply




Feed: DBMS (database management system), DW (data warehousing), BI (business intelligence), and analytics technology Subscribe to the Monash Research feed via RSS or email:

Login

Search our blogs and white papers

Monash Research blogs

User consulting

Building a short list? Refining your strategic plan? We can help.

Vendor advisory

We tell vendors what's happening -- and, more important, what they should do about it.

Monash Research highlights

Learn about white papers, webcasts, and blog highlights, by RSS or email.