October 22, 2014

Is analytic data management finally headed for the cloud?

It seems reasonable to wonder whether analytic data management is headed for the cloud. In no particular order:

Also — although the specifics on this are generally vague and/or confidential — I sense a narrowing of the gap between:

*Cazena is proud of its team of advisors. However, the only person yet announced for a Cazena operating role is Prat Moghe, and his time period in Netezza’s mainstream happens not to have been one in which Netezza had much technical or market accomplishment.

On the other hand:

And so I’m still more confident in SaaS/colocation analytic data management, or in Redshift, than I am in true arm’s-length cloud-based systems.


3 Responses to “Is analytic data management finally headed for the cloud?”

  1. J. Andrew Rogers on October 22nd, 2014 7:02 pm

    In my experience, machine-generated data is creating a strong economic driver toward fine-grained geo-federated data management rather than cloud per se. It makes sense to push a lot of the analytics all the way out to the edge, very close to the generating source, such that there is no meaningful cloud deployment model.

    The driver is insufficient or expensive bandwidth between the location where it is generated and a sufficiently large data center, cloud or otherwise, where it can be aggregated in the raw and analyzed. Even with aggressive filtering at the edge, a petabyte per day of rich, operational data is not uncommon. There is a desire to analyze the data in place as part of a federated analytic rather than culling and backhauling, which loses too much context.

    We work with a lot of very fast, very large machine-generated data sources primarily for the purposes of spatial analytics. Much of this is in the cloud on big commodity clusters. One of the most common asks by customers is if our platform can be deployed as a fine-grained geo-federated system so that many of the analytics can run adjacent to the collection platform to reduce bandwidth requirements.

    For machine-generated (“IoT”), sensor, and other spatially organized data models this actually makes a lot sense because the topology of reality matches the topology of the spatial analysis algorithms. Spatial joins and aggregates across various data sources would have a lot of physical network locality when organized this way, which reduces analysis latency while increasing the effective bandwidth and practically supportable data volumes.

  2. David Gruzman on October 23rd, 2014 6:13 am

    In many cases reserved cluster instances are used for big data analytic in amazon cloud.
    I would argue that it is something in the middle between cloud and collocation, even closer to collocation.
    Why it is similar to collocation:
    – It is predefined hardware.
    – It is guaranteed network (10 GB), but only when all servers allocated together in one placement group.
    – It is too expensive to take on-demand, so it is used as reserved (less elastic).
    Why it still resemble cloud:
    – API for provisioning is available.
    – Cloud services, like S3 and EBS are available.

  3. Ranko Mosic on October 24th, 2014 10:07 am

    Network bandwith is still the biggest obstacle to wider cloud analytics adoption. Moving very large volumes of data associated with analytics is all but impossible for most enterprises under current network infrastructure limitations. As Curt and Andrew said above, machine ( cloud ) generated data is best fit for cloud analytics, as data is already out there so it either has to/can be processed in situ or moved to cloud processing site.

Leave a Reply

Feed: DBMS (database management system), DW (data warehousing), BI (business intelligence), and analytics technology Subscribe to the Monash Research feed via RSS or email:


Search our blogs and white papers

Monash Research blogs

User consulting

Building a short list? Refining your strategic plan? We can help.

Vendor advisory

We tell vendors what's happening -- and, more important, what they should do about it.

Monash Research highlights

Learn about white papers, webcasts, and blog highlights, by RSS or email.