Two of the more interesting approaches for integrating Hadoop and MapReduce with relational DBMS come from my clients at Teradata Aster (via SQL/MR and SQL-H) and Hadapt. In both cases, the story starts:
- You can dump any kind of data you want into Hadoop’s file system.
- You can have data in a scale-out RDBMS to get good performance on analytic SQL.
- You can access all the data (not just the relationally stored part) via SQL.
- You can do MapReduce on all the data (not just the Hadoop-stored part).
To varying degrees, Hadapt and Aster each offer three kinds of advantage over Hadoop-with-Hive:
- SQL performance is (much) better.
- SQL functionality is better.
- At least some of your employees — the “business analysts” — can invoke MapReduce processes through SQL, if somebody else (e.g. your techies or the vendor’s) coded them up in the first place.
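That division of labor — techies code the MapReduce logic once, analysts merely invoke it — can be sketched in plain Python. The function names and the sessionization example below are illustrative assumptions, not any vendor's actual SQL/MR API; the point is only that the analyst's call site stays as simple as a SQL function invocation.

```python
from collections import defaultdict

# A techie (or the vendor) writes the MapReduce logic once...
def sessionize_mapper(row):
    # Emit (key, value) pairs: here, timestamps keyed by user.
    yield (row["user"], row["ts"])

def sessionize_reducer(user, timestamps, gap=30):
    # Count sessions: a new session starts after `gap` seconds of silence.
    sessions = 0
    last = None
    for ts in sorted(timestamps):
        if last is None or ts - last > gap:
            sessions += 1
        last = ts
    return {"user": user, "sessions": sessions}

def run_mapreduce(rows, mapper, reducer, **kwargs):
    # Shuffle: group mapper output by key, then reduce each group.
    groups = defaultdict(list)
    for row in rows:
        for key, value in mapper(row):
            groups[key].append(value)
    return [reducer(k, vs, **kwargs) for k, vs in groups.items()]

# ...and an analyst just invokes it, much as they would call a
# prepackaged SQL/MR function from inside a SELECT statement.
clicks = [
    {"user": "a", "ts": 0}, {"user": "a", "ts": 10},
    {"user": "a", "ts": 100}, {"user": "b", "ts": 5},
]
result = run_mapreduce(clicks, sessionize_mapper, sessionize_reducer, gap=30)
```

The analyst never touches the mapper or reducer internals — only the invocation, which is the same pitch both vendors make for SQL-invoked MapReduce.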
Of course, there are plenty of differences. Those start:
- Teradata Aster is at a whole different stage of corporate and product maturity than Hadapt (even if some crucial Aster/Hadoop features are brand new).
- Aster and Hadoop clusters are separate, even if they can be run on different nodes in the same appliance. Hadapt’s RDBMS runs on the same nodes as HDFS (Hadoop Distributed File System), or optionally MapR’s HDFS alternative.
- The Aster approach involves two kinds of MapReduce. If you want to do MapReduce involving data stored in the Aster RDBMS, you should use Aster’s SQL/MR, not Hadoop MapReduce.
- Teradata Aster encourages appliance deployment (although commodity hardware and even the cloud are options). Hadapt encourages Hadoop-style commodity hardware. I imagine there’s a considerable software price difference as well.
As for use cases — for starters, please note that a large fraction of analytic inquiries are ultimately about people. And when you’re looking at people, there are a whole lot of data sources you can consult. Many are clearly relational; increasingly, however, some are not. What’s more, people are hard to assess and understand, so you may want to take multiple tries at refining your analysis.
So right there you have an argument for flexible investigative or iterative analytics, over multi-structured (and relational) data. And if you think about how to combine information from all those data sources — well, it’s likely that SOME of the analytic steps will be a lot like joins.
That sure sounds like Hadoop/RDBMS integration to me.
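One of those join-like steps can be sketched concretely: hash the relational side on a shared key, then stream the ragged, multi-structured side against it. The customer and event records below are invented for illustration.

```python
# Relational side: customer records, as they might live in the RDBMS.
customers = [
    {"id": 1, "name": "Alice", "segment": "premium"},
    {"id": 2, "name": "Bob", "segment": "basic"},
]

# Multi-structured side: event records with varying fields,
# as they might land raw in HDFS.
events = [
    {"customer_id": 1, "type": "click", "page": "/home"},
    {"customer_id": 1, "type": "search", "query": "hadoop"},
    {"customer_id": 2, "type": "click", "page": "/pricing"},
]

# The join-like analytic step: build a hash table on the relational
# key, then probe it with each event record.
by_id = {c["id"]: c for c in customers}
joined = [
    {**by_id[e["customer_id"]], **e}
    for e in events
    if e["customer_id"] in by_id
]
```

Whether that step runs as SQL in the RDBMS or as a MapReduce job over HDFS is exactly the choice these integrated architectures are meant to give you.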