June 19, 2012

Hadoop distributions: CDH 4, HDP 1, Hadoop 2.0, Hadoop 1.0 and all that

This is part of a four-post series, covering:

The posts depend on each other in various ways.

My clients at Cloudera and Hortonworks have somewhat different views as to the maturity of various pieces of Hadoop technology. In particular:

*“CDH” stands, due to some trademarking weirdness, for “Cloudera’s Distribution including Apache Hadoop”. “HDP” stands for “Hortonworks Data Platform”.

The whole thing seems like a big example of Miles’ Law: Where you stand depends upon where you sit. Cloudera’s embrace of more advanced Apache Hadoop technology is accompanied by claims such as “We built a lot of this ourselves” and “We’ve already tested this stuff at length.” I find Cloudera’s claims credible, and look forward to Hortonworks’ near-future declarations that those Hadoop 2.0 features are “now” enterprise-ready.

For HCatalog, however, the situations are reversed.

For now, my views on selecting Hadoop distributions start:

Comments

11 Responses to “Hadoop distributions: CDH 4, HDP 1, Hadoop 2.0, Hadoop 1.0 and all that”

  1. “Enterprise-ready Hadoop” | DBMS 2 : DataBase Management System Services on June 19th, 2012 8:44 pm

    [...] Hadoop versions and distributions, and their readiness or lack thereof for production. [...]

  2. Hadoop marketing themes that deserve to be ignored | DBMS 2 : DataBase Management System Services on June 19th, 2012 8:46 pm

    [...] Hadoop versions and distributions, and their readiness or lack thereof for production. [...]

  3. Jim Walker on June 20th, 2012 12:47 am

    Thank you for the post. I believe this is one of, if not THE first comparison studies of Apache Hadoop commercial distributions.

    We must note however, that while HCatalog is a big part of HDP, it is not shipped as part of either CDH4 or CDH3. (As reference, here is a link to the CDH package http://tinyurl.com/7tvs3qz.)

    Thanks again, Curt…

  4. Erik Paulson on June 20th, 2012 2:42 pm

    Curt –

    You write “As does CDH 4, HDW” – did you mean HDP there, or is HDW a feature in something?

  5. Curt Monash on June 20th, 2012 5:32 pm

    HDW is a typo for HDP. I thought I’d fixed all the instances of that. Let me go back and search for more. :)

  6. Hadoop YARN — beyond MapReduce | DBMS 2 : DataBase Management System Services on July 23rd, 2012 1:26 am

    [...] made the marketing decision to include YARN in its June Hadoop distribution release anyway, but advised that it was for experimentation rather than [...]

  7. HCatalog — yes, it matters | DBMS 2 : DataBase Management System Services on August 8th, 2012 7:26 am

    [...] ETL tools such as Talend. [...]

  8. Hadoop distributions | DBMS 2 : DataBase Management System Services on February 28th, 2013 3:22 am

    [...] straddles Hadoop 1 and Hadoop 2, shipping aspects of Hadoop 2 but not recommending them for production [...]

  9. Teradata SQL-H, using HCatalog | DBMS 2 : DataBase Management System Services on April 15th, 2013 2:45 am

    [...] interesting idea, and a good hook for my first shot at writing about HCatalog. Indeed, other than the Talend integration bundled into Hortonworks’ HDP 1, Teradata SQL-H is the first real use of HCatalog I’m aware [...]

  10. Hortonworks, Hadoop, Stinger and Hive | DBMS 2 : DataBase Management System Services on August 7th, 2013 2:51 am

    [...] I think Cloudera’s earlier and necessarily incremental Hadoop 2 rollout was a better choice than Hortonworks’ later big bang, even though the core-mission aspect of [...]

