October 23, 2011

NoSQL notes

Last week I visited with James Phillips of Couchbase, Max Schireson and Eliot Horowitz of 10gen, and Todd Lipcon, Eric Sammer, and Omer Trajman of Cloudera. I guess it’s time for a round-up NoSQL post. :)

Views of the NoSQL market horse race are reasonably consistent, with perhaps some elements of “Where you stand depends upon where you sit.”

*I hope to do a separate post on HBase adoption soon. In connection with that, any info on HBase adoption by Facebook (said to be very heavy), Twitter, et al. would be much appreciated.

The reasons for using NoSQL of course are, in some order, dynamic schemas, scale-out, and open source. I find the scale-out argument somewhat bogus,* but the data model one is very real. Depending on whom you talk with, the most important point about dynamic schemas may actually be that they’re changeable, or it may just be that you don’t have to specify a schema at the time of initial application design. MongoDB gets particular praise as a good platform on which to throw something together quickly, although predictions as to how far the application will then scale may differ depending on whether you’re talking with, say, Max or Todd.

*It’s fair to say that NoSQL systems are more proven in scale-out than most relational DBMS. Even so, I would cringe at any line of reasoning that concluded one should adopt NoSQL because it is more mature than relational alternatives.

Finally, I was perhaps too extreme when I suggested there was no good reason for Oracle to have adopted the major key/minor key approach it took in its NoSQL offering. Todd offered a reason why that approach – which he characterized as similar to Project Voldemort’s – could make sense:

I’d be interested to learn about the Couchbase and MongoDB answers to that challenge.

Comments

12 Responses to “NoSQL notes”

  1. Vlad Rodionov on October 24th, 2011 2:17 pm

    MongoDB and Couchbase are single-rack solutions right now afaik. Although I am not a big fun of HBase but potentially it can scale to hundreds and thousands of servers (Facebook will probably prove it as soon).

  2. Jean-Daniel Cryans on October 24th, 2011 2:33 pm

    Re HBase adoption at FB:

    This preso gives some good numbers: http://www.slideshare.net/brizzzdotcom/facebook-messages-hbase

    The FB guys also maintain a HBase@FB group: https://www.facebook.com/UsingHbase

    The latest message there reads: “Fun fact Facebook scaling fact of the day: the HBase clusters supporting the messages product have over 1 Petabyte of online capacity”

    For Twitter, AFAIK their HBase setup is fairly limited and they don’t disclose a lot of information about it.

    I’d also like to note that Huawei (http://en.wikipedia.org/wiki/Huawei) has a growing presence within the Hadoop and HBase dev communities.

    (Disclaimer: I’m an HBase committer)

  3. Curt Monash on October 24th, 2011 3:47 pm

    Thanks!!

    Please hit me with any other examples you think I should be aware of. :)

  4. Shashank Tiwari on October 26th, 2011 5:21 pm

    I like this post on NoSQL but I think its honestly extremely influenced by the opinions of very few. You may want to approach NoSQL from 3 different standpoints — (a) origins/inspiration — Google Bigtable, Amazon Dynamo, or any other (b) use cases — not all NoSQL databases are made equal or try to solve the same problem (c) Features like ability to scale-out, less stringent schema or no schema, map-reduce style parallel processing of large data sets, etc…I wrote a book explaining some of these. Look at Professional NoSQL (Wiley, 2011) and feel free to get in touch and I would be happy to explain some of these in greater detail.

  5. NoSQL notes « Another Word For It on October 26th, 2011 6:58 pm

    [...] NoSQL notes [...]

  6. Curt Monash on October 26th, 2011 7:13 pm

    @Shashank,

    Have you seen some of my other posts that address most of the issues you cite?

  7. Andrew Purtell on October 28th, 2011 4:58 pm

    Trend Micro has about 100 HBase nodes running in secure configuration, a modest data point by size but interesting for two reasons:

    1) “Non-traditional” adopter, if that term makes any sense for nonrelational database technology; Trend is not a Facebook or a Twitter

    2) This is a secure variant of HBase, integrated with Hadoop’s Kerberos authentication and supporting a familiar permissions grant model on the column family and table levels (ADMIN, CREATE, READ, WRITE, etc.) The enabling features are going in to 0.92 release I believe.

    (Disclaimer: I’m an HBase committer.)

  8. Andrew Purtell on October 28th, 2011 5:05 pm

    Also, we recently welcomed a committer from Salesforce.com to the project, so I think we shall see HBase address in more ways enterprise-y concerns: further attention to multitenancy (user isolation), and inclusion of constraints checking and transactional semantics as they make sense applied to the BigTable system model.

  9. Curt Monash on October 28th, 2011 5:05 pm

    Thanks, guys! Keep ‘em coming!

  10. Mahesh Paolini-Subramanya on November 1st, 2011 12:26 am

    James *would* say that wouldn’t he? :-)
    Seriously though, CouchDB (base?) will be pretty damn cool once it goes live (‘specially if/when they can merge in the cloudant/dynamo stuff), but for now Redis and HBase are definitely in the mix.

  11. Jeff Darcy (@Obdurodon) on November 1st, 2011 9:12 am

    I don’t disagree with the conclusion, but it’s disappointing the Voldemort and Riak don’t seem to be getting the same mindshare as these three. I hope someone from one of those teams will stop by and tell us something about their larger deployments, because I think they’re both fine projects that people should seriously consider alongside those mentioned.

  12. Chris Westin on December 31st, 2011 10:29 am

    @Obdurodon: In this context, Voldemort’s uptake is probably limited because of the lack of commercial support available. There are commercial entities you can buy support from for all the other offerings mentioned. A lot of application owners probably feel nervous about adopting something that doesn’t offer them somewhere to turn to if they have problems.

    (Disclaimer: I work for 10gen.)

Leave a Reply




Feed: DBMS (database management system), DW (data warehousing), BI (business intelligence), and analytics technology Subscribe to the Monash Research feed via RSS or email:

Login

Search our blogs and white papers

Monash Research blogs

User consulting

Building a short list? Refining your strategic plan? We can help.

Vendor advisory

We tell vendors what's happening -- and, more important, what they should do about it.

Monash Research highlights

Learn about white papers, webcasts, and blog highlights, by RSS or email.