<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
		>
<channel>
	<title>Comments on: Facebook, Hadoop, and Hive</title>
	<atom:link href="http://www.dbms2.com/2009/05/11/facebook-hadoop-and-hive/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.dbms2.com/2009/05/11/facebook-hadoop-and-hive/</link>
	<description>Choices in data management and analysis</description>
	<lastBuildDate>Mon, 22 Feb 2010 20:56:20 -0500</lastBuildDate>
	<generator>http://wordpress.org/?v=2.8.4</generator>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
		<item>
		<title>By: Curt Monash</title>
		<link>http://www.dbms2.com/2009/05/11/facebook-hadoop-and-hive/comment-page-1/#comment-160014</link>
		<dc:creator>Curt Monash</dc:creator>
		<pubDate>Mon, 22 Feb 2010 20:56:20 +0000</pubDate>
		<guid isPermaLink="false">http://www.dbms2.com/?p=775#comment-160014</guid>
		<description>Hadoop has some level of load balancing. That&#039;s essential to any implementation of MapReduce. If it&#039;s not good enough for you -- well, Hadoop is open source, so dive in and change it.</description>
		<content:encoded><![CDATA[<p>Hadoop has some level of load balancing. That&#8217;s essential to any implementation of MapReduce. If it&#8217;s not good enough for you &#8212; well, Hadoop is open source, so dive in and change it.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Parag Arora</title>
		<link>http://www.dbms2.com/2009/05/11/facebook-hadoop-and-hive/comment-page-1/#comment-159987</link>
		<dc:creator>Parag Arora</dc:creator>
		<pubDate>Mon, 22 Feb 2010 12:55:35 +0000</pubDate>
		<guid isPermaLink="false">http://www.dbms2.com/?p=775#comment-159987</guid>
		<description>How is the load balancing to different nodes assured? Does Hadoop internally handles it or we need to define a distribution mechanism?</description>
		<content:encoded><![CDATA[<p>How is the load balancing to different nodes assured? Does Hadoop internally handles it or we need to define a distribution mechanism?</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Boston Big Data Summit keynote outline &#124; DBMS2 -- DataBase Management System Services</title>
		<link>http://www.dbms2.com/2009/05/11/facebook-hadoop-and-hive/comment-page-1/#comment-157166</link>
		<dc:creator>Boston Big Data Summit keynote outline &#124; DBMS2 -- DataBase Management System Services</dc:creator>
		<pubDate>Thu, 28 Jan 2010 11:24:55 +0000</pubDate>
		<guid isPermaLink="false">http://www.dbms2.com/?p=775#comment-157166</guid>
		<description>[...] Big ones &#8212; retail-oriented ones (eBay, Amazon) partially excepted &#8212; rolled their own technology stacks [...]</description>
		<content:encoded><![CDATA[<p>[...] Big ones &#8212; retail-oriented ones (eBay, Amazon) partially excepted &#8212; rolled their own technology stacks [...]</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Enterprise Data Base Building In The Social Era &#124; Digital Savant</title>
		<link>http://www.dbms2.com/2009/05/11/facebook-hadoop-and-hive/comment-page-1/#comment-152950</link>
		<dc:creator>Enterprise Data Base Building In The Social Era &#124; Digital Savant</dc:creator>
		<pubDate>Tue, 15 Dec 2009 23:48:05 +0000</pubDate>
		<guid isPermaLink="false">http://www.dbms2.com/?p=775#comment-152950</guid>
		<description>[...] utilizes the open source Hadoop/Hive system, which &#8220;ingests 15 terabytes of new data per day&#8221;. The amount of data coming in, telling details like favorite music, locations, books, thoughts and [...]</description>
		<content:encoded><![CDATA[<p>[...] utilizes the open source Hadoop/Hive system, which &#8220;ingests 15 terabytes of new data per day&#8221;. The amount of data coming in, telling details like favorite music, locations, books, thoughts and [...]</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Yahoo wants to do decapetabyte-scale data warehousing in Hadoop &#124; DBMS2 -- DataBase Management System Services</title>
		<link>http://www.dbms2.com/2009/05/11/facebook-hadoop-and-hive/comment-page-1/#comment-152426</link>
		<dc:creator>Yahoo wants to do decapetabyte-scale data warehousing in Hadoop &#124; DBMS2 -- DataBase Management System Services</dc:creator>
		<pubDate>Sat, 12 Dec 2009 05:59:24 +0000</pubDate>
		<guid isPermaLink="false">http://www.dbms2.com/?p=775#comment-152426</guid>
		<description>[...] Language) for Hadoop, which is however getting a SQL interface. And we talked about Pig vs. Hive. But I recently heard a rumor all that is in flux, so I won&#8217;t write it up [...]</description>
		<content:encoded><![CDATA[<p>[...] Language) for Hadoop, which is however getting a SQL interface. And we talked about Pig vs. Hive. But I recently heard a rumor all that is in flux, so I won&#8217;t write it up [...]</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: How 30+ enterprises are using Hadoop &#124; DBMS2 -- DataBase Management System Services</title>
		<link>http://www.dbms2.com/2009/05/11/facebook-hadoop-and-hive/comment-page-1/#comment-143829</link>
		<dc:creator>How 30+ enterprises are using Hadoop &#124; DBMS2 -- DataBase Management System Services</dc:creator>
		<pubDate>Fri, 16 Oct 2009 11:36:20 +0000</pubDate>
		<guid isPermaLink="false">http://www.dbms2.com/?p=775#comment-143829</guid>
		<description>[...] to a relational DBMS; many others just leave it in HDFS (Hadoop Distributed File System), e.g. with Hive as the query language, or in exactly one case [...]</description>
		<content:encoded><![CDATA[<p>[...] to a relational DBMS; many others just leave it in HDFS (Hadoop Distributed File System), e.g. with Hive as the query language, or in exactly one case [...]</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Fault-tolerant queries &#124; DBMS2 -- DataBase Management System Services</title>
		<link>http://www.dbms2.com/2009/05/11/facebook-hadoop-and-hive/comment-page-1/#comment-140268</link>
		<dc:creator>Fault-tolerant queries &#124; DBMS2 -- DataBase Management System Services</dc:creator>
		<pubDate>Sun, 13 Sep 2009 16:07:48 +0000</pubDate>
		<guid isPermaLink="false">http://www.dbms2.com/?p=775#comment-140268</guid>
		<description>[...] we discussed this subject a few months ago in a couple of comment threads, it seemed to be the case [...]</description>
		<content:encoded><![CDATA[<p>[...] we discussed this subject a few months ago in a couple of comment threads, it seemed to be the case [...]</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Cloudera presents the MapReduce bull case &#124; DBMS2 -- DataBase Management System Services</title>
		<link>http://www.dbms2.com/2009/05/11/facebook-hadoop-and-hive/comment-page-1/#comment-140226</link>
		<dc:creator>Cloudera presents the MapReduce bull case &#124; DBMS2 -- DataBase Management System Services</dc:creator>
		<pubDate>Sat, 12 Sep 2009 23:31:34 +0000</pubDate>
		<guid isPermaLink="false">http://www.dbms2.com/?p=775#comment-140226</guid>
		<description>[...] 10 terabytes/day of data ingested via Hadoop (Edit: Some of these metrics have been updated in a subsequent post about Facebook.) [...]</description>
		<content:encoded><![CDATA[<p>[...] 10 terabytes/day of data ingested via Hadoop (Edit: Some of these metrics have been updated in a subsequent post about Facebook.) [...]</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Confluence: Edmunds Central</title>
		<link>http://www.dbms2.com/2009/05/11/facebook-hadoop-and-hive/comment-page-1/#comment-139986</link>
		<dc:creator>Confluence: Edmunds Central</dc:creator>
		<pubDate>Thu, 10 Sep 2009 19:26:00 +0000</pubDate>
		<guid isPermaLink="false">http://www.dbms2.com/?p=775#comment-139986</guid>
		<description>&lt;strong&gt;Hadoop POC Environment...&lt;/strong&gt;

Overview  The Software Development team is currently researching a technology solution for addressing our ingestion and processing of weblog data that is reaching the limits of the current system based upon Informatica and Oracle RDBMS.......</description>
		<content:encoded><![CDATA[<p><strong>Hadoop POC Environment&#8230;</strong></p>
<p>Overview  The Software Development team is currently researching a technology solution for addressing our ingestion and processing of weblog data that is reaching the limits of the current system based upon Informatica and Oracle RDBMS&#8230;&#8230;.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Announcing release of HadoopDB (longer version) &#124; Bookmarks</title>
		<link>http://www.dbms2.com/2009/05/11/facebook-hadoop-and-hive/comment-page-1/#comment-132222</link>
		<dc:creator>Announcing release of HadoopDB (longer version) &#124; Bookmarks</dc:creator>
		<pubDate>Thu, 23 Jul 2009 20:35:44 +0000</pubDate>
		<guid isPermaLink="false">http://www.dbms2.com/?p=775#comment-132222</guid>
		<description>[...] their analytical database system and claiming data warehouses of size more than a petabyte (see the end of this write-up for some links to large data warehouses).The second trend is what I talked about in my last blog [...]</description>
		<content:encoded><![CDATA[<p>[...] their analytical database system and claiming data warehouses of size more than a petabyte (see the end of this write-up for some links to large data warehouses).The second trend is what I talked about in my last blog [...]</p>
]]></content:encoded>
	</item>
</channel>
</rss>

<!-- Dynamic page generated in 0.272 seconds. -->
<!-- Cached page generated by WP-Super-Cache on 2010-02-22 16:57:38 -->
