<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
		>
<channel>
	<title>Comments on: The Great MapReduce Debate</title>
	<atom:link href="http://www.dbms2.com/2008/01/18/the-great-mapreduce-debate/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.dbms2.com/2008/01/18/the-great-mapreduce-debate/</link>
	<description>Choices in data management and analysis</description>
	<lastBuildDate>Thu, 09 Feb 2012 09:22:14 +0000</lastBuildDate>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.0.3</generator>
	<item>
		<title>By: NoSQL Daily &#8211; Sat Nov 13 &#8250; PHP App Engine</title>
		<link>http://www.dbms2.com/2008/01/18/the-great-mapreduce-debate/#comment-190730</link>
		<dc:creator>NoSQL Daily &#8211; Sat Nov 13 &#8250; PHP App Engine</dc:creator>
		<pubDate>Sat, 13 Nov 2010 01:16:13 +0000</pubDate>
		<guid isPermaLink="false">http://www.dbms2.com/2008/01/18/the-great-mapreduce-debate/#comment-190730</guid>
		<description>[...] The Great MapReduce Debate &#124; DBMS2 : DataBase Management System Services [...]</description>
		<content:encoded><![CDATA[<p>[...] The Great MapReduce Debate | DBMS2 : DataBase Management System Services [...]</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: CAP equivalent for analytics? &#171; Big Data Craft</title>
		<link>http://www.dbms2.com/2008/01/18/the-great-mapreduce-debate/#comment-186965</link>
		<dc:creator>CAP equivalent for analytics? &#171; Big Data Craft</dc:creator>
		<pubDate>Sun, 10 Oct 2010 16:56:26 +0000</pubDate>
		<guid isPermaLink="false">http://www.dbms2.com/2008/01/18/the-great-mapreduce-debate/#comment-186965</guid>
		<description>[...] it will make suboptimal (considering my model context not in wider sense!) great SV system. The great MapReduce debate is not for [...]</description>
		<content:encoded><![CDATA[<p>[...] it will make suboptimal (considering my model context not in wider sense!) great SV system. The great MapReduce debate is not for [...]</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Search Facets &#187; MapReduce just semi-good for semi-structured data</title>
		<link>http://www.dbms2.com/2008/01/18/the-great-mapreduce-debate/#comment-156144</link>
		<dc:creator>Search Facets &#187; MapReduce just semi-good for semi-structured data</dc:creator>
		<pubDate>Mon, 18 Jan 2010 21:27:47 +0000</pubDate>
		<guid isPermaLink="false">http://www.dbms2.com/2008/01/18/the-great-mapreduce-debate/#comment-156144</guid>
		<description>[...] like Aster and Greenplum, followed fairly quickly by others such as Netezza and (somewhat surprisingly) [...]</description>
		<content:encoded><![CDATA[<p>[...] like Aster and Greenplum, followed fairly quickly by others such as Netezza and (somewhat surprisingly) [...]</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Bazy danych bez SQL &#171; data mining à la polonaise</title>
		<link>http://www.dbms2.com/2008/01/18/the-great-mapreduce-debate/#comment-150565</link>
		<dc:creator>Bazy danych bez SQL &#171; data mining à la polonaise</dc:creator>
		<pubDate>Wed, 25 Nov 2009 01:00:32 +0000</pubDate>
		<guid isPermaLink="false">http://www.dbms2.com/2008/01/18/the-great-mapreduce-debate/#comment-150565</guid>
		<description>[...] danych, warto słuchać, co ma do powiedzenia Michael Stonebraker. Szczególnie, gdy bierze się za obronę systemów zarządzania bazami danych przed różnymi zakusami, np. przed powrotem do pre-relacyjnej [...]</description>
		<content:encoded><![CDATA[<p>[...] danych, warto słuchać, co ma do powiedzenia Michael Stonebraker. Szczególnie, gdy bierze się za obronę systemów zarządzania bazami danych przed różnymi zakusami, np. przed powrotem do pre-relacyjnej [...]</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Rod Oldehoeft</title>
		<link>http://www.dbms2.com/2008/01/18/the-great-mapreduce-debate/#comment-95822</link>
		<dc:creator>Rod Oldehoeft</dc:creator>
		<pubDate>Wed, 27 Aug 2008 00:46:33 +0000</pubDate>
		<guid isPermaLink="false">http://www.dbms2.com/2008/01/18/the-great-mapreduce-debate/#comment-95822</guid>
		<description>Check out MRNet:
www.paradyn.org/mrnet/
It is becoming the de facto utility for implementing scalable Multicasts and Reductions on high-performance technical computing systems, especially for performance analysis tools.  Unlike Hadoop MapReduce, which is file-based, MRNet uses one or more tree-based overlay networks (TBON) over the physical network topology of current (IBM BG/*, Cray XT*) high-end systems (and clusters, too, of course).  Each TBON uses the same filter function when used for reduction, so for different filtering functions you instantiate a TBON for each.  Unlike MapReduce, MRNet also implements multicast communication over the same TBONs.

It&#039;s not a database system, either.</description>
		<content:encoded><![CDATA[<p>Check out MRNet:<br />
<a href="http://www.paradyn.org/mrnet/" rel="nofollow">http://www.paradyn.org/mrnet/</a><br />
It is becoming the de facto utility for implementing scalable Multicasts and Reductions on high-performance technical computing systems, especially for performance analysis tools.  Unlike Hadoop MapReduce, which is file-based, MRNet uses one or more tree-based overlay networks (TBON) over the physical network topology of current (IBM BG/*, Cray XT*) high-end systems (and clusters, too, of course).  Each TBON uses the same filter function when used for reduction, so for different filtering functions you instantiate a TBON for each.  Unlike MapReduce, MRNet also implements multicast communication over the same TBONs.</p>
<p>It&#8217;s not a database system, either.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Greg Holmberg</title>
		<link>http://www.dbms2.com/2008/01/18/the-great-mapreduce-debate/#comment-90247</link>
		<dc:creator>Greg Holmberg</dc:creator>
		<pubDate>Wed, 09 Jul 2008 23:39:49 +0000</pubDate>
		<guid isPermaLink="false">http://www.dbms2.com/2008/01/18/the-great-mapreduce-debate/#comment-90247</guid>
		<description>What is MapReduce good for?  Take a look at the Mahout project at Apache: http://lucene.apache.org/mahout

Mahout implements a variety of machine learning algorithms, many of which are useful in text mining.  Mahout builds on Apache Hadoop, which is an implementation of MapReduce.  Both are sub-projects of the Lucene search engine.  If all this Lucene code comes together at some point in the future, then Lucene will be much more than a search engine.

I&#039;m sure Google is working on using MapReduce for some of the same algorithms--i.e. that text mining is in Google&#039;s future.</description>
		<content:encoded><![CDATA[<p>What is MapReduce good for?  Take a look at the Mahout project at Apache: <a href="http://lucene.apache.org/mahout" rel="nofollow">http://lucene.apache.org/mahout</a></p>
<p>Mahout implements a variety of machine learning algorithms, many of which are useful in text mining.  Mahout builds on Apache Hadoop, which is an implementation of MapReduce.  Both are sub-projects of the Lucene search engine.  If all this Lucene code comes together at some point in the future, then Lucene will be much more than a search engine.</p>
<p>I&#8217;m sure Google is working on using MapReduce for some of the same algorithms&#8211;i.e. that text mining is in Google&#8217;s future.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Google has thousands of internal data formats, mostly simple ones &#124; DBMS2 -- DataBase Management System Services</title>
		<link>http://www.dbms2.com/2008/01/18/the-great-mapreduce-debate/#comment-90129</link>
		<dc:creator>Google has thousands of internal data formats, mostly simple ones &#124; DBMS2 -- DataBase Management System Services</dc:creator>
		<pubDate>Tue, 08 Jul 2008 18:27:09 +0000</pubDate>
		<guid isPermaLink="false">http://www.dbms2.com/2008/01/18/the-great-mapreduce-debate/#comment-90129</guid>
		<description>[...] to think of it, that sounds very consistent with the idea that MapReduce solves a large fraction of Google&#8217;s data management issues.   Share: These icons link to [...]</description>
		<content:encoded><![CDATA[<p>[...] to think of it, that sounds very consistent with the idea that MapReduce solves a large fraction of Google&#8217;s data management issues.   Share: These icons link to [...]</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Curt Monash</title>
		<link>http://www.dbms2.com/2008/01/18/the-great-mapreduce-debate/#comment-70995</link>
		<dc:creator>Curt Monash</dc:creator>
		<pubDate>Tue, 05 Feb 2008 17:19:56 +0000</pubDate>
		<guid isPermaLink="false">http://www.dbms2.com/2008/01/18/the-great-mapreduce-debate/#comment-70995</guid>
		<description>Which nobody thought to tell me about. ::sigh::

Actually, I&#039;m glad they didn&#039;t.  I&#039;ve been coughing a lot, and wound up sleeping 14 hours yesterday to good effect.  So missing the conference was probably a good thing.

CAM</description>
		<content:encoded><![CDATA[<p>Which nobody thought to tell me about. ::sigh::</p>
<p>Actually, I&#8217;m glad they didn&#8217;t.  I&#8217;ve been coughing a lot, and wound up sleeping 14 hours yesterday to good effect.  So missing the conference was probably a good thing.</p>
<p>CAM</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Daniel Weinreb</title>
		<link>http://www.dbms2.com/2008/01/18/the-great-mapreduce-debate/#comment-70960</link>
		<dc:creator>Daniel Weinreb</dc:creator>
		<pubDate>Tue, 05 Feb 2008 12:14:46 +0000</pubDate>
		<guid isPermaLink="false">http://www.dbms2.com/2008/01/18/the-great-mapreduce-debate/#comment-70960</guid>
		<description>Yesterday, at the New England Database Day conference, Prof. DeWitt gave an invited talk.  It became far clearer to me in what manner he was comparing Map/Reduce with parallel database systems.

The blog entry that we&#039;ve all been reading is confusing.  Like everyone else, my reaction was &quot;Well, Map/Reduce never said that it was a database system, so why are you criticizing it as if it were?&quot;

What he&#039;s mainly comparing is the overall job scheduling strategy of Map/Reduce versus the kind of job scheduling used by parallel RDBMS&#039;s. The Map/Reduce pattern is obviously a useful tool for certain problems.  His point is that a more general parallel database system can choose among many patterns, of which Map/Reduce is only one example, and therefore can be a good solution for a wider range of problems.  Furthermore, you can issue a declarative query (i.e. in a query language such as SQL or relational algebra), and an automatic optimizer can choose which of those patterns to use for your particular problem, provided that you have an actual DBMS with a schema and so forth.

In this context, what the blog entry says makes a lot more sense.

I hope Prof. DeWitt writes up his talk as a paper, which would make this all a lot more clear.</description>
		<content:encoded><![CDATA[<p>Yesterday, at the New England Database Day conference, Prof. DeWitt gave an invited talk.  It became far clearer to me in what manner he was comparing Map/Reduce with parallel database systems.</p>
<p>The blog entry that we&#8217;ve all been reading is confusing.  Like everyone else, my reaction was &#8220;Well, Map/Reduce never said that it was a database system, so why are you criticizing it as if it were?&#8221;</p>
<p>What he&#8217;s mainly comparing is the overall job scheduling strategy of Map/Reduce versus the kind of job scheduling used by parallel RDBMS&#8217;s. The Map/Reduce pattern is obviously a useful tool for certain problems.  His point is that a more general parallel database system can choose among many patterns, of which Map/Reduce is only one example, and therefore can be a good solution for a wider range of problems.  Furthermore, you can issue a declarative query (i.e. in a query language such as SQL or relational algebra), and an automatic optimizer can choose which of those patterns to use for your particular problem, provided that you have an actual DBMS with a schema and so forth.</p>
<p>In this context, what the blog entry says makes a lot more sense.</p>
<p>I hope Prof. DeWitt writes up his talk as a paper, which would make this all a lot more clear.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Text Technologies&#187;Blog Archive &#187; 19 Microsoft/Yahoo synergies that could revolutionize the Internet</title>
		<link>http://www.dbms2.com/2008/01/18/the-great-mapreduce-debate/#comment-70671</link>
		<dc:creator>Text Technologies&#187;Blog Archive &#187; 19 Microsoft/Yahoo synergies that could revolutionize the Internet</dc:creator>
		<pubDate>Sun, 03 Feb 2008 22:04:49 +0000</pubDate>
		<guid isPermaLink="false">http://www.dbms2.com/2008/01/18/the-great-mapreduce-debate/#comment-70671</guid>
		<description>[...] Ditto. (Recent discussion of Google MapReduce quantifies this processing effort a [...]</description>
		<content:encoded><![CDATA[<p>[...] Ditto. (Recent discussion of Google MapReduce quantifies this processing effort a [...]</p>
]]></content:encoded>
	</item>
</channel>
</rss>

