<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
		>
<channel>
	<title>Comments on: Greenplum claims very fast load speeds, and Fox still throws away most of its MySpace data</title>
	<atom:link href="http://www.dbms2.com/2009/03/20/greenplum-claims-very-fast-load-speeds-and-fox-still-throws-away-most-of-its-myspace-data/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.dbms2.com/2009/03/20/greenplum-claims-very-fast-load-speeds-and-fox-still-throws-away-most-of-its-myspace-data/</link>
	<description>Choices in data management and analysis</description>
	<lastBuildDate>Thu, 09 Feb 2012 21:52:17 +0000</lastBuildDate>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.0.3</generator>
	<item>
		<title>By: Kevin Closson</title>
		<link>http://www.dbms2.com/2009/03/20/greenplum-claims-very-fast-load-speeds-and-fox-still-throws-away-most-of-its-myspace-data/#comment-113883</link>
		<dc:creator>Kevin Closson</dc:creator>
		<pubDate>Fri, 20 Mar 2009 20:14:05 +0000</pubDate>
		<guid isPermaLink="false">http://www.dbms2.com/?p=723#comment-113883</guid>
		<description>Dave wrote:
&quot;4TB per hour rate is not 4TB of continuous data loading. It’s 2TB in 30 minutes. &quot;

FWIW, I have blogged a bit about the topic of load &quot;rates&quot; versus load results:

http://kevinclosson.wordpress.com/2009/03/17/no-proof-means-all-spoof-exadata-lags-competitor-bulk-data-loading-capability-really/



The views expressed in this comment are my own and do not necessarily reflect the views of Oracle. The views and opinions expressed by others on this comment thread are theirs, not mine.</description>
		<content:encoded><![CDATA[<p>Dave wrote:<br />
&#8220;4TB per hour rate is not 4TB of continuous data loading. It’s 2TB in 30 minutes. &#8221;</p>
<p>FWIW, I have blogged a bit about the topic of load &#8220;rates&#8221; versus load results:</p>
<p><a href="http://kevinclosson.wordpress.com/2009/03/17/no-proof-means-all-spoof-exadata-lags-competitor-bulk-data-loading-capability-really/" rel="nofollow">http://kevinclosson.wordpress.com/2009/03/17/no-proof-means-all-spoof-exadata-lags-competitor-bulk-data-loading-capability-really/</a></p>
<p>The views expressed in this comment are my own and do not necessarily reflect the views of Oracle. The views and opinions expressed by others on this comment thread are theirs, not mine.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: More on Greenplum, Fox/MySpace, and load speeds &#124; DBMS2 -- DataBase Management System Services</title>
		<link>http://www.dbms2.com/2009/03/20/greenplum-claims-very-fast-load-speeds-and-fox-still-throws-away-most-of-its-myspace-data/#comment-113862</link>
		<dc:creator>More on Greenplum, Fox/MySpace, and load speeds &#124; DBMS2 -- DataBase Management System Services</dc:creator>
		<pubDate>Fri, 20 Mar 2009 16:40:43 +0000</pubDate>
		<guid isPermaLink="false">http://www.dbms2.com/?p=723#comment-113862</guid>
		<description>[...] Lai offers more facts, figures, explanation, and competitive insight than I did on Greenplum&#8217;s loading of the Fox/MySpace database, including that Greenplum is being loaded [...]</description>
		<content:encoded><![CDATA[<p>[...] Lai offers more facts, figures, explanation, and competitive insight than I did on Greenplum&#8217;s loading of the Fox/MySpace database, including that Greenplum is being loaded [...]</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Dave Menninger</title>
		<link>http://www.dbms2.com/2009/03/20/greenplum-claims-very-fast-load-speeds-and-fox-still-throws-away-most-of-its-myspace-data/#comment-113854</link>
		<dc:creator>Dave Menninger</dc:creator>
		<pubDate>Fri, 20 Mar 2009 14:17:11 +0000</pubDate>
		<guid isPermaLink="false">http://www.dbms2.com/?p=723#comment-113854</guid>
		<description>I&#039;m not sure whether to be flattered or insulted by the use of the term &quot;contrived&quot;.  ;-)  According to Meriam Webster&#039;s online dictionary, contrived could be considered &quot;artistic or ingeneous&quot;, but I suspect most people associate a negative connotation with the term.  Just to set the record straight, Vertica and its partner Syncsort followed a precedent set by Informatica and then subsequently repeated by Microsoft.  They had each published data loading benchmarks using the TPC-H data.  Since there is no industry standard data loading benchmark we simply followed the precedent, except we had our results audited and we published a disclosure report so the effort would be transparent.  The details are available at http://www.vertica.com/etlworldrecord.

While we are on the subject of setting the record straight, readers might want to check out Eric Lai&#039;s article in which Ben Werther of Greenplum explains that the 4TB per hour rate is not 4TB of continuous data loading.  It&#039;s 2TB in 30 minutes. http://www.networkworld.com/news/2009/031909-upstarts-speed-past-bi-vendors.html?page=2. Perhaps that rate could be sustained, perhaps not.  That&#039;s why full disclosure is important.  Readers need to know what the data represents.

Dave</description>
		<content:encoded><![CDATA[<p>I&#8217;m not sure whether to be flattered or insulted by the use of the term &#8220;contrived&#8221;.  <img src='http://www.dbms2.com/wp-includes/images/smilies/icon_wink.gif' alt=';-)' class='wp-smiley' />   According to Meriam Webster&#8217;s online dictionary, contrived could be considered &#8220;artistic or ingeneous&#8221;, but I suspect most people associate a negative connotation with the term.  Just to set the record straight, Vertica and its partner Syncsort followed a precedent set by Informatica and then subsequently repeated by Microsoft.  They had each published data loading benchmarks using the TPC-H data.  Since there is no industry standard data loading benchmark we simply followed the precedent, except we had our results audited and we published a disclosure report so the effort would be transparent.  The details are available at <a href="http://www.vertica.com/etlworldrecord" rel="nofollow">http://www.vertica.com/etlworldrecord</a>.</p>
<p>While we are on the subject of setting the record straight, readers might want to check out Eric Lai&#8217;s article in which Ben Werther of Greenplum explains that the 4TB per hour rate is not 4TB of continuous data loading.  It&#8217;s 2TB in 30 minutes. <a href="http://www.networkworld.com/news/2009/031909-upstarts-speed-past-bi-vendors.html?page=2" rel="nofollow">http://www.networkworld.com/news/2009/031909-upstarts-speed-past-bi-vendors.html?page=2</a>. Perhaps that rate could be sustained, perhaps not.  That&#8217;s why full disclosure is important.  Readers need to know what the data represents.</p>
<p>Dave</p>
]]></content:encoded>
	</item>
</channel>
</rss>

