<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
		>
<channel>
	<title>Comments on: SANs vs. DAS in MPP data warehousing</title>
	<atom:link href="http://www.dbms2.com/2008/09/06/sans-vs-das-in-mpp-data-warehousing/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.dbms2.com/2008/09/06/sans-vs-das-in-mpp-data-warehousing/</link>
	<description>Choices in data management and analysis</description>
	<lastBuildDate>Mon, 01 Mar 2010 19:09:32 -0500</lastBuildDate>
	<generator>http://wordpress.org/?v=2.8.4</generator>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
		<item>
		<title>By: Ian</title>
		<link>http://www.dbms2.com/2008/09/06/sans-vs-das-in-mpp-data-warehousing/comment-page-1/#comment-123148</link>
		<dc:creator>Ian</dc:creator>
		<pubDate>Thu, 28 May 2009 11:41:57 +0000</pubDate>
		<guid isPermaLink="false">http://www.dbms2.com/?p=524#comment-123148</guid>
		<description>The most important thing IMHO is cost per MB/s of throughput. When you factor in how much it&#039;ll cost you, SAN doesn&#039;t get a look-in. I currently work on a system with total sequential throughput of 3 GigaBYTES per second which cost around USD 35,000. We costed out the same throughput on SANs - it would have worked out as 8 times more expensive.

You also can&#039;t say that you&#039;ll use enterprise infrastructure to deliver the performance without compromising performance service levels -- after all, as soon as you use the Enterprise SAN, you&#039;ve now got contention on fabric switches and any other shared components.</description>
		<content:encoded><![CDATA[<p>The most important thing IMHO is cost per MB/s of throughput. When you factor in how much it&#8217;ll cost you, SAN doesn&#8217;t get a look-in. I currently work on a system with total sequential throughput of 3 GigaBYTES per second which cost around USD 35,000. We costed out the same throughput on SANs &#8211; it would have worked out as 8 times more expensive.</p>
<p>You also can&#8217;t say that you&#8217;ll use enterprise infrastructure to deliver the performance without compromising performance service levels &#8212; after all, as soon as you use the Enterprise SAN, you&#8217;ve now got contention on fabric switches and any other shared components.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: shshme</title>
		<link>http://www.dbms2.com/2008/09/06/sans-vs-das-in-mpp-data-warehousing/comment-page-1/#comment-122306</link>
		<dc:creator>shshme</dc:creator>
		<pubDate>Thu, 21 May 2009 09:21:57 +0000</pubDate>
		<guid isPermaLink="false">http://www.dbms2.com/?p=524#comment-122306</guid>
		<description>There is a good Cost Comparison at the site link above.</description>
		<content:encoded><![CDATA[<p>There is a good Cost Comparison at the site link above.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Infology.Ru &#187; Blog Archive &#187; Exadata: Oracle наконец отвечает бросившим вызов в области хранилищ данных</title>
		<link>http://www.dbms2.com/2008/09/06/sans-vs-das-in-mpp-data-warehousing/comment-page-1/#comment-98925</link>
		<dc:creator>Infology.Ru &#187; Blog Archive &#187; Exadata: Oracle наконец отвечает бросившим вызов в области хранилищ данных</dc:creator>
		<pubDate>Thu, 09 Oct 2008 20:26:28 +0000</pubDate>
		<guid isPermaLink="false">http://www.dbms2.com/?p=524#comment-98925</guid>
		<description>[...] SANs vs. DAS in MPP data warehousing [...]</description>
		<content:encoded><![CDATA[<p>[...] SANs vs. DAS in MPP data warehousing [...]</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Exadata and Oracle Database Machine parallelization clarified &#124; DBMS2 -- DataBase Management System Services</title>
		<link>http://www.dbms2.com/2008/09/06/sans-vs-das-in-mpp-data-warehousing/comment-page-1/#comment-98136</link>
		<dc:creator>Exadata and Oracle Database Machine parallelization clarified &#124; DBMS2 -- DataBase Management System Services</dc:creator>
		<pubDate>Mon, 29 Sep 2008 06:03:40 +0000</pubDate>
		<guid isPermaLink="false">http://www.dbms2.com/?p=524#comment-98136</guid>
		<description>[...] that follows this optimal storage-facing parallelization strategy. Actually, opinions differ as to whether rigid coupling of processors to specific disks is actually necessary. But after supporting one extreme (the disk part of shared-everything), Oracle with Exadata has [...]</description>
		<content:encoded><![CDATA[<p>[...] that follows this optimal storage-facing parallelization strategy. Actually, opinions differ as to whether rigid coupling of processors to specific disks is actually necessary. But after supporting one extreme (the disk part of shared-everything), Oracle with Exadata has [...]</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Doug's Oracle Blog</title>
		<link>http://www.dbms2.com/2008/09/06/sans-vs-das-in-mpp-data-warehousing/comment-page-1/#comment-97951</link>
		<dc:creator>Doug's Oracle Blog</dc:creator>
		<pubDate>Thu, 25 Sep 2008 18:16:33 +0000</pubDate>
		<guid isPermaLink="false">http://www.dbms2.com/?p=524#comment-97951</guid>
		<description>&lt;strong&gt;Day 4 - Grumpy Old Man...&lt;/strong&gt;


Well, I was starting to worry that I was completely out of step with the blogosphere, because finally a keynote presentation worth writing about and no-one seems keen, but I&#039;ve only just noticed this quote in Pete Scott&#039;s blog posting.&quot;For str...</description>
		<content:encoded><![CDATA[<p><strong>Day 4 &#8211; Grumpy Old Man&#8230;</strong></p>
<p>Well, I was starting to worry that I was completely out of step with the blogosphere, because finally a keynote presentation worth writing about and no-one seems keen, but I&#8217;ve only just noticed this quote in Pete Scott&#8217;s blog posting.&quot;For str&#8230;</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Exadata: Oracle finally answers the data warehouse challengers &#124; DBMS2 -- DataBase Management System Services</title>
		<link>http://www.dbms2.com/2008/09/06/sans-vs-das-in-mpp-data-warehousing/comment-page-1/#comment-97906</link>
		<dc:creator>Exadata: Oracle finally answers the data warehouse challengers &#124; DBMS2 -- DataBase Management System Services</dc:creator>
		<pubDate>Thu, 25 Sep 2008 00:25:14 +0000</pubDate>
		<guid isPermaLink="false">http://www.dbms2.com/?p=524#comment-97906</guid>
		<description>[...] SANs vs. DAS in MPP data warehousing [...]</description>
		<content:encoded><![CDATA[<p>[...] SANs vs. DAS in MPP data warehousing [...]</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Jeff Vogel</title>
		<link>http://www.dbms2.com/2008/09/06/sans-vs-das-in-mpp-data-warehousing/comment-page-1/#comment-97396</link>
		<dc:creator>Jeff Vogel</dc:creator>
		<pubDate>Sat, 13 Sep 2008 22:12:39 +0000</pubDate>
		<guid isPermaLink="false">http://www.dbms2.com/?p=524#comment-97396</guid>
		<description>Chuck,
Good to see you taking an interest in the DBMS subject of SANs.  As to your comment regarding &quot;pushing logic to the array&quot;, EMC should be careful not to think about this in one dimensional terms of DBMS logic and it&#039;s clearly not just about performance benefits.  Given EMC&#039;s early market leadership in creating SANs (Mike Ruettgers comes to mind), you wouldn&#039;t want to be caught sleeping at the wheel of a multi-billions dollar market where storage management is concerned (-:  Sorry for being somewhat elusive, but confidentiality is important.</description>
		<content:encoded><![CDATA[<p>Chuck,<br />
Good to see you taking an interest in the DBMS subject of SANs.  As to your comment regarding &#8220;pushing logic to the array&#8221;, EMC should be careful not to think about this in one dimensional terms of DBMS logic and it&#8217;s clearly not just about performance benefits.  Given EMC&#8217;s early market leadership in creating SANs (Mike Ruettgers comes to mind), you wouldn&#8217;t want to be caught sleeping at the wheel of a multi-billions dollar market where storage management is concerned (-:  Sorry for being somewhat elusive, but confidentiality is important.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: How will SSDs get incorporated into data warehousing? &#124; DBMS2 -- DataBase Management System Services</title>
		<link>http://www.dbms2.com/2008/09/06/sans-vs-das-in-mpp-data-warehousing/comment-page-1/#comment-97372</link>
		<dc:creator>How will SSDs get incorporated into data warehousing? &#124; DBMS2 -- DataBase Management System Services</dc:creator>
		<pubDate>Sat, 13 Sep 2008 10:50:30 +0000</pubDate>
		<guid isPermaLink="false">http://www.dbms2.com/?p=524#comment-97372</guid>
		<description>[...] visible Information Week blog post. After the great recent (and still ongoing!) discussion in the SAN vs. DAS comment thread, I&#8217;d like to throw some questions out for discussion, [...]</description>
		<content:encoded><![CDATA[<p>[...] visible Information Week blog post. After the great recent (and still ongoing!) discussion in the SAN vs. DAS comment thread, I&#8217;d like to throw some questions out for discussion, [...]</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Curt Monash</title>
		<link>http://www.dbms2.com/2008/09/06/sans-vs-das-in-mpp-data-warehousing/comment-page-1/#comment-97367</link>
		<dc:creator>Curt Monash</dc:creator>
		<pubDate>Sat, 13 Sep 2008 08:58:28 +0000</pubDate>
		<guid isPermaLink="false">http://www.dbms2.com/?p=524#comment-97367</guid>
		<description>The sequential I/O issue, like most of the others here, is not trivial.  If you&#039;re mainly doing queries that access a lot of data, it&#039;s possible for a large fraction of your query I/O to be sequential.   If you&#039;re doing pinpoint data lookup in some operational BI type of scenario, however, then you&#039;re apt to be pretty random.

Even for analytical queries, the more complex your partitioning and/or schema, the harder it is to be purely sequential. And by the way, Luke has been giving me an earful recently about the breadth of Greenplum&#039;s schema agnosticism and support for various index types.

As for updates -- in many warehouse scenarios, even low-latency updates can be a lot more sequential than Chuck suggests.  MVCC approaches lend themselves to sequential updating.  Or if you&#039;re on a once-every-few-minutes microbatch updating schedule, the whole thing is apt to be either pretty sequential or else so low-volume as not to be a major performance concern.

CAM</description>
		<content:encoded><![CDATA[<p>The sequential I/O issue, like most of the others here, is not trivial.  If you&#8217;re mainly doing queries that access a lot of data, it&#8217;s possible for a large fraction of your query I/O to be sequential.   If you&#8217;re doing pinpoint data lookup in some operational BI type of scenario, however, then you&#8217;re apt to be pretty random.</p>
<p>Even for analytical queries, the more complex your partitioning and/or schema, the harder it is to be purely sequential. And by the way, Luke has been giving me an earful recently about the breadth of Greenplum&#8217;s schema agnosticism and support for various index types.</p>
<p>As for updates &#8212; in many warehouse scenarios, even low-latency updates can be a lot more sequential than Chuck suggests.  MVCC approaches lend themselves to sequential updating.  Or if you&#8217;re on a once-every-few-minutes microbatch updating schedule, the whole thing is apt to be either pretty sequential or else so low-volume as not to be a major performance concern.</p>
<p>CAM</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Curt Monash</title>
		<link>http://www.dbms2.com/2008/09/06/sans-vs-das-in-mpp-data-warehousing/comment-page-1/#comment-97366</link>
		<dc:creator>Curt Monash</dc:creator>
		<pubDate>Sat, 13 Sep 2008 08:43:59 +0000</pubDate>
		<guid isPermaLink="false">http://www.dbms2.com/?p=524#comment-97366</guid>
		<description>Here&#039;s what I suspect is going on.  The arguments for not sharing RAM and for not sharing disk are quite different.  They&#039;ve tended to be conflated, for historical reasons, in that the most archetypical vendors and products (Oracle, Teradata, Netezza, et al.) tended to be either ones that share both of those resource or ones that share neither of them.

Not sharing RAM saves you from all sorts of thread synchronization issues.  Unless there&#039;s something about multi-core evolution I&#039;m not understanding, the case for not sharing RAM seems solid for the foreseeable future.

Not sharing disk, however, stems largely from bandwidth concerns in current SAN configurations and/or manufacturing cost concerns in custom appliance designs.  Those arguments are lot less robust or future-proofed than the ones against sharing RAM, as is evidenced by the number of unshared-RAM MPP data warehouse DBMS providers who DO support SANs in theory and practice alike.

Sound reasonable?

Thanks for the great discussion,

CAM</description>
		<content:encoded><![CDATA[<p>Here&#8217;s what I suspect is going on.  The arguments for not sharing RAM and for not sharing disk are quite different.  They&#8217;ve tended to be conflated, for historical reasons, in that the most archetypical vendors and products (Oracle, Teradata, Netezza, et al.) tended to be either ones that share both of those resource or ones that share neither of them.</p>
<p>Not sharing RAM saves you from all sorts of thread synchronization issues.  Unless there&#8217;s something about multi-core evolution I&#8217;m not understanding, the case for not sharing RAM seems solid for the foreseeable future.</p>
<p>Not sharing disk, however, stems largely from bandwidth concerns in current SAN configurations and/or manufacturing cost concerns in custom appliance designs.  Those arguments are lot less robust or future-proofed than the ones against sharing RAM, as is evidenced by the number of unshared-RAM MPP data warehouse DBMS providers who DO support SANs in theory and practice alike.</p>
<p>Sound reasonable?</p>
<p>Thanks for the great discussion,</p>
<p>CAM</p>
]]></content:encoded>
	</item>
</channel>
</rss>

<!-- Dynamic page generated in 0.279 seconds. -->
<!-- Cached page generated by WP-Super-Cache on 2010-03-02 18:05:54 -->
