<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	>
<channel>
	<title>Comments on: Relational data warehouse Expansion (or Explosion) Ratios</title>
	<atom:link href="http://www.dbms2.com/2006/09/28/expansion-ratio-data-warehouse-explosion/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.dbms2.com/2006/09/28/expansion-ratio-data-warehouse-explosion/</link>
	<description>Choices in data management and analysis</description>
	<pubDate>Fri, 16 May 2008 18:54:17 +0000</pubDate>
	<generator>http://wordpress.org/?v=2.5</generator>
		<item>
		<title>By: DBMS2 &#8212; DataBase Management System Services&#187;Blog Archive &#187; Progress Software progress report</title>
		<link>http://www.dbms2.com/2006/09/28/expansion-ratio-data-warehouse-explosion/#comment-27913</link>
		<dc:creator>DBMS2 &#8212; DataBase Management System Services&#187;Blog Archive &#187; Progress Software progress report</dc:creator>
		<pubDate>Sun, 29 Apr 2007 02:20:27 +0000</pubDate>
		<guid isPermaLink="false">http://www.dbms2.com/2006/09/28/expansion-ratio-data-warehouse-explosion/#comment-27913</guid>
		<description>[...] size vs. Oracle for the same data, due to, for example, variable record lengths. Such factors are not unheard of in data warehousing, of course; but for OLTP his comment was quite the jaw-dropper. On the other hand, OLTP databases [...]</description>
		<content:encoded><![CDATA[<p>[&#8230;] size vs. Oracle for the same data, due to, for example, variable record lengths. Such factors are not unheard of in data warehousing, of course; but for OLTP his comment was quite the jaw-dropper. On the other hand, OLTP databases [&#8230;]</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Curt Monash</title>
		<link>http://www.dbms2.com/2006/09/28/expansion-ratio-data-warehouse-explosion/#comment-9317</link>
		<dc:creator>Curt Monash</dc:creator>
		<pubDate>Tue, 03 Oct 2006 07:49:07 +0000</pubDate>
		<guid isPermaLink="false">http://www.dbms2.com/2006/09/28/expansion-ratio-data-warehouse-explosion/#comment-9317</guid>
		<description>William,

Yes, but the question is how much you need in the way of indexing and summaries to get the same performance from different systems.

Also, the grand hash partition that starts off a typical MPP installation saves some index space.   And if table scans are fast enough a lot of the time, you need fewer indicies than if they almost never are.

Anyhow, if you look up to my IBM/Teradata post, IBM claims that indeed their installations wind up very comparable to Teradata's.  But then, IBM and Teradata also have pretty similar overall technical approaches now.</description>
		<content:encoded><![CDATA[<p>William,</p>
<p>Yes, but the question is how much you need in the way of indexing and summaries to get the same performance from different systems.</p>
<p>Also, the grand hash partition that starts off a typical MPP installation saves some index space.   And if table scans are fast enough a lot of the time, you need fewer indicies than if they almost never are.</p>
<p>Anyhow, if you look up to my IBM/Teradata post, IBM claims that indeed their installations wind up very comparable to Teradata&#8217;s.  But then, IBM and Teradata also have pretty similar overall technical approaches now.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: DBMS2 &#8212; DataBase Management System Services&#187;Blog Archive &#187; IBM and Teradata too</title>
		<link>http://www.dbms2.com/2006/09/28/expansion-ratio-data-warehouse-explosion/#comment-9298</link>
		<dc:creator>DBMS2 &#8212; DataBase Management System Services&#187;Blog Archive &#187; IBM and Teradata too</dc:creator>
		<pubDate>Tue, 03 Oct 2006 05:52:07 +0000</pubDate>
		<guid isPermaLink="false">http://www.dbms2.com/2006/09/28/expansion-ratio-data-warehouse-explosion/#comment-9298</guid>
		<description>[...] When I posted a few days ago about Expansion Ratios, I passed along Teradata’s estimate that there was a significant gap between Teradata and IBM. IBM’s own estimate, however, is that a typical Expansion Ratio for DB2 is a Teradata-like 2-3X in DB2 Version 8, and that it’s falling further in Version 9 due to compression. (DB2’s compression sounds like it’s the most aggressive in the business, specialty columnar or MOLAP products perhaps excepted.) Incidentally, this suggests that the indexing features DB2 has that Teradata doesn’t – e.g., alternate datatypes like geospatial – aren’t heavily used by a large fraction of the customer base. [...]</description>
		<content:encoded><![CDATA[<p>[&#8230;] When I posted a few days ago about Expansion Ratios, I passed along Teradata’s estimate that there was a significant gap between Teradata and IBM. IBM’s own estimate, however, is that a typical Expansion Ratio for DB2 is a Teradata-like 2-3X in DB2 Version 8, and that it’s falling further in Version 9 due to compression. (DB2’s compression sounds like it’s the most aggressive in the business, specialty columnar or MOLAP products perhaps excepted.) Incidentally, this suggests that the indexing features DB2 has that Teradata doesn’t – e.g., alternate datatypes like geospatial – aren’t heavily used by a large fraction of the customer base. [&#8230;]</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: William McKnight</title>
		<link>http://www.dbms2.com/2006/09/28/expansion-ratio-data-warehouse-explosion/#comment-8469</link>
		<dc:creator>William McKnight</dc:creator>
		<pubDate>Fri, 29 Sep 2006 20:26:14 +0000</pubDate>
		<guid isPermaLink="false">http://www.dbms2.com/2006/09/28/expansion-ratio-data-warehouse-explosion/#comment-8469</guid>
		<description>Curt, I believe that end users of data warehouse technologies have good control over the expansion, or uptake, ratio.  It really depends on the level of performance that is needed and the amount of the data that they actually access in online queries.  Some sites index everything and others only do it when absolutely necessary.  Same goes for creating summary tables and cubes.  TPC numbers represent the high end of anything that would be implemented at a real site because the vendors use TPC to brag most about the performance figures, not the price/performance.  The Teradata numbers seem about right in practice.  

By the way, Teradata has an interesting new feature called "multi temperature" which allows sites to pay less for the data they don't access as much.  Many have been implementing concepts like this for some time to find a good price/performance balance.</description>
		<content:encoded><![CDATA[<p>Curt, I believe that end users of data warehouse technologies have good control over the expansion, or uptake, ratio.  It really depends on the level of performance that is needed and the amount of the data that they actually access in online queries.  Some sites index everything and others only do it when absolutely necessary.  Same goes for creating summary tables and cubes.  TPC numbers represent the high end of anything that would be implemented at a real site because the vendors use TPC to brag most about the performance figures, not the price/performance.  The Teradata numbers seem about right in practice.  </p>
<p>By the way, Teradata has an interesting new feature called &#8220;multi temperature&#8221; which allows sites to pay less for the data they don&#8217;t access as much.  Many have been implementing concepts like this for some time to find a good price/performance balance.</p>
]]></content:encoded>
	</item>
</channel>
</rss>
