<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
		>
<channel>
	<title>Comments on: Thoughts on the integration of OLTP and data warehousing, especially in Exadata 2</title>
	<atom:link href="http://www.dbms2.com/2009/09/29/integration-oltp-data-warehousing-exadata-2/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.dbms2.com/2009/09/29/integration-oltp-data-warehousing-exadata-2/</link>
	<description>Choices in data management and analysis</description>
	<lastBuildDate>Thu, 09 Feb 2012 16:57:09 +0000</lastBuildDate>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.0.3</generator>
	<item>
		<title>By: Oracle Exadata 2 capacity pricing &#124; DBMS2 -- DataBase Management System Services</title>
		<link>http://www.dbms2.com/2009/09/29/integration-oltp-data-warehousing-exadata-2/#comment-142622</link>
		<dc:creator>Oracle Exadata 2 capacity pricing &#124; DBMS2 -- DataBase Management System Services</dc:creator>
		<pubDate>Tue, 06 Oct 2009 12:20:02 +0000</pubDate>
		<guid isPermaLink="false">http://www.dbms2.com/?p=939#comment-142622</guid>
		<description>[...] Issues in integrating OLTP and data warehousing in a single system [...]</description>
		<content:encoded><![CDATA[<p>[...] Issues in integrating OLTP and data warehousing in a single system [...]</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Stray__Cat</title>
		<link>http://www.dbms2.com/2009/09/29/integration-oltp-data-warehousing-exadata-2/#comment-142239</link>
		<dc:creator>Stray__Cat</dc:creator>
		<pubDate>Fri, 02 Oct 2009 16:04:31 +0000</pubDate>
		<guid isPermaLink="false">http://www.dbms2.com/?p=939#comment-142239</guid>
		<description>@Curt

You are perfectly right in a technical frameset, but my point is that chosing on a technical basis is NOT the right method to go. Separate systems with different technologies make sense from a business perspective.

About #7, I&#039;ve seen to many CEOs thinking that, if the technology is the same as the ERP, BI is something operational and not strategic. 
In other words, placing the DW together with the OLTP disqualifies the DW and make harder to get resources and visibility.</description>
		<content:encoded><![CDATA[<p>@Curt</p>
<p>You are perfectly right in a technical frameset, but my point is that chosing on a technical basis is NOT the right method to go. Separate systems with different technologies make sense from a business perspective.</p>
<p>About #7, I&#8217;ve seen to many CEOs thinking that, if the technology is the same as the ERP, BI is something operational and not strategic.<br />
In other words, placing the DW together with the OLTP disqualifies the DW and make harder to get resources and visibility.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Leandro Tubia</title>
		<link>http://www.dbms2.com/2009/09/29/integration-oltp-data-warehousing-exadata-2/#comment-142227</link>
		<dc:creator>Leandro Tubia</dc:creator>
		<pubDate>Fri, 02 Oct 2009 11:26:59 +0000</pubDate>
		<guid isPermaLink="false">http://www.dbms2.com/?p=939#comment-142227</guid>
		<description>@David

I totally agree with you that it&#039;s not a matter of deciding if it&#039;s better having one or two data models (I prefer two for all reasons enumerated above), but if it&#039;s factible to have both models in the same box.
Actually, we have a simmilar dilemma at another level when designing disk assignment to OLTP and DW servers from the same HP EVA cabin: it&#039;s supposed that data distribution along disk and virtual arrays are dynamically assigned by cabin intelligence according to usage patterns, so as to optimize throughput.
I imagine that Oracle suggests to bubble up this philosophy to the server node level.
In the case of Exadata it could be easier as architecture should be totally balanced from CPU, memory to HBAs and disk arrays.
In the classic Server+SAN architecture it&#039;s quite different because if still not considering the mission the server would be assigned to, heterogeneous server configurations generate hierarchized usage: newer powerfull servers (many CPUs with much, much Ram) monopolize disk consumption, leaving other older servers waiting for access.</description>
		<content:encoded><![CDATA[<p>@David</p>
<p>I totally agree with you that it&#8217;s not a matter of deciding if it&#8217;s better having one or two data models (I prefer two for all reasons enumerated above), but if it&#8217;s factible to have both models in the same box.<br />
Actually, we have a simmilar dilemma at another level when designing disk assignment to OLTP and DW servers from the same HP EVA cabin: it&#8217;s supposed that data distribution along disk and virtual arrays are dynamically assigned by cabin intelligence according to usage patterns, so as to optimize throughput.<br />
I imagine that Oracle suggests to bubble up this philosophy to the server node level.<br />
In the case of Exadata it could be easier as architecture should be totally balanced from CPU, memory to HBAs and disk arrays.<br />
In the classic Server+SAN architecture it&#8217;s quite different because if still not considering the mission the server would be assigned to, heterogeneous server configurations generate hierarchized usage: newer powerfull servers (many CPUs with much, much Ram) monopolize disk consumption, leaving other older servers waiting for access.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: David Aldridge</title>
		<link>http://www.dbms2.com/2009/09/29/integration-oltp-data-warehousing-exadata-2/#comment-142214</link>
		<dc:creator>David Aldridge</dc:creator>
		<pubDate>Fri, 02 Oct 2009 06:57:00 +0000</pubDate>
		<guid isPermaLink="false">http://www.dbms2.com/?p=939#comment-142214</guid>
		<description>@Curt

&gt;&gt; You might just want to run a separate brand of DBMS for your OLTP and data warehousing.

Oracle&#039;s philosophy for a long time has been to take the code to the data, not to take the data to the code (eg. integration of OLAP, Data Mining, Rules Management etc), so Exadata 2 as a combined platform is very much in line with that. Having worked on extracting data to data mining and OLAP applications I&#039;m very much inclined to agree with them -- data movement is a pain.

However, running analytics against a live OLTP schema is a formidable problem. That doesn&#039;t preclude hosting both types of database on the same machine though, with all the advantages that would bring. It looks like they&#039;ve put soe serious thought into resolving DW and OLTP SLA&#039;s also.

As for the opinion of Teradata, Greenplum etc. ... &quot;Man holding hammer says screws will never catch on&quot;. They rather would say that, wouldn&#039;t they? Solving the technical problems of creating a world-class OLTP RDBMS rather exceed those of creating an analytic platform, I&#039;d say.

&gt;&gt;You may want to lay out or index your tables differently for OLTP and data warehousing

Yes. I don&#039;t think that anyone at Oracle Corp. would disagree with that. But they would probably suggest that Exadata 2 as a combined platform helps with that by making the process faster and reducing the complexity of data movement.

&gt;&gt; You may want to lay out your files differently for OLTP and data warehousing (e.g., in terms of block sizes).

Maybe so. It&#039;s a trade off between complexity of management and technical advantages. Oracle have the reputation, somewhat unjustly IMHO, of being high-maintenance and ASM&#039;s SAME approach is obviously intended to address that by offering to take away a decision point about these issues. That decision can still be addressed if you want to though, I would guess.

&gt;&gt; OLTP and analytic workloads step on each other’s toes

In Oracle that tends to be a disk issue. It looks like that is really being addressed though with more advanced query scheduling and I/O balancing (I/O resource management has not been addressed at all as far as I can think in pre-Exadata 2 Oracle), and OLTP-on-flash can only help.


Well I think you&#039;re right to be cautious, but I also think that Oracle are right to pursue this line. They&#039;re obviously putting a lot of effort into whacking the moles of the problem and I think the the discussion above shows that the primary concerns people have about it have been anticipated. Whether the solution is right on the first pass is still to be proven, but it&#039;s an exciting step in the right direction.

I think that if mixed workloads on a single machine are possible then Oracle have positioned themselves to be the only ones capable of reaching it in the near future.</description>
		<content:encoded><![CDATA[<p>@Curt</p>
<p>&gt;&gt; You might just want to run a separate brand of DBMS for your OLTP and data warehousing.</p>
<p>Oracle&#8217;s philosophy for a long time has been to take the code to the data, not to take the data to the code (eg. integration of OLAP, Data Mining, Rules Management etc), so Exadata 2 as a combined platform is very much in line with that. Having worked on extracting data to data mining and OLAP applications I&#8217;m very much inclined to agree with them &#8212; data movement is a pain.</p>
<p>However, running analytics against a live OLTP schema is a formidable problem. That doesn&#8217;t preclude hosting both types of database on the same machine though, with all the advantages that would bring. It looks like they&#8217;ve put soe serious thought into resolving DW and OLTP SLA&#8217;s also.</p>
<p>As for the opinion of Teradata, Greenplum etc. &#8230; &#8220;Man holding hammer says screws will never catch on&#8221;. They rather would say that, wouldn&#8217;t they? Solving the technical problems of creating a world-class OLTP RDBMS rather exceed those of creating an analytic platform, I&#8217;d say.</p>
<p>&gt;&gt;You may want to lay out or index your tables differently for OLTP and data warehousing</p>
<p>Yes. I don&#8217;t think that anyone at Oracle Corp. would disagree with that. But they would probably suggest that Exadata 2 as a combined platform helps with that by making the process faster and reducing the complexity of data movement.</p>
<p>&gt;&gt; You may want to lay out your files differently for OLTP and data warehousing (e.g., in terms of block sizes).</p>
<p>Maybe so. It&#8217;s a trade off between complexity of management and technical advantages. Oracle have the reputation, somewhat unjustly IMHO, of being high-maintenance and ASM&#8217;s SAME approach is obviously intended to address that by offering to take away a decision point about these issues. That decision can still be addressed if you want to though, I would guess.</p>
<p>&gt;&gt; OLTP and analytic workloads step on each other’s toes</p>
<p>In Oracle that tends to be a disk issue. It looks like that is really being addressed though with more advanced query scheduling and I/O balancing (I/O resource management has not been addressed at all as far as I can think in pre-Exadata 2 Oracle), and OLTP-on-flash can only help.</p>
<p>Well I think you&#8217;re right to be cautious, but I also think that Oracle are right to pursue this line. They&#8217;re obviously putting a lot of effort into whacking the moles of the problem and I think the the discussion above shows that the primary concerns people have about it have been anticipated. Whether the solution is right on the first pass is still to be proven, but it&#8217;s an exciting step in the right direction.</p>
<p>I think that if mixed workloads on a single machine are possible then Oracle have positioned themselves to be the only ones capable of reaching it in the near future.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: David Aldridge</title>
		<link>http://www.dbms2.com/2009/09/29/integration-oltp-data-warehousing-exadata-2/#comment-142212</link>
		<dc:creator>David Aldridge</dc:creator>
		<pubDate>Fri, 02 Oct 2009 06:34:50 +0000</pubDate>
		<guid isPermaLink="false">http://www.dbms2.com/?p=939#comment-142212</guid>
		<description>@Joe,

Yes, I agree that the data should be isolated into some kind of separate stuctures for data warehousing. As you say, historical data analysis is not well served by 3NF systems and Peoplesoft or SAP or course are not designed to do that.

However I don&#039;t think that claiming that Exadata 2 is suitable for mixed DW/OLTP workloads is the same thing as saying that you don&#039;t transform and move the data. I think it&#039;s very likely that you would. Whether that&#039;s in the same database or a different one housed on the same machine would be an interesting decision, and I wonder how co-hosting on the same hardware would affect that. Maybe it&#039;s the old-skool in me but I&#039;d look at separate, &quot;co-located&quot; databases first and look to be convinced that they can be part of the same database. Oracle&#039;s CDC solutions or transportable tablespaces would be interesting when running with the same Exadata 2 hardware as both source and target.</description>
		<content:encoded><![CDATA[<p>@Joe,</p>
<p>Yes, I agree that the data should be isolated into some kind of separate stuctures for data warehousing. As you say, historical data analysis is not well served by 3NF systems and Peoplesoft or SAP or course are not designed to do that.</p>
<p>However I don&#8217;t think that claiming that Exadata 2 is suitable for mixed DW/OLTP workloads is the same thing as saying that you don&#8217;t transform and move the data. I think it&#8217;s very likely that you would. Whether that&#8217;s in the same database or a different one housed on the same machine would be an interesting decision, and I wonder how co-hosting on the same hardware would affect that. Maybe it&#8217;s the old-skool in me but I&#8217;d look at separate, &#8220;co-located&#8221; databases first and look to be convinced that they can be part of the same database. Oracle&#8217;s CDC solutions or transportable tablespaces would be interesting when running with the same Exadata 2 hardware as both source and target.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Winston Chen</title>
		<link>http://www.dbms2.com/2009/09/29/integration-oltp-data-warehousing-exadata-2/#comment-142187</link>
		<dc:creator>Winston Chen</dc:creator>
		<pubDate>Thu, 01 Oct 2009 20:54:44 +0000</pubDate>
		<guid isPermaLink="false">http://www.dbms2.com/?p=939#comment-142187</guid>
		<description>The debate here is whether the same DB platform can serve multiple purposes, different load and query patterns, etc. This is the old generalist versus specialist debate. There&#039;re parallels in  biology and evolution, in finance about the merits of diversification, in management strategy about why conglomerates exists. Back to the IT world: Do you want your DB platform to specialize in solving a specific class of problems, but have a hard time moving beyond that, or do you want your platform to handle whatever you throw at it reasonably well but doesn&#039;t excel at any one thing? My personal opinion is that there is a place for both.</description>
		<content:encoded><![CDATA[<p>The debate here is whether the same DB platform can serve multiple purposes, different load and query patterns, etc. This is the old generalist versus specialist debate. There&#8217;re parallels in  biology and evolution, in finance about the merits of diversification, in management strategy about why conglomerates exists. Back to the IT world: Do you want your DB platform to specialize in solving a specific class of problems, but have a hard time moving beyond that, or do you want your platform to handle whatever you throw at it reasonably well but doesn&#8217;t excel at any one thing? My personal opinion is that there is a place for both.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Curt Monash</title>
		<link>http://www.dbms2.com/2009/09/29/integration-oltp-data-warehousing-exadata-2/#comment-142183</link>
		<dc:creator>Curt Monash</dc:creator>
		<pubDate>Thu, 01 Oct 2009 20:20:22 +0000</pubDate>
		<guid isPermaLink="false">http://www.dbms2.com/?p=939#comment-142183</guid>
		<description>Stray Cat,

Only your #1 and #7 really seem to require separate databases for DW and OLTP. And #7 seems pretty silly to me.

#1 could actually be addressed in a single DBMS by materialized views as well. But it&#039;s an excellent point even so.</description>
		<content:encoded><![CDATA[<p>Stray Cat,</p>
<p>Only your #1 and #7 really seem to require separate databases for DW and OLTP. And #7 seems pretty silly to me.</p>
<p>#1 could actually be addressed in a single DBMS by materialized views as well. But it&#8217;s an excellent point even so.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Stray__Cat</title>
		<link>http://www.dbms2.com/2009/09/29/integration-oltp-data-warehousing-exadata-2/#comment-142181</link>
		<dc:creator>Stray__Cat</dc:creator>
		<pubDate>Thu, 01 Oct 2009 20:11:18 +0000</pubDate>
		<guid isPermaLink="false">http://www.dbms2.com/?p=939#comment-142181</guid>
		<description>Why you need a database for business data analysis (i.e. a datawarehouse) EVEN IF you are a small company, have few data and your server can handle all the workload.

1) Your history is there even if your OLTP systems blows up. It’s not a matter of backups: the vendor goes belly up and there’s no further support, your business changes too much to keep using the old system,  your new CEO loves a different SW etc.

2) Building such a database forces you to think to an analysis model for your business. A stable model makes comparisons with the past possible. The need for changes in a DW mirrors the need to change the company strategy; changing too often means that your company does not know where she’s going.

3) Building such a database forces you to think to key performance indicators for your business. No business, no matter how small it is, is best managed with the bank bottom line only.

4)  Likely you have different systems with different master data. Likely you had them reconciled in your database or you have generated best practices for them.

5) Often business people think by categories not implemented in a business application. Your DW may be the only place where some data may reside and be applied to your analysis model.

6) If you have a decent Business Intelligence system in place, you can cope quickly to unexpected, one shot, requests and be ready when the request is issued again for another one shot.

7) having a different DB on a different technology make the DW users feel special compared to all those data entry people, and make it more acceptable to upper management.

It’s not a technical issue at all. Separating the two worlds makes sense from a business perspective.</description>
		<content:encoded><![CDATA[<p>Why you need a database for business data analysis (i.e. a datawarehouse) EVEN IF you are a small company, have few data and your server can handle all the workload.</p>
<p>1) Your history is there even if your OLTP systems blows up. It’s not a matter of backups: the vendor goes belly up and there’s no further support, your business changes too much to keep using the old system,  your new CEO loves a different SW etc.</p>
<p>2) Building such a database forces you to think to an analysis model for your business. A stable model makes comparisons with the past possible. The need for changes in a DW mirrors the need to change the company strategy; changing too often means that your company does not know where she’s going.</p>
<p>3) Building such a database forces you to think to key performance indicators for your business. No business, no matter how small it is, is best managed with the bank bottom line only.</p>
<p>4)  Likely you have different systems with different master data. Likely you had them reconciled in your database or you have generated best practices for them.</p>
<p>5) Often business people think by categories not implemented in a business application. Your DW may be the only place where some data may reside and be applied to your analysis model.</p>
<p>6) If you have a decent Business Intelligence system in place, you can cope quickly to unexpected, one shot, requests and be ready when the request is issued again for another one shot.</p>
<p>7) having a different DB on a different technology make the DW users feel special compared to all those data entry people, and make it more acceptable to upper management.</p>
<p>It’s not a technical issue at all. Separating the two worlds makes sense from a business perspective.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: RC</title>
		<link>http://www.dbms2.com/2009/09/29/integration-oltp-data-warehousing-exadata-2/#comment-142168</link>
		<dc:creator>RC</dc:creator>
		<pubDate>Thu, 01 Oct 2009 18:00:22 +0000</pubDate>
		<guid isPermaLink="false">http://www.dbms2.com/?p=939#comment-142168</guid>
		<description>@Joe Harris

In my previous job we stored both the now and the history of our data in our OLTP system. That was possible because the amount of data was small, no need to export the data to a dataware house and delete history in the OLTP system. By the way just storing present and past is hard but storing data with a startdate in the future makes it much, much more complicated. The past will always be the past but the future will become the present first before becoming the past. 

Most changes in this system where a combination of update and insert. You could query how &#039;life&#039; was 200 or 600 days ago or how it will be over 50 days. 

The business logic was very complicated and I think that splitting the data between an &#039;OLTP&#039; database and a &#039;dataware house&#039; database would have made it even more complicated and it would be costly and error-prone. Should that dataware house only contain history data or maybe future data too? When it has to contain future data too you will have to copy that data to the OLTP system one day. 

I don&#039;t understand how a dataware house somehow improves data that have become inconsistent in the OLTP environment. I think it is better to explore the possibilities of stuff like unit tests to improve applications. Or explore the possibilities of unique function based indexes or fast refresh materialized views to check the data consistency if you want. 

In my current job the amount of data of my customers is much larger and because of the large amount it is indeed needed to separately store the now data and the old data. Luckily we don&#039;t have to deal with future data:)</description>
		<content:encoded><![CDATA[<p>@Joe Harris</p>
<p>In my previous job we stored both the now and the history of our data in our OLTP system. That was possible because the amount of data was small, no need to export the data to a dataware house and delete history in the OLTP system. By the way just storing present and past is hard but storing data with a startdate in the future makes it much, much more complicated. The past will always be the past but the future will become the present first before becoming the past. </p>
<p>Most changes in this system where a combination of update and insert. You could query how &#8216;life&#8217; was 200 or 600 days ago or how it will be over 50 days. </p>
<p>The business logic was very complicated and I think that splitting the data between an &#8216;OLTP&#8217; database and a &#8216;dataware house&#8217; database would have made it even more complicated and it would be costly and error-prone. Should that dataware house only contain history data or maybe future data too? When it has to contain future data too you will have to copy that data to the OLTP system one day. </p>
<p>I don&#8217;t understand how a dataware house somehow improves data that have become inconsistent in the OLTP environment. I think it is better to explore the possibilities of stuff like unit tests to improve applications. Or explore the possibilities of unique function based indexes or fast refresh materialized views to check the data consistency if you want. </p>
<p>In my current job the amount of data of my customers is much larger and because of the large amount it is indeed needed to separately store the now data and the old data. Luckily we don&#8217;t have to deal with future data:)</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Joe Harris</title>
		<link>http://www.dbms2.com/2009/09/29/integration-oltp-data-warehousing-exadata-2/#comment-142149</link>
		<dc:creator>Joe Harris</dc:creator>
		<pubDate>Thu, 01 Oct 2009 16:23:21 +0000</pubDate>
		<guid isPermaLink="false">http://www.dbms2.com/?p=939#comment-142149</guid>
		<description>@RC  Don&#039;t take this the wrong way but: Have you worked on a data warehouse?

OLTP systems are designed to deal with *now*, in a small number of areas they will &quot;track history&quot;, occasionally they do it correctly.

However, most changes in OLTP take the form of an update. I feel a deep sense of joy when an app has a &quot;DateUpdated&quot; field. If I have to rely on the app to tell me the previous value then I&#039;m SOL.

I&#039;m going to quote myself here: &quot;Data isolation is the first step to data quality and integrity in the warehouse.&quot;

And add: A PITA (point in time architecture) is the second step to data quality and integrity in the warehouse.

…and yes I do relish the double entendre of that last acronym. :)</description>
		<content:encoded><![CDATA[<p>@RC  Don&#8217;t take this the wrong way but: Have you worked on a data warehouse?</p>
<p>OLTP systems are designed to deal with *now*, in a small number of areas they will &#8220;track history&#8221;, occasionally they do it correctly.</p>
<p>However, most changes in OLTP take the form of an update. I feel a deep sense of joy when an app has a &#8220;DateUpdated&#8221; field. If I have to rely on the app to tell me the previous value then I&#8217;m SOL.</p>
<p>I&#8217;m going to quote myself here: &#8220;Data isolation is the first step to data quality and integrity in the warehouse.&#8221;</p>
<p>And add: A PITA (point in time architecture) is the second step to data quality and integrity in the warehouse.</p>
<p>…and yes I do relish the double entendre of that last acronym. <img src='http://www.dbms2.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' /> </p>
]]></content:encoded>
	</item>
</channel>
</rss>

