<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>DBMS 2 : DataBase Management System Services &#187; pureXML</title>
	<atom:link href="http://www.dbms2.com/category/products-and-vendors/ibm-and-db2/purexml/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.dbms2.com</link>
	<description>Choices in data management and analysis</description>
	<lastBuildDate>Thu, 09 Feb 2012 09:21:51 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.0.3</generator>
		<item>
		<title>Schema flexibility and XML data management</title>
		<link>http://www.dbms2.com/2008/10/05/schema-flexibility-and-xml-data-management/</link>
		<comments>http://www.dbms2.com/2008/10/05/schema-flexibility-and-xml-data-management/#comments</comments>
		<pubDate>Sun, 05 Oct 2008 12:53:36 +0000</pubDate>
		<dc:creator>Curt Monash</dc:creator>
				<category><![CDATA[Data models and architecture]]></category>
		<category><![CDATA[EAI, EII, ETL, ELT, ETLT]]></category>
		<category><![CDATA[IBM and DB2]]></category>
		<category><![CDATA[Structured documents]]></category>
		<category><![CDATA[pureXML]]></category>

		<guid isPermaLink="false">http://www.dbms2.com/?p=594</guid>
		<description><![CDATA[Conor O&#8217;Mahony, marketing manager for IBM&#8217;s DB2 pureXML talks a lot about one of my favorite hobbyhorses &#8212; schema flexibility &#8212; as a reason to use an XML data model. In a number of industries he sees use cases based around ongoing change in the information being managed: Tax authorities change their rules and forms [...]]]></description>
			<content:encoded><![CDATA[<p style="margin-bottom: 0in;"><a href="http://nativexmldatabase.com/">Conor O&#8217;Mahony</a>, marketing manager for <a href="http://www.dbms2.com/2008/10/05/overview-of-ibm-db2-purexml/">IBM&#8217;s DB2 pureXML</a> talks a lot about one of my favorite hobbyhorses &#8212; <strong>schema flexibility</strong> &#8212; as a reason to use an XML data model.  In a number of industries he sees use cases based around ongoing change in the information being managed:</p>
<ul>
<li>Tax authorities change their rules 	and forms every year, but don&#8217;t want to do total rewrites of their 	electronic submission and processing software.</li>
<li>The financial services industry 	keeps inventing new products, which don&#8217;t just have different terms 	and conditions, but may also have different <em>kinds</em> of terms 	and conditions.</li>
<li>The same, to some extent, goes for 	the travel industry, which also keeps adding different and different 	kinds of destinations.</li>
<li>The energy industry keeps adding 	new kinds of highly complex equipment it has to manage.</li>
</ul>
<p style="margin-bottom: 0in;">Conor also thinks market evidence shows that XML&#8217;s schema flexibility is important for data interchange.<span id="more-594"></span> For example, hospitals (especially in the US) have disparate medical records and billing systems, which can make information interchange a chore.</p>
<p style="margin-bottom: 0in;">The second suggestion is probably the less controversial of the two.  After all, everybody knows that data is very commonly exchanged in XML formats.  So if it gets persisted in XML format somewhere along the way, even relational purists shouldn&#8217;t much mind, as long as it eventually gets into what they regard as a more properly structured database.  (Besides &#8212; if the data is going on long, challenging, multi-stage journeys, then nobody should much blame it if it indeed wants to stop along the way somewhere and rest. <img src='http://www.dbms2.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  )</p>
<p style="margin-bottom: 0in;">In the first group of examples, there&#8217;s usually also a kind of cooperation between native XML and other kinds of database managers.  Before those users had access to XML, they were getting by just fine using other database technology.  So XML can be used in conjunction with other systems, not as complete replacement.  Even so, it&#8217;s reasonable to consider scenarios in which <strong>XML is the primary data model of record, and relational/tabular copies of the information are secondary.</strong></p>
<p style="margin-bottom: 0in;">For example, an income tax authority wants to store your tax form in its entirety, so that they can check both your truthfulness and your arithmetic.  This is most naturally done in XML, although for many years it&#8217;s been done in relational or pre-relational technologies. They also want to aggregate a limited amount of information from each taxpayer&#8217;s form for all sorts of aggregation and administrative purposes; that&#8217;s best done in a relational database. But the part that belongs in XML is the most fundamental.</p>
<p style="margin-bottom: 0in;">As another example, the core information of a derivatives transaction is:</p>
<ul>
<li>The derivatives contract 	(naturally stored in XML)</li>
<li>The actual purchase/sale 	information (traditionally stored in relational systems)</li>
<li>Account balances of various kinds 	altered by the transaction (a classic case where relational 	databases guarantee much-needed data integrity)</li>
</ul>
<p style="margin-bottom: 0in;">Here the majority of the basic record fits best in XML.  The minority that fits best in a relational system is small enough that a good XML DBMS can probably handle it as well.  Neither the superior OLTP performance nor data integrity safeguards of a relational DBMS are needed for the purchase/sale information.  They <em>are</em> needed for the general account management – but again, that&#8217;s a relatively secondary or (no pun intended!) derivative part of the overall database.</p>
<p style="margin-bottom: 0in;">So what we&#8217;re coming up with here is a strategy along the lines of:</p>
<ol>
<li><strong>Use XML for your system of 	record.</strong></li>
<li><strong>Spawn transactions in your 	relational/tabular data stores right away.</strong></li>
</ol>
<p style="margin-bottom: 0in;">And by the way, while I haven&#8217;t dwelled on this – those relational/tabular data stores could be data warehouses instead of or in addition to transactional systems.</p>
<p style="margin-bottom: 0in;">Obviously, there are two major classes of objections to this strategy (when it is contrasted with a traditional relational approach):</p>
<ul>
<li>Assertions that the extra 	programming effort needed to assure data integrity are so important 	as to outweigh all other consideration.</li>
<li>Assertions that the need for 	schema flexibility isn&#8217;t really that high, or at least wouldn&#8217;t be 	if the enterprises&#8217; database designers were sufficiently competent.</li>
</ul>
<p style="margin-bottom: 0in;">Well, we&#8217;ll see.  So far the customer uptake for the native XML approach is small but non-zero. And thus the issue is far from being decided.</p>
<p style="margin-bottom: 0in;">
<p style="margin-bottom: 0in;">
]]></content:encoded>
			<wfw:commentRss>http://www.dbms2.com/2008/10/05/schema-flexibility-and-xml-data-management/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
		<item>
		<title>Vertical market XML standards</title>
		<link>http://www.dbms2.com/2008/10/05/vertical-market-xml-standards/</link>
		<comments>http://www.dbms2.com/2008/10/05/vertical-market-xml-standards/#comments</comments>
		<pubDate>Sun, 05 Oct 2008 12:43:31 +0000</pubDate>
		<dc:creator>Curt Monash</dc:creator>
				<category><![CDATA[Application areas]]></category>
		<category><![CDATA[EAI, EII, ETL, ELT, ETLT]]></category>
		<category><![CDATA[IBM and DB2]]></category>
		<category><![CDATA[Structured documents]]></category>
		<category><![CDATA[pureXML]]></category>

		<guid isPermaLink="false">http://www.dbms2.com/?p=593</guid>
		<description><![CDATA[Tracking the alphabet soup of vertical market XML standards is hard. So as a starting point, I&#8217;m splitting a list I got from IBM into a standalone post. Among the most important or successful IBM pureXML-supported standards, in terms of downloads and other evidence of customer interest, are: UNIFI (UNIversal Financial Industry message scheme). According [...]]]></description>
			<content:encoded><![CDATA[<p style="margin-bottom: 0in;">Tracking the alphabet soup of vertical market XML standards is hard.  So as a starting point, I&#8217;m splitting a list I got from IBM into a standalone post.</p>
<p style="margin-bottom: 0in;">Among the most important or successful <a href="http://www.dbms2.com/2008/10/05/overview-of-ibm-db2-purexml/">IBM pureXML</a>-<a href="http://www.alphaworks.ibm.com/tech/purexml">supported</a> standards, in terms of downloads and other evidence of customer interest, are:<span id="more-593"></span></p>
<ul>
<li><em>UNIFI (UNIversal Financial Industry message scheme). </em><span style="font-style: normal;">According to ISO 20022, it&#8217;s a 	standard that any electronic fund transfer must be accompanied by a 	UNIFI document. This standard seems important at least in SEPA (the 	Single European Payment Area).  UNIFI seems to be a major pureXML 	use case.</span></li>
<li><em>FpML (Financial Products Markup Language)</em>.  FpML is 	used in the derivatives market, to actually create contracts and 	hence derivative securities.</li>
<li><em>ACORD (Association for Cooperative Operations, Research 	and Development).</em> When one discusses XML industry standards, 	ACORD is usually one of the first to come up.  It is used in the 	life insurance agency, also for contract definition/creation. The 	top use case is agents, although speculate it&#8217;s used among 	investment managers and reinsurers as well.</li>
<li><em>STAR (Standards for Technology in Automotive Retail).</em> STAR is used by car manufacturers and retailers to exchange 	information – about cars, no doubt, and probably some other things 	as well.</li>
<li><em>HR-XML,</em> which is used by human resource departments 	for resumes and other kinds of employee information.</li>
<li><em>OTA (Open Travel Alliance). </em><span style="font-style: normal;">This 	is an XML schema used by usual-suspect players in the travel 	business – hotel chains, car renters, travel agents, and probably 	anybody else who deals in travel reservations.</span></li>
</ul>
<p>I&#8217;ve heard at least about the financial services ones from other 	XML database vendors as well.  I&#8217;m a little surprised that nothing 	from the health area made the top of the list.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.dbms2.com/2008/10/05/vertical-market-xml-standards/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>Overview of IBM DB2 pureXML</title>
		<link>http://www.dbms2.com/2008/10/05/overview-of-ibm-db2-purexml/</link>
		<comments>http://www.dbms2.com/2008/10/05/overview-of-ibm-db2-purexml/#comments</comments>
		<pubDate>Sun, 05 Oct 2008 12:34:51 +0000</pubDate>
		<dc:creator>Curt Monash</dc:creator>
				<category><![CDATA[EAI, EII, ETL, ELT, ETLT]]></category>
		<category><![CDATA[IBM and DB2]]></category>
		<category><![CDATA[Structured documents]]></category>
		<category><![CDATA[pureXML]]></category>

		<guid isPermaLink="false">http://www.dbms2.com/?p=592</guid>
		<description><![CDATA[On August 29, I had a great call with IBM about DB2 pureXML (most of the IBM side of the talking was done by Conor O&#8217;Mahony and Qi Jin). I&#8217;m finally getting around to writing it up now. (The world of tabular data warehousing has kept me just a wee bit busy &#8230;) As I [...]]]></description>
			<content:encoded><![CDATA[<p style="margin-bottom: 0in;">On August 29, I had a great call with IBM about DB2 pureXML (most of the IBM side of the talking was done by Conor O&#8217;Mahony and Qi Jin).  I&#8217;m <em>finally</em> getting around to writing it up now. (The world of tabular data warehousing has kept me just a wee bit busy &#8230;)</p>
<p style="margin-bottom: 0in;">As I write it, I see there are a considerable number of holes, but that&#8217;s the way it seems to go when researching XML storage.  I&#8217;m also writing up a September call from which I finally figured out (I think) the essence of <a href="http://www.dbms2.com/2008/10/05/marklogic-architecture-deep-dive/">how MarkLogic Server works</a> – but only after five months of trying. It turns out that MarkLogic works rather differently from DB2 pureXML.  Not coincidentally, IBM and Mark Logic focus on rather different use cases for native XML storage.</p>
<p style="margin-bottom: 0in;">What I understand so far about the basic DB2 pureXML architecture goes like this:<span id="more-592"></span></p>
<ul>
<li><strong>DB2 pureXML stores XML in “true 	hierarchical format.” </strong> Based on all the discussion of 	indexing, I&#8217;m guessing that the way it does this is somewhat similar 	to that in <em>MarkLogic</em>.</li>
<li>Unlike MarkLogic, DB2 pureXML 	gives you the choice of what tags to index on.</li>
<li>In a big difference from 	Marklogic, <strong>text search on DB2 pureXML involves two separate 	indexes</strong> – XML and text (the latter being of the usual 	inverted-list variety). You can text-search both contents and tags, 	with the usual CONTAINS semantics.</li>
<li>PureXML has <strong>a data store 	separate from the rest of DB2&#8242;s,</strong> notwithstanding IBM&#8217;s 	references to XML “columns.”  DB2&#8242;s general datatype 	extensibility framework is not used; I don&#8217;t wholly understand why.</li>
<li>I neglected to ask how well DB2 	backup, management tools, and so on extend to DB2 pureXML.</li>
<li>You can talk to DB2 pureXML via 	two data manipulation languages: <strong>SQL/XML, </strong><span>and </span><strong>XQuery.</strong> Both are compiled down to the same run-time 	instructions.  IBM said there&#8217;s an abstraction layer sitting over 	both the relational store and the XML store that allows for this  I 	don&#8217;t totally understand what that means, since presumably the 	SQL/XML starts out by being sent to DB2&#8242;s parser.</li>
</ul>
<p style="margin-bottom: 0in;">A big part of IBM&#8217;s XML business strategy is to support various (typically vertical market) XML standards. IBM has implemented support for these standards and made it freely downloadable. What does “support” mean? It surely starts with a DTD (Document Type Definition), and apparently also includes <a href="http://www.alphaworks.ibm.com/tech/purexml">mappings to generic web services interfaces</a>.  It turns out that there are a lot of them, so I&#8217;m listing some in a <a href="http://www.dbms2.com/2008/10/05/vertical-market-xml-standards/">separate post</a>.</p>
<p style="margin-bottom: 0in;">More generally, it seems that the sales and uses for IBM pureXML are concentrated in two main (overlapping) cases:</p>
<ul>
<li><strong>When XML was going to be used 	anyway.</strong> One big example of this is the case of the 	standards-based industry data interchanges.   Another example is 	when pureXML, albeit disk-based, acts as a kind of quasi-cache or 	mini-MDM hub (Master Data Management) for WebSphere-based enterprise 	application integration (EAI).  IBM reports that DB2 pureXML has 	been sold as an intermediate EAI data store at least once each in 	banking, retailing, health care, and insurance.</li>
<li><strong>When schema flexibility is of 	great importance.</strong></li>
</ul>
<p style="margin-bottom: 0in;">Experience teaches me that schema flexibility is a subject that can attract considerable flames, in the general vein of “Omigod! The relational model is perfect because it&#8217;s mathematically proven to ensure referential integrity!!” So I&#8217;ll split out the main discussion of that into yet another <em>separate post,</em> and keep going.</p>
<p style="margin-bottom: 0in;">IBM actually breaks out the pureXML use cases into four main groups:</p>
<ol>
<li><span style="font-style: normal;"><strong>Transactional.</strong></span><em> </em> This comprises the transactional logging of information that 	just happens to be XML, such as in financial services.</li>
<li><span style="font-style: normal;"><strong>Forms-oriented.</strong></span> This comprises, for example, the tax authority use case.</li>
<li><span style="font-style: normal;"><strong>Service 	bus acceleration.</strong></span> That&#8217;s a fancy phrase to cover both 	the standards-based interchanges and the other EAI-related uses.</li>
<li><span style="font-style: normal;"><strong>Event-driven 	data warehousing. </strong></span> This one was kind of blurry to me.  	What I think it means is that if you have transactional data in XML, 	and you want to use it in near-real-time business intelligence, DB2 	pureXML can help you with that.</li>
</ol>
<p style="margin-bottom: 0in;">#1, 3, and 4 seem to fit into my “When XML was going to be used anyway” category.  Part of “Schema flexibility” matches #2; I&#8217;m not clear on where in IBM&#8217;s four buckets the rest of schema flexibility goes.</p>
<p style="margin-bottom: 0in;">Finally, I asked directly in what areas there were significant numbers of DB2 pureXML customers.  IBM offered two examples.  One was financial services in general &#8212; especially in North America, notwithstanding the importance of the UNIFI standard in Europe.  The other was health care data interchange outside the United States &#8212; especially in China, where regional and national centers are being established to more closely oversee local hospitals.</p>
<p style="margin-bottom: 0in;"><em><strong>Related links</strong></em></p>
<ul>
<li>IBM kindly gave me permission to 	make available the <a href="http://www.monash.com/uploads/DB2-pureXML-Aug-2008.pdf">slide presentation</a> from our August 29 briefing.  	The last page has a large number of links to further IBM pureXML 	resources.</li>
<li>Conor O&#8217;Mahony has a <a href="http://nativexmldatabase.com/">good blog</a>.</li>
<li>As noted above, I am putting up 	separate posts on <a href="http://www.dbms2.com/2008/10/05/vertical-market-xml-standards/">standards-based data interchange</a> and <a href="http://www.dbms2.com/2008/10/05/schema-flexibility-and-xml-data-management/">schema 	flexibility</a>.</li>
</ul>
]]></content:encoded>
			<wfw:commentRss>http://www.dbms2.com/2008/10/05/overview-of-ibm-db2-purexml/feed/</wfw:commentRss>
		<slash:comments>5</slash:comments>
		</item>
	</channel>
</rss>

