<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
		>
<channel>
	<title>Comments on: The Mark Logic story in XML database management</title>
	<atom:link href="http://www.dbms2.com/2008/04/29/the-mark-logic-story-in-xml-database-management/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.dbms2.com/2008/04/29/the-mark-logic-story-in-xml-database-management/</link>
	<description>Choices in data management and analysis</description>
	<lastBuildDate>Fri, 05 Feb 2010 04:25:37 -0500</lastBuildDate>
	<generator>http://wordpress.org/?v=2.8.4</generator>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
		<item>
		<title>By: Conor O'Mahony</title>
		<link>http://www.dbms2.com/2008/04/29/the-mark-logic-story-in-xml-database-management/comment-page-1/#comment-94731</link>
		<dc:creator>Conor O'Mahony</dc:creator>
		<pubDate>Tue, 19 Aug 2008 18:29:43 +0000</pubDate>
		<guid isPermaLink="false">http://www.dbms2.com/?p=412#comment-94731</guid>
		<description>[Disclaimer: I am an IBM employee.]

There are some IBM references in this post that I&#039;d like to comment on:

 - &quot;XML processing&quot; is a broad topic.  A specific deployment, although a perfectly valid data point, is not necessarily representative of all XML processing tasks.  MarkLogic has a lot of success with &quot;content&quot; use cases, but what about &quot;data&quot; use cases?  For instance, I heard of a situation where MarkLogic struggled with insert/update/delete performance in an environment with an extreme number of small XML documents.  I also heard of issues with MarkLogic returning very large result sets.  My intention is not to pick on MarkLogic here.  They are a good vendor with good technology.  I&#039;d just like to point out that there are situations where every vendor struggles, and to not put too much weight on those situations unless you know that they closely match your situation.

 - IBM continues to be, to the best of my knowledge, the only vendor to disclose XML performance results with *all* details of the workload, hardware, and configuration that was being used (&lt;a href=&quot;http://tpox.sourceforge.net/&quot; rel=&quot;nofollow&quot;&gt;SourceForge: Transaction Processing over XML&lt;/a&gt;).  IBM also has several customer case studies that substantiate excellent levels of performance, and numerous customers who have spoken about their positive experiences with DB2 pureXML at conferences worldwide.

 - Regarding indexing, there is a cost to the MarkLogic approach.  As you know, MarkLogic indexes all element values and all attribute values in all paths.  And they maintain this index &lt;em&gt;synchronously&lt;/em&gt;.  This, of course, incurs a cost during insert, update, and delete operations.  It is possible that this partly explains the performance issues I mentioned above.  In my opinion, the DB2 approach of choosing exactly what you want to index is often better.  (Note that to optimize what is indexed does require that you know the queries you will process.)

Also, keep in mind that hybrid relational/XML data servers offer several advantages over XML-only data servers, including:

 - Many organizations have existing systems based on relational data and SQL (and ODBC or JDBC).  In such situations, a complete rip-and-replace to use an XML-only vendor can be quite traumatic.  Our experience is that a hybrid relational/XML store has eased the path to native XML adoption for many such organizations.  

 - We have also found that many application scenarios are not XML-only, but involve XML and relational data.  But, of course, it is natural that our experiences match the areas in which we possibly have stronger synergies.

 - Some XML-only vendors lack some of the more advanced DBMS capabilities, like point-in-time-restore capabilities.  Also, the established relational/XML vendors do have a wide ecosystem of database tools vendors who work with their products.</description>
		<content:encoded><![CDATA[<p>[Disclaimer: I am an IBM employee.]</p>
<p>There are some IBM references in this post that I&#8217;d like to comment on:</p>
<p> &#8211; &#8220;XML processing&#8221; is a broad topic.  A specific deployment, although a perfectly valid data point, is not necessarily representative of all XML processing tasks.  MarkLogic has a lot of success with &#8220;content&#8221; use cases, but what about &#8220;data&#8221; use cases?  For instance, I heard of a situation where MarkLogic struggled with insert/update/delete performance in an environment with an extreme number of small XML documents.  I also heard of issues with MarkLogic returning very large result sets.  My intention is not to pick on MarkLogic here.  They are a good vendor with good technology.  I&#8217;d just like to point out that there are situations where every vendor struggles, and to not put too much weight on those situations unless you know that they closely match your situation.</p>
<p> &#8211; IBM continues to be, to the best of my knowledge, the only vendor to disclose XML performance results with *all* details of the workload, hardware, and configuration that was being used (<a href="http://tpox.sourceforge.net/" onclick="javascript:pageTracker._trackPageview('/tpox.sourceforge.net');" rel="nofollow">SourceForge: Transaction Processing over XML</a>).  IBM also has several customer case studies that substantiate excellent levels of performance, and numerous customers who have spoken about their positive experiences with DB2 pureXML at conferences worldwide.</p>
<p> &#8211; Regarding indexing, there is a cost to the MarkLogic approach.  As you know, MarkLogic indexes all element values and all attribute values in all paths.  And they maintain this index <em>synchronously</em>.  This, of course, incurs a cost during insert, update, and delete operations.  It is possible that this partly explains the performance issues I mentioned above.  In my opinion, the DB2 approach of choosing exactly what you want to index is often better.  (Note that to optimize what is indexed does require that you know the queries you will process.)</p>
<p>Also, keep in mind that hybrid relational/XML data servers offer several advantages over XML-only data servers, including:</p>
<p> &#8211; Many organizations have existing systems based on relational data and SQL (and ODBC or JDBC).  In such situations, a complete rip-and-replace to use an XML-only vendor can be quite traumatic.  Our experience is that a hybrid relational/XML store has eased the path to native XML adoption for many such organizations.  </p>
<p> &#8211; We have also found that many application scenarios are not XML-only, but involve XML and relational data.  But, of course, it is natural that our experiences match the areas in which we possibly have stronger synergies.</p>
<p> &#8211; Some XML-only vendors lack some of the more advanced DBMS capabilities, like point-in-time-restore capabilities.  Also, the established relational/XML vendors do have a wide ecosystem of database tools vendors who work with their products.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Text Technologies &#187; Blog Archive &#187; Mark Logic viewed as a different kind of text search technology vendor</title>
		<link>http://www.dbms2.com/2008/04/29/the-mark-logic-story-in-xml-database-management/comment-page-1/#comment-83311</link>
		<dc:creator>Text Technologies &#187; Blog Archive &#187; Mark Logic viewed as a different kind of text search technology vendor</dc:creator>
		<pubDate>Tue, 29 Apr 2008 12:44:43 +0000</pubDate>
		<guid isPermaLink="false">http://www.dbms2.com/?p=412#comment-83311</guid>
		<description>[...] two posts this morning on Mark Logic and it&#8217;s MarkLogic product family. The main one, over on DBMS2, outlines the technical architecture &#8212; focusing on MarkLogic as an XML database management [...]</description>
		<content:encoded><![CDATA[<p>[...] two posts this morning on Mark Logic and it&#8217;s MarkLogic product family. The main one, over on DBMS2, outlines the technical architecture &#8212; focusing on MarkLogic as an XML database management [...]</p>
]]></content:encoded>
	</item>
</channel>
</rss>

<!-- Dynamic page generated in 0.258 seconds. -->
<!-- Cached page generated by WP-Super-Cache on 2010-02-05 05:43:27 -->
