<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
		>
<channel>
	<title>Comments on: Three broad categories of data</title>
	<atom:link href="http://www.dbms2.com/2010/01/17/three-broad-categories-of-data/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.dbms2.com/2010/01/17/three-broad-categories-of-data/</link>
	<description>Choices in data management and analysis</description>
	<lastBuildDate>Thu, 09 Feb 2012 09:22:14 +0000</lastBuildDate>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.0.3</generator>
	<item>
		<title>By: Three kinds of software innovation, and whether patents could possibly work for them &#124; DBMS 2 : DataBase Management System Services</title>
		<link>http://www.dbms2.com/2010/01/17/three-broad-categories-of-data/#comment-226649</link>
		<dc:creator>Three kinds of software innovation, and whether patents could possibly work for them &#124; DBMS 2 : DataBase Management System Services</dc:creator>
		<pubDate>Thu, 09 Jun 2011 04:36:34 +0000</pubDate>
		<guid isPermaLink="false">http://www.dbms2.com/?p=1421#comment-226649</guid>
		<description>[...] things that are described by terms like “unstructured” or “semi-structured” [...]</description>
		<content:encoded><![CDATA[<p>[...] things that are described by terms like “unstructured” or “semi-structured” [...]</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Traditional databases will eventually wind up in RAM &#124; DBMS 2 : DataBase Management System Services</title>
		<link>http://www.dbms2.com/2010/01/17/three-broad-categories-of-data/#comment-223588</link>
		<dc:creator>Traditional databases will eventually wind up in RAM &#124; DBMS 2 : DataBase Management System Services</dc:creator>
		<pubDate>Mon, 23 May 2011 16:05:28 +0000</pubDate>
		<guid isPermaLink="false">http://www.dbms2.com/?p=1421#comment-223588</guid>
		<description>[...] In January, 2010, I posted that it might be helpful to view data as being divided into three categories: [...]</description>
		<content:encoded><![CDATA[<p>[...] In January, 2010, I posted that it might be helpful to view data as being divided into three categories: [...]</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Curt Monash</title>
		<link>http://www.dbms2.com/2010/01/17/three-broad-categories-of-data/#comment-214925</link>
		<dc:creator>Curt Monash</dc:creator>
		<pubDate>Thu, 24 Mar 2011 10:16:36 +0000</pubDate>
		<guid isPermaLink="false">http://www.dbms2.com/?p=1421#comment-214925</guid>
		<description>That depends, but often the answer is &quot;not tabular&quot; or &quot;awkward fit for tabular.&quot;

There are three main reasons for that. First, the list of possible event types is commonly too long for people to enjoy making separate columns for each one. See for example my post on eBay Singularity: http://www.dbms2.com/2010/10/06/ebay-followup-greenplum-out-teradata-10-petabytes-hadoop-has-some-value-and-more/

Second, the temporal relationships between events are commonly awkward to represent relationally. In many cases you can timestamp everything and then also store a derived field as to what&#039;s part of the same event -- still, there can be awkwardness.

Third, shoehorning logs into a tabular format might just lead to expense and bloat.</description>
		<content:encoded><![CDATA[<p>That depends, but often the answer is &#8220;not tabular&#8221; or &#8220;awkward fit for tabular.&#8221;</p>
<p>There are three main reasons for that. First, the list of possible event types is commonly too long for people to enjoy making separate columns for each one. See for example my post on eBay Singularity: <a href="http://www.dbms2.com/2010/10/06/ebay-followup-greenplum-out-teradata-10-petabytes-hadoop-has-some-value-and-more/" rel="nofollow">http://www.dbms2.com/2010/10/06/ebay-followup-greenplum-out-teradata-10-petabytes-hadoop-has-some-value-and-more/</a></p>
<p>Second, the temporal relationships between events are commonly awkward to represent relationally. In many cases you can timestamp everything and then also store a derived field as to what&#8217;s part of the same event &#8212; still, there can be awkwardness.</p>
<p>Third, shoehorning logs into a tabular format might just lead to expense and bloat.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: al kumar</title>
		<link>http://www.dbms2.com/2010/01/17/three-broad-categories-of-data/#comment-214879</link>
		<dc:creator>al kumar</dc:creator>
		<pubDate>Thu, 24 Mar 2011 02:28:34 +0000</pubDate>
		<guid isPermaLink="false">http://www.dbms2.com/?p=1421#comment-214879</guid>
		<description>to be clear, using Curt&#039;s taxonomy -- is machine generated data non-tabular or tabular?</description>
		<content:encoded><![CDATA[<p>to be clear, using Curt&#8217;s taxonomy &#8212; is machine generated data non-tabular or tabular?</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: al kumar</title>
		<link>http://www.dbms2.com/2010/01/17/three-broad-categories-of-data/#comment-214878</link>
		<dc:creator>al kumar</dc:creator>
		<pubDate>Thu, 24 Mar 2011 02:27:52 +0000</pubDate>
		<guid isPermaLink="false">http://www.dbms2.com/?p=1421#comment-214878</guid>
		<description>so does machine generated data have *structure*? that is to say, does it lend itself to a data model in the relational sense?</description>
		<content:encoded><![CDATA[<p>so does machine generated data have *structure*? that is to say, does it lend itself to a data model in the relational sense?</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Examples and definition of machine-generated data &#124; DBMS 2 : DataBase Management System Services</title>
		<link>http://www.dbms2.com/2010/01/17/three-broad-categories-of-data/#comment-211187</link>
		<dc:creator>Examples and definition of machine-generated data &#124; DBMS 2 : DataBase Management System Services</dc:creator>
		<pubDate>Tue, 01 Mar 2011 07:46:13 +0000</pubDate>
		<guid isPermaLink="false">http://www.dbms2.com/?p=1421#comment-211187</guid>
		<description>[...] posts made last December, January, and April, I [...]</description>
		<content:encoded><![CDATA[<p>[...] posts made last December, January, and April, I [...]</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Mega-trends driving data warehousing and business intelligence &#124; DBMS 2 : DataBase Management System Services</title>
		<link>http://www.dbms2.com/2010/01/17/three-broad-categories-of-data/#comment-205183</link>
		<dc:creator>Mega-trends driving data warehousing and business intelligence &#124; DBMS 2 : DataBase Management System Services</dc:creator>
		<pubDate>Sat, 22 Jan 2011 19:07:12 +0000</pubDate>
		<guid isPermaLink="false">http://www.dbms2.com/?p=1421#comment-205183</guid>
		<description>[...] A year ago, I divided data into three kinds: [...]</description>
		<content:encoded><![CDATA[<p>[...] A year ago, I divided data into three kinds: [...]</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: How to Tame Big Bad Data &#124; Kalido Conversations</title>
		<link>http://www.dbms2.com/2010/01/17/three-broad-categories-of-data/#comment-182146</link>
		<dc:creator>How to Tame Big Bad Data &#124; Kalido Conversations</dc:creator>
		<pubDate>Wed, 25 Aug 2010 16:35:49 +0000</pubDate>
		<guid isPermaLink="false">http://www.dbms2.com/?p=1421#comment-182146</guid>
		<description>[...] try to get a better understanding of the nature of Big Bad Data. Curt Monash wrote about the difference between machine-generated data and human-generated data. (For the purpose of [...]</description>
		<content:encoded><![CDATA[<p>[...] try to get a better understanding of the nature of Big Bad Data. Curt Monash wrote about the difference between machine-generated data and human-generated data. (For the purpose of [...]</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Examples of machine-generated data &#124; DBMS2 -- DataBase Management System Services</title>
		<link>http://www.dbms2.com/2010/01/17/three-broad-categories-of-data/#comment-164988</link>
		<dc:creator>Examples of machine-generated data &#124; DBMS2 -- DataBase Management System Services</dc:creator>
		<pubDate>Thu, 08 Apr 2010 19:20:21 +0000</pubDate>
		<guid isPermaLink="false">http://www.dbms2.com/?p=1421#comment-164988</guid>
		<description>[...] long ago I pointed out that much future Big Data growth will be in the area of machine-generated data, examples of which [...]</description>
		<content:encoded><![CDATA[<p>[...] long ago I pointed out that much future Big Data growth will be in the area of machine-generated data, examples of which [...]</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Thoughts on IBM&#8217;s anti-Oracle announcements &#124; DBMS2 -- DataBase Management System Services</title>
		<link>http://www.dbms2.com/2010/01/17/three-broad-categories-of-data/#comment-164876</link>
		<dc:creator>Thoughts on IBM&#8217;s anti-Oracle announcements &#124; DBMS2 -- DataBase Management System Services</dc:creator>
		<pubDate>Wed, 07 Apr 2010 16:12:38 +0000</pubDate>
		<guid isPermaLink="false">http://www.dbms2.com/?p=1421#comment-164876</guid>
		<description>[...] both highly important, those are very different things. IBM has not in the past shown much impressive technology in either of those two areas, and based [...]</description>
		<content:encoded><![CDATA[<p>[...] both highly important, those are very different things. IBM has not in the past shown much impressive technology in either of those two areas, and based [...]</p>
]]></content:encoded>
	</item>
</channel>
</rss>

