Comments on: Three broad categories of data

By: Three kinds of software innovation, and whether patents could possibly work for them | DBMS 2 : DataBase Management System Services

Thu, 09 Jun 2011 04:36:34 +0000

[…] things that are described by terms like “unstructured” or “semi-structured” […]

By: Traditional databases will eventually wind up in RAM | DBMS 2 : DataBase Management System Services

Mon, 23 May 2011 16:05:28 +0000

[…] In January, 2010, I posted that it might be helpful to view data as being divided into three categories: […]

By: Curt Monash

Curt Monash — Thu, 24 Mar 2011 10:16:36 +0000

That depends, but often the answer is “not tabular” or “awkward fit for tabular.”

There are three main reasons for that. First, the list of possible event types is commonly too long for people to enjoy making separate columns for each one. See for example my post on eBay Singularity: http://www.dbms2.com/2010/10/06/ebay-followup-greenplum-out-teradata-10-petabytes-hadoop-has-some-value-and-more/

Second, the temporal relationships between events are commonly awkward to represent relationally. In many cases you can timestamp everything and then also store a derived field as to what’s part of the same event — still, there can be awkwardness.

Third, shoehorning logs into a tabular format might just lead to expense and bloat.

By: al kumar

al kumar — Thu, 24 Mar 2011 02:28:34 +0000

to be clear, using Curt’s taxonomy — is machine generated data non-tabular or tabular?

By: al kumar

al kumar — Thu, 24 Mar 2011 02:27:52 +0000

so does machine generated data have *structure*? that is to say, does it lend itself to a data model in the relational sense?

By: Examples and definition of machine-generated data | DBMS 2 : DataBase Management System Services

Tue, 01 Mar 2011 07:46:13 +0000

[…] posts made last December, January, and April, I […]

By: Mega-trends driving data warehousing and business intelligence | DBMS 2 : DataBase Management System Services

Sat, 22 Jan 2011 19:07:12 +0000

[…] A year ago, I divided data into three kinds: […]

By: How to Tame Big Bad Data | Kalido Conversations

How to Tame Big Bad Data | Kalido Conversations — Wed, 25 Aug 2010 16:35:49 +0000

[…] try to get a better understanding of the nature of Big Bad Data. Curt Monash wrote about the difference between machine-generated data and human-generated data. (For the purpose of […]

By: Examples of machine-generated data | DBMS2 -- DataBase Management System Services

Thu, 08 Apr 2010 19:20:21 +0000

[…] long ago I pointed out that much future Big Data growth will be in the area of machine-generated data, examples of which […]

By: Thoughts on IBM’s anti-Oracle announcements | DBMS2 -- DataBase Management System Services

Wed, 07 Apr 2010 16:12:38 +0000

[…] both highly important, those are very different things. IBM has not in the past shown much impressive technology in either of those two areas, and based […]