<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>DBMS2 -- DataBase Management System Services &#187; Data mart outsourcing</title>
	<atom:link href="http://www.dbms2.com/category/analytics-technologies/data-mart-warehouse-outsourcing/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.dbms2.com</link>
	<description>Choices in data management and analysis</description>
	<lastBuildDate>Fri, 30 Jul 2010 15:51:32 +0000</lastBuildDate>
	<generator>http://wordpress.org/?v=2.8.4</generator>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
			<item>
		<title>More on Sybase IQ, including Version 15.2</title>
		<link>http://www.dbms2.com/2010/05/23/sybase-iq-15/</link>
		<comments>http://www.dbms2.com/2010/05/23/sybase-iq-15/#comments</comments>
		<pubDate>Sun, 23 May 2010 08:34:28 +0000</pubDate>
		<dc:creator>Curt Monash</dc:creator>
				<category><![CDATA[Analytic technologies]]></category>
		<category><![CDATA[Application areas]]></category>
		<category><![CDATA[Columnar database management]]></category>
		<category><![CDATA[Data mart outsourcing]]></category>
		<category><![CDATA[Data warehousing]]></category>
		<category><![CDATA[Database compression]]></category>
		<category><![CDATA[Investment research and trading]]></category>
		<category><![CDATA[Market share]]></category>
		<category><![CDATA[Petabyte-scale data management]]></category>
		<category><![CDATA[Sybase]]></category>
		<category><![CDATA[Telecommunications]]></category>
		<category><![CDATA[Text]]></category>

		<guid isPermaLink="false">http://www.dbms2.com/?p=2186</guid>
		<description><![CDATA[Back in March, Sybase was kind enough to give me permission to post a slide deck about Sybase IQ. Well, I&#8217;m finally getting around to doing so. Highlights include but are not limited to:

Slide 2 has some market success figures and so on. (&#62;3100 copies at &#62;1800 users, &#62;200 sales last year)
Slides 6-11 give more [...]]]></description>
			<content:encoded><![CDATA[<p>Back in March, Sybase was kind enough to give me permission to post <a href="http://www.monash.com/uploads/Sybase-IQ-slides-March-2010.pdf" onclick="javascript:pageTracker._trackPageview('/www.monash.com');">a slide deck about Sybase IQ</a>. Well, I&#8217;m finally getting around to doing so. Highlights include but are not limited to:</p>
<ul>
<li>Slide 2 has some market success figures and so on. (&gt;3100 copies at &gt;1800 users, &gt;200 sales last year)</li>
<li>Slides 6-11 give more detail on Sybase&#8217;s indexing and data access methods than I put into my recent <a href="http://www.dbms2.com/2010/05/17/technical-basics-of-sybase-iq/" >technical basics of Sybase IQ</a> post.</li>
<li>Slide 16 reminds us that in-database data mining is quite competitive with what <a href="http://www.dbms2.com/2010/05/15/further-clarifying-in-database-mpp-sas/" >SAS has actually delivered with its DBMS partners</a>, even if it doesn&#8217;t have the nice architectural approach of <a href="http://www.dbms2.com/2010/02/22/netezza-twinfin/" >Aster or Netezza</a>. (I.e., Sybase IQ&#8217;s more-than-SQL advanced analytics story relies on C++ UDFs  &#8212; User Defined Functions &#8212; running in-process with the DBMS.) In particular, there&#8217;s a data mining/predictive analytics library &#8212; modeling and scoring both &#8212; licensed from a small third party.</li>
<li>A number of the other later slides also have quite a bit of technical crunch. (More on some of those points below too.)</li>
</ul>
<p>Sybase IQ may have a bit of a funky architecture (e.g., no MPP), but the age of the product and the substantial revenue it generates have allowed Sybase to put in a bunch of product features that newer vendors haven&#8217;t gotten around to yet.</p>
<p>More recently, Sybase volunteered permission for me to preannounce <strong>Sybase IQ Version 15.2</strong> by a few days (it&#8217;s scheduled to come out this week). <span id="more-2186"></span>Sybase IQ seems to be focused on large part on the government/intelligent market, with three major features being:</p>
<ul>
<li>A kind of <strong>data federation,</strong> querying external databases, that makes sense mainly in the context of rigorous security rules. (I find that confusing, since Sybase IQ&#8217;s indexes tend to hold all the information in the database, but I didn&#8217;t push the point.)</li>
<li>An upgrade to Sybase IQ&#8217;s built-in <strong>text indexing.</strong> I doubt anybody would confuse this with best-of-breed text search, but evidently that intelligence community is satisfied with less. But even before 15.2, Sybase IQ could do both LIKE and WHERE CONTAINS searching.</li>
<li>Improved LOB (Large OBject) management.</li>
</ul>
<p>One part of my Sybase IQ conversations I haven&#8217;t blogged yet in much details is <strong>scale-out, concurrency, </strong>and<strong> &#8220;multiplexing.&#8221;</strong></p>
<ul>
<li>Sybase feels that Sybase IQ&#8217;s competitive sweet spot, especially in terms of performance, is reached when there are 20 or more concurrent queries.</li>
<li>In general, Sybase asserts that a shared-everything architecture is great for concurrency &#8212; just run different queries on different boxes, all against the same data.</li>
<li>The ability to use a bunch of boxes run Sybase IQ is called &#8220;multiplexing.&#8221;  This is a chargeable option, without which one is limited to a single SMP box.</li>
<li>Just under 20% of the top 250 Sybase IQ customers have multi-node scale-out configuration (vs. single-node SMP scale-up). And around 8% have it overall.</li>
<li>Sybase IQ nodes can be heterogeneous (e.g., in compute power).</li>
<li>Sybase IQ nodes can be dedicated to be read-only, or can be read-write. Indeed, Sybase IQ nodes can change roles dynamically, for example becoming write-only during nightly batch load. (I didn&#8217;t clarify whether all this applies just to nodes-as-boxes, or if some parts apply to specific processors or cores within the same box.)</li>
<li>Sybase noted that data mart outsourcers can offer differentiated SLAs (Service Level Agreements) depending upon which nodes they give which customers access to.</li>
<li>Most Sybase IQ installations start at 8 cores or more. The Sybase IQ Small Business Edition, limited to 4 cores, is not a big seller.</li>
<li>Sybase IQ has a straightforward round-robin load-balancing story via third-party technology.</li>
</ul>
<p>Finally, along the way in the discussions I picked up various tidbits about the Sybase IQ user base. Unfortunately, Sybase is pretty vague in discussing database sizes &#8212; are they user data? Are they compressed? What do the numbers mean? With that huge caveat:</p>
<ul>
<li>By some metric or other, a couple of classified customers are approaching petabyte scale.</li>
<li>The largest commercial Sybase IQ customer &#8212; a credit card company &#8212; has a couple hundred terabytes or so.</li>
<li>The largest financial services Sybase IQ databases are 50-70 terabytes. This sounds low, frankly, so maybe those are compressed figures, with user data being 200+ terabytes. But I&#8217;m just speculating there.</li>
<li>Sybase IQ has a little less than 100 customers in the &#8220;data aggregator&#8221; market, which is a lot like what I call &#8220;data mart outsourcer.&#8221;</li>
<li><a href="http://www.dbms2.com/2009/08/25/sybase-iq-technical-highlights/" >Sybase IQ&#8217;s ILM technology</a> is a chargeable option, with Sybase being &#8220;cautious&#8221; about sales. Compliance is a big market driver for it.</li>
<li>Sybase IQ&#8217;s #1 vertical market is financial services. Other biggies are government, telecom, marketing services, and to some extent retail.</li>
<li>As of February, there were 40-45 production users of Sybase IQ 15.0 and 15.1.</li>
</ul>
<p><!-- 		@page { margin: 0.79in } 		P { margin-bottom: 0.08in } --></p>
]]></content:encoded>
			<wfw:commentRss>http://www.dbms2.com/2010/05/23/sybase-iq-15/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Stakeholder-facing analytics</title>
		<link>http://www.dbms2.com/2010/05/15/stakeholder-facing-analytics/</link>
		<comments>http://www.dbms2.com/2010/05/15/stakeholder-facing-analytics/#comments</comments>
		<pubDate>Sat, 15 May 2010 07:58:05 +0000</pubDate>
		<dc:creator>Curt Monash</dc:creator>
				<category><![CDATA[Analytic technologies]]></category>
		<category><![CDATA[Business intelligence]]></category>
		<category><![CDATA[Data mart outsourcing]]></category>
		<category><![CDATA[Fox and MySpace]]></category>
		<category><![CDATA[PostgreSQL]]></category>

		<guid isPermaLink="false">http://www.dbms2.com/?p=2149</guid>
		<description><![CDATA[There&#8217;s a point I keep making in speeches, and used to keep making in white papers, yet have almost never spelled out in this blog. Let me now (somewhat) correct the oversight.
Analytic technology isn&#8217;t only for you. It&#8217;s also for your customers, citizens, and other stakeholders.
I am not referring here to what is well understood [...]]]></description>
			<content:encoded><![CDATA[<p>There&#8217;s a point I keep making in speeches, and used to keep making in white papers, yet have almost never spelled out in this blog. Let me now (somewhat) correct the oversight.</p>
<p><strong>Analytic technology isn&#8217;t only for you. It&#8217;s also for your customers, citizens, and other stakeholders.</strong></p>
<p>I am <strong>not</strong> referring here to what is well understood to be an important, fast-growing activity &#8212; providing data and its analysis to customers as your primary or only business &#8212; nor to the related business of taking people&#8217;s data, crunching it for them, and giving them results. That combined sector &#8212; which I am pretty alone in aggregating into one and calling <a href="http://www.dbms2.com/category/analytics-technologies/data-mart-warehouse-outsourcing/" >data mart outsourcing</a> &#8212; is one of the top several vertical markets for a lot of the analytic DBMS vendors I write about. Rather, I&#8217;m talking about enterprises that gather data for some primary purpose, and have discovered that a good <strong>secondary</strong> use of the data is to reflect it back to stakeholders, often the same ones who provided or created it in the first place.</p>
<p>For now I&#8217;ll call this category <strong>stakeholder-facing analytics,</strong> as the shorter phrase &#8220;stakeholder analytics&#8221; would be ambiguous.* I first picked up the idea early this decade from Information Builders, for whom it had become something of a specialty. I&#8217;ve been asking analytics vendors for examples of stakeholder-facing analytics ever since, and a number have been able to comply. But the whole thing is in its early days even so; almost any sufficiently large enterprise should be more active in stakeholder-facing analytics than it currently is.<br />
<span id="more-2149"></span><br />
<em>*Comments as to what the category</em> should<em> be called are welcome below.</em></p>
<p>Examples of stakeholder-facing analytics include:</p>
<ul>
<li>Enterprises report back on the business customers do with them. For example:
<ul>
<li>Credit card companies provide reports on spending back to their credit card holders, especially small businesses.</li>
<li>So do office supply retailers.</li>
<li>Brokerage firms provide reporting back to their small-institution customers.</li>
</ul>
</li>
<li>Governments expose information to their citizens online.
<ul>
<li>In an early example, New York City restaurant ratings were put online.</li>
<li><a href="http://sec.gov/edgar/searchedgar/companysearch.html" onclick="javascript:pageTracker._trackPageview('/sec.gov');">Putting SEC filings online</a> has has been a huge success.</li>
<li>The Obama Administration has committed to putting <a href="http://www.data.gov/catalog" onclick="javascript:pageTracker._trackPageview('/www.data.gov');">large amounts of information</a> online.</li>
</ul>
</li>
<li>Regulated companies (such as utilities) could be required to put data online directly, without even using the government as an intermediary.</li>
<li>Some part of Fox &#8212; perhaps MySpace Music? &#8212; offers free access to a PostgreSQL extract from <a href="http://www.dbms2.com/2009/03/05/fox-interactive-medias-multi-hundred-terabyte-database-running-on-greenplum/" >its Greenplum database</a> to each of its largest advertisers.</li>
<li>Google Analytics offers some basic BI for free to website owners everywhere.</li>
<li>Anybody from web hosting companies to public utilities could open their kimonos and allow their customers to track adherence to actual or implied SLAs (Service Level Agreements) in areas such as uptime, length of outage, responsiveness, and the like.</li>
</ul>
<p>So what cool examples do you have of stakeholder-facing analytics?*</p>
<p><em>*Yes, this is an invitation to drop links to case studies into the comment thread below. </em></p>
]]></content:encoded>
			<wfw:commentRss>http://www.dbms2.com/2010/05/15/stakeholder-facing-analytics/feed/</wfw:commentRss>
		<slash:comments>4</slash:comments>
		</item>
		<item>
		<title>Infobright blog update</title>
		<link>http://www.dbms2.com/2010/03/19/infobright-blog-update/</link>
		<comments>http://www.dbms2.com/2010/03/19/infobright-blog-update/#comments</comments>
		<pubDate>Fri, 19 Mar 2010 13:42:01 +0000</pubDate>
		<dc:creator>Curt Monash</dc:creator>
				<category><![CDATA[Columnar database management]]></category>
		<category><![CDATA[Data mart outsourcing]]></category>
		<category><![CDATA[Data warehousing]]></category>
		<category><![CDATA[Infobright]]></category>
		<category><![CDATA[Log analysis]]></category>
		<category><![CDATA[Market share]]></category>
		<category><![CDATA[Open source]]></category>
		<category><![CDATA[Web analytics]]></category>

		<guid isPermaLink="false">http://www.dbms2.com/?p=1733</guid>
		<description><![CDATA[I often offer that, if a company puts up a sufficiently good blog post, I&#8217;ll link to it. Well, I just noticed that Infobright CEO Mark Burton (somewhere along the way he seems to have dropped the “interim”) put up an excellent post last month.
Highlights on the market share/sector side include:

Infobright’s customer base grew 500% [...]]]></description>
			<content:encoded><![CDATA[<p style="margin-bottom: 0in;">I often offer that, if a company puts up a sufficiently good blog post, I&#8217;ll link to it. Well, I just noticed that Infobright CEO Mark Burton <span style="text-decoration: line-through;">(somewhere along the way he seems to have dropped the “interim”)</span> put up <a href="http://www.infobright.com/Blog/Entry/infobright_strategy_and_plans" onclick="javascript:pageTracker._trackPageview('/www.infobright.com');">an excellent post</a> last month.</p>
<p style="margin-bottom: 0in;">Highlights on the market share/sector side include:<span id="more-1733"></span></p>
<ul>
<li>Infobright’s customer base grew 500% over the past year, to 	120 paying customers.</li>
<li>This included end users (60%), as well as ISVs and SaaS 	providers (40%) who embed Infobright&#8217;s DBMS in their application.</li>
<li>During the same period, Infobright&#8217;s open source software was 	downloaded 35,000 times.</li>
<li>The end user applications were heavily clustered around web 	and online analytics tracking, with a focus on understanding 	customer behavior on the web.</li>
<li>Infobright also continues to see the growth of 	application-specific data marts.</li>
<li>There is also continued interest and growth in using 	Infobright technology to analyze IT logs and telecom CDR (Call 	Detail Record) data, to identify fraud or security issues, to 	understand and improve network performance, and other purposes.</li>
</ul>
<p>Product highlights include:</p>
<ul>
<li>Infobright be much more transparent in 2010 about its plans.</li>
<li>Infobright will start posting and commenting on future 	releases and themes in March of this year. (However, they haven&#8217;t 	run much of that by me yet, and we&#8217;re past the middle of March.)</li>
<li>Infobright expects to drop 3-4 interim releases for every 	major release, with at least two major releases in 2010.</li>
<li>Some of Infobright&#8217;s major improvements this year will be:
<ul>
<li>Continued SMP performance improvements “without the need 	for complex hardware configurations or administrative effort”.</li>
<li>Extending the “hit rate” of the Knowledge Grid, which is 	central to Infobright&#8217;s performance story.</li>
<li>Better international support with UTF-8 extensions.</li>
</ul>
</li>
</ul>
]]></content:encoded>
			<wfw:commentRss>http://www.dbms2.com/2010/03/19/infobright-blog-update/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Netezza Skimmer</title>
		<link>http://www.dbms2.com/2010/01/25/netezza-skimmer/</link>
		<comments>http://www.dbms2.com/2010/01/25/netezza-skimmer/#comments</comments>
		<pubDate>Mon, 25 Jan 2010 14:39:00 +0000</pubDate>
		<dc:creator>Curt Monash</dc:creator>
				<category><![CDATA[Data mart outsourcing]]></category>
		<category><![CDATA[Data warehouse appliances]]></category>
		<category><![CDATA[Data warehousing]]></category>
		<category><![CDATA[Netezza]]></category>
		<category><![CDATA[Pricing]]></category>

		<guid isPermaLink="false">http://www.dbms2.com/?p=1441</guid>
		<description><![CDATA[As I previously complained, last week wasn&#8217;t a very convenient time for me to have briefings. So when Netezza emailed to say it would release its new entry-level Skimmer appliance this morning, while I asked for and got a Friday afternoon briefing, I kept it quick and basic.
That said, highlights of my Netezza Skimmer briefing [...]]]></description>
			<content:encoded><![CDATA[<p>As I previously <a href="http://www.dbms2.com/2010/01/22/oracle-database-hardware-strategy/" >complained</a>, last week wasn&#8217;t a very convenient time for me to have briefings. So when Netezza emailed to say it would release its new entry-level Skimmer appliance this morning, while I asked for and got a Friday afternoon briefing, I kept it quick and basic.</p>
<p>That said, highlights of my Netezza Skimmer briefing included:</p>
<ul>
<li>In essence, Netezza Skimmer is 1/3 of Netezza&#8217;s previously smallest appliance, for 1/3 the price.</li>
<li>I.e., Netezza Skimmer has 1 S-blade and 9 disks, vs. 3 S-blades and 24 disks on the Netezza TwinFin 3.</li>
<li>With 1 disk reserved as a hot spare, that boils down to a 1:1:1 ratio among CPU cores, FPGA cores, and 1-terabyte disks on Netezza skimmer. The same could pretty much be said of Netezza TwinFin, the occasional hot-spare disk notwithstanding.</li>
<li>Netezza Skimmer costs $125K.</li>
<li>With 2.8 or so TB of space for user data before compression, that&#8217;s right in line with the <a href="http://www.dbms2.com/2009/07/30/the-netezza-price-point/" >Netezza price point</a> of slightly &lt;$20K/terabyte of user data.</li>
<li>That assumes Netezza&#8217;s usual 2.25X compression. I forgot to ask when 4X compression was actually being shipped.</li>
<li>I forgot to ask, but it seems obvious that Netezza Skimmer uses identical or substantially similar components to Netezza TwinFin&#8217;s.</li>
<li>Netezza Skimmer is 7 rack units high.</li>
<li>In place of the SMP hosts on TwinFin Systems, Netezza Skimmer has a host blade.</li>
<li>Netezza (specifically Phil Francisco) mentioned that when Kalido uses Netezza Skimmer for its appliance, there will be an additional host computer, but when it uses TwinFin for the same software, the built-in host will suffice. (Even so, I suspect it might be too strong to say that Skimmer&#8217;s built-in host computer is underpowered.)</li>
<li>Netezza also suggested that more appliance OEMs are coming down the pike specifically focused on the affordable Skimmer.</li>
</ul>
<p><span id="more-1441"></span>Obviously, Netezza Skimmer isn&#8217;t breaking any new technical ground. If Netezza had just called Skimmer &#8220;TwinFin 1,&#8221; nobody should have objected. So the main news here is that you can buy a Netezza box for $125K, plug it in, load a few terabytes of data, and be good to go with a pretty solid data warehouse.  For enterprises and data mart outsourcers with databases of the appropriate size, that could be a pretty attractive deal.</p>
<p>Is Netezza Skimmer as cheap as buying your own hardware and putting (free) <a href="http://www.dbms2.com/2009/10/19/greenplum-free-single-node-edition/" >Greenplum Single-Node Edition</a> software on it? Not even close, especially since Greenplum&#8217;s free option limits you to lower overall compute power. Does Netezza Skimmer have as high availability as more expensive alternatives? In some cases, surely not. Skimmer is neither the cheapest thing around nor an utterly high-end product.</p>
<p>But Netezza Skimmer belongs on a lot of short lists even so.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.dbms2.com/2010/01/25/netezza-skimmer/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>Infobright notes</title>
		<link>http://www.dbms2.com/2009/10/14/infobright-notes/</link>
		<comments>http://www.dbms2.com/2009/10/14/infobright-notes/#comments</comments>
		<pubDate>Wed, 14 Oct 2009 19:32:36 +0000</pubDate>
		<dc:creator>Curt Monash</dc:creator>
				<category><![CDATA[Analytic technologies]]></category>
		<category><![CDATA[Data mart outsourcing]]></category>
		<category><![CDATA[Data warehousing]]></category>
		<category><![CDATA[Infobright]]></category>
		<category><![CDATA[Investment research and trading]]></category>
		<category><![CDATA[Kickfire]]></category>
		<category><![CDATA[Log analysis]]></category>
		<category><![CDATA[Market share]]></category>
		<category><![CDATA[MySQL]]></category>
		<category><![CDATA[Open source]]></category>
		<category><![CDATA[Telecommunications]]></category>
		<category><![CDATA[Web analytics]]></category>

		<guid isPermaLink="false">http://www.dbms2.com/?p=1091</guid>
		<description><![CDATA[I had lunch w/ Bob Zurek and Susan Davis of Infobright today. This wasn&#8217;t primarily a briefing, but a few takeaways are:

Infobright now has &#62;100 paying customers.
Typical database size is from the low 100s of gigabytes to the low single-digit number of terabytes.
Agile development is at or approaching two-week release cycles.
Like Kickfire, Infobright  has a [...]]]></description>
			<content:encoded><![CDATA[<p>I had lunch w/ Bob Zurek and Susan Davis of Infobright today. This wasn&#8217;t primarily a briefing, but a few takeaways are:</p>
<ul>
<li>Infobright now has &gt;100 paying customers.</li>
<li>Typical database size is from the low 100s of gigabytes to the low single-digit number of terabytes.</li>
<li>Agile development is at or approaching two-week release cycles.</li>
<li>Like Kickfire, Infobright  has a multi-year deal with MySQL that insulates it against many potential Oracle/MySQL shenanigans.</li>
<li>From an industry perspective, Infobright&#8217;s customer base sounds a lot like other vendors&#8217;:
<ul>
<li>Data mart outsourcing/online analytics</li>
<li>Log files for websites</li>
<li>Telecommunications</li>
<li>Financial services</li>
<li>OEM, especially in the markets cited above</li>
<li>&#8220;Hey, we&#8217;re beginning to see the occasional energy deal&#8221;</li>
<li>A few random others</li>
</ul>
</li>
<li>Infobright is seeing some household-name customers, who surely have big-name analytic DBMS products, but who also have a policy that open source is the default choice, and if open source can get the job done then the favorite closed-source choices aren&#8217;t used.</li>
<li>Infobright has the usual open-source community story &#8212; lots of involvement and engagement in the forums, but contributions are limited mainly to connectivity, utility scripts, etc. (Maybe some national language translation too; I&#8217;m not sure.)</li>
</ul>
]]></content:encoded>
			<wfw:commentRss>http://www.dbms2.com/2009/10/14/infobright-notes/feed/</wfw:commentRss>
		<slash:comments>7</slash:comments>
		</item>
		<item>
		<title>What Nielsen really uses in data warehousing DBMS</title>
		<link>http://www.dbms2.com/2009/09/29/a-c-nielsen-data-warehousing-dbms/</link>
		<comments>http://www.dbms2.com/2009/09/29/a-c-nielsen-data-warehousing-dbms/#comments</comments>
		<pubDate>Tue, 29 Sep 2009 15:04:26 +0000</pubDate>
		<dc:creator>Curt Monash</dc:creator>
				<category><![CDATA[Analytic technologies]]></category>
		<category><![CDATA[Data mart outsourcing]]></category>
		<category><![CDATA[Data warehouse appliances]]></category>
		<category><![CDATA[Data warehousing]]></category>
		<category><![CDATA[Netezza]]></category>
		<category><![CDATA[Oracle]]></category>
		<category><![CDATA[Specific users]]></category>

		<guid isPermaLink="false">http://www.dbms2.com/?p=950</guid>
		<description><![CDATA[In its latest earnings call, Oracle made a reference to The Nielsen Company that was &#8212; to put it politely &#8212; rather confusing. I just plopped down in a chair next to Greg Goff, who evidently runs data warehousing at Nielsen, and had a quick chat. Here&#8217;s the real story.

The Nielsen Company has over half [...]]]></description>
			<content:encoded><![CDATA[<p>In its latest earnings call, Oracle made <a href="../2009/09/19/oracle-database-siz/">a reference to The Nielsen Company</a> that was &#8212; to put it politely &#8212; rather confusing. I just plopped down in a chair next to Greg Goff, who evidently runs data warehousing at Nielsen, and had a quick chat. Here&#8217;s the real story.</p>
<ul>
<li>The Nielsen Company has over half a 	petabyte of data on Netezza in the US. This installation is growing.</li>
<li>The Nielsen Company indeed has 45 	terabytes or whatever of data on Oracle in its European (Customer) 	Information Factory. This is not particularly growing. Nielsen&#8217;s 	Oracle data warehouse has been built up over the past 9 years. It&#8217;s 	not new. It&#8217;s certainly not on Exadata, nor planned to move to 	Exadata.</li>
<li>These are not single-instance 	databases. Nielsen&#8217;s biggest single Netezza database is 20 terabytes 	or so of user data, and its biggest single Oracle database is 10 	terabytes or so.</li>
<li>Much (most?) of the rest of the 	installations are customer data marts and the like, based in each 	case on the “big” central database. (That&#8217;s actually a classic 	<a href="../2009/06/08/the-future-of-data-marts/">data 	mart use case</a>.) Greg said that Netezza&#8217;s capabilities to spin 	out those databases seemed pretty good.</li>
<li>That 10 terabyte Oracle data 	warehouse instance requires a lot of partitioning effort and so on 	in the usual way.</li>
<li>Nielsen has no immediate plans to 	replace Oracle with Netezza.</li>
<li>Nielsen actually has 800 terabytes 	or so of Netezza equipment. Some of that is kept more lightly loaded, 	for performance.</li>
</ul>
]]></content:encoded>
			<wfw:commentRss>http://www.dbms2.com/2009/09/29/a-c-nielsen-data-warehousing-dbms/feed/</wfw:commentRss>
		<slash:comments>5</slash:comments>
		</item>
		<item>
		<title>Sybase IQ business notes</title>
		<link>http://www.dbms2.com/2009/08/25/sybase-iq-business-notes/</link>
		<comments>http://www.dbms2.com/2009/08/25/sybase-iq-business-notes/#comments</comments>
		<pubDate>Tue, 25 Aug 2009 09:14:56 +0000</pubDate>
		<dc:creator>Curt Monash</dc:creator>
				<category><![CDATA[Analytic technologies]]></category>
		<category><![CDATA[Data mart outsourcing]]></category>
		<category><![CDATA[Data warehousing]]></category>
		<category><![CDATA[Investment research and trading]]></category>
		<category><![CDATA[Sybase]]></category>

		<guid isPermaLink="false">http://www.dbms2.com/?p=870</guid>
		<description><![CDATA[As specialized analytic DBMS go, Sybase is near the top of the charts both in age (Sybase IQ was first introduced in the mid 1990s) and adoption.  That&#8217;s even more true, of course, if we restrict the discussion strictly to columnar DBMS, aka column stores.  Basic Sybase IQ adoption claims include:

&#62;1500 users
&#62;3000 installations [...]]]></description>
			<content:encoded><![CDATA[<p style="margin-bottom: 0in;">As specialized analytic DBMS go, Sybase is near the top of the charts both in age (Sybase IQ was first introduced in the mid 1990s) and adoption.  That&#8217;s even more true, of course, if we restrict the discussion strictly to columnar DBMS, aka column stores.  Basic Sybase IQ adoption claims include:</p>
<ul>
<li>&gt;1500 users</li>
<li>&gt;3000 installations (Sybase has 	variously cited 2.1 and 2.5+ as the installation/user ratio)</li>
<li>At least ~50-60 installations with 	&gt;5 terabytes of user data</li>
</ul>
<p style="margin-bottom: 0in;">Note that 98% of Sybase IQ installations are under 5 terabytes; the heart of Sybase IQ&#8217;s business is the sub-terabyte data warehouse market.*<span id="more-870"></span></p>
<p><em>*Unlike most other analytic DBMS startups, <a href="../2009/08/21/kickfires-fpga-based-technical-strategy/">Kickfire seems to be increasingly pursuing</a>. that market too.</em></p>
<p style="margin-bottom: 0in; font-style: normal;">Sybase IQ was traditionally sold mainly to users of Sybase&#8217;s core Adaptive Server Enterprise DBMS (whether or not they ran other DBMS such as Oracle as well). Sybase recently has become more aggressive about selling IQ into non-Sybase shops.  More generally, Sybase seems to have repositioned IQ in 2005, decided it liked the results, and ramped up investment in Sybase IQ as of 2006.</p>
<p style="margin-bottom: 0in; font-style: normal;">The way Sybase breaks down its different target markets is somewhat confusing, but so far as I can tell:</p>
<ul>
<li>A whole lot of 	Sybase IQ installations are focused on straight reporting.</li>
<li>Sybase is 	beefing up its efforts and penetration for IQ in “advanced 	analytics.” How advanced that is to date is a little unclear.</li>
<li>Sybase claims 	80-90+ Sybase IQ customers in the “data aggregator” business, 	counting fairly narrowly.</li>
<li>Financial 	services is, unsurprisingly, a special-case market of particular focus.</li>
</ul>
<p style="margin-bottom: 0in; font-style: normal;">Sybase IQ pricing is traditionally complicated; perhaps one of these months Sybase will clarify it for me.  The latest iteration appears to be mainly per-core, but I don&#8217;t have a good sense for what kinds of workloads can be handled by what number of cores.</p>
<p style="margin-bottom: 0in;"><em><strong>Related links</strong></em></p>
<ul>
<li><a href="http://www.dbms2.com/2009/08/25/sybase-iq-technical-highlights/" >Sybase IQ technical highlights</a></li>
</ul>
<p style="margin-bottom: 0in; font-style: normal;">
]]></content:encoded>
			<wfw:commentRss>http://www.dbms2.com/2009/08/25/sybase-iq-business-notes/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
		<item>
		<title>What are the best choices for scaling Postgres?</title>
		<link>http://www.dbms2.com/2009/07/29/scaling-postgres-choices/</link>
		<comments>http://www.dbms2.com/2009/07/29/scaling-postgres-choices/#comments</comments>
		<pubDate>Wed, 29 Jul 2009 06:16:02 +0000</pubDate>
		<dc:creator>Curt Monash</dc:creator>
				<category><![CDATA[Cache]]></category>
		<category><![CDATA[Clustering]]></category>
		<category><![CDATA[Data mart outsourcing]]></category>
		<category><![CDATA[EnterpriseDB and Postgres Plus]]></category>
		<category><![CDATA[In-memory DBMS]]></category>
		<category><![CDATA[Memory-centric data management]]></category>
		<category><![CDATA[MySQL]]></category>
		<category><![CDATA[OLTP]]></category>
		<category><![CDATA[Open source]]></category>
		<category><![CDATA[Parallelization]]></category>
		<category><![CDATA[PostgreSQL]]></category>

		<guid isPermaLink="false">http://www.dbms2.com/?p=849</guid>
		<description><![CDATA[I have a client who wants to build a new application with peak update volume of several million transactions per hour.  (Their base business is data mart outsourcing, but now they&#8217;re building update-heavy technology as well. ) They have a small budget.  They&#8217;ve been a MySQL shop in the past, but would prefer to contract [...]]]></description>
			<content:encoded><![CDATA[<p>I have a client who wants to build a new application with peak update volume of several million transactions per hour.  (Their base business is data mart outsourcing, but now they&#8217;re building update-heavy technology as well. ) They have a small budget.  They&#8217;ve been a MySQL shop in the past, but would prefer to contract (not eliminate) their use of MySQL rather than expand it.</p>
<p>My client actually signed a deal for EnterpriseDB&#8217;s Postgres Plus Advanced Server and GridSQL, but unwound the transaction quickly. (They say EnterpriseDB was very gracious about the reversal.) There seem to have been two main reasons for the flip-flop.  First, it seems that EnterpriseDB&#8217;s version of Postgres isn&#8217;t up to PostgreSQL&#8217;s 8.4 feature set yet, although EnterpriseDB&#8217;s timetable for catching up might have tolerable. But GridSQL apparently is further behind yet, with no timetable for up-to-date PostgreSQL compatibility.  That was the dealbreaker.</p>
<p>The current base-case plan is to use generic open source PostgreSQL, with scale-out achieved via hand sharding, Hibernate, or &#8230; ??? Experience and thoughts along those lines would be much appreciated.</p>
<p>Another option for OLTP performance and scale-out is of course memory-centric options such as <a href="http://www.dbms2.com/2009/06/22/h-store-horizontica-voltdb/" >VoltDB</a> or <a href="http://www.dbms2.com/2009/07/28/the-groovy-sql-switch/" >the Groovy SQL Switch</a>.  But this client&#8217;s database is terabyte-scale, so hardware costs could be an issue, as of course could be product maturity.</p>
<p>By the way, a large fraction of these updates will be actual changes, as opposed to new records, in case that matters.  I expect that the schema being updated will be very simple &#8212; i.e., clearly simpler than in a classic order entry scenario.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.dbms2.com/2009/07/29/scaling-postgres-choices/feed/</wfw:commentRss>
		<slash:comments>30</slash:comments>
		</item>
		<item>
		<title>Data warehousing business trends</title>
		<link>http://www.dbms2.com/2009/02/26/data-warehousing-business-trends/</link>
		<comments>http://www.dbms2.com/2009/02/26/data-warehousing-business-trends/#comments</comments>
		<pubDate>Thu, 26 Feb 2009 19:06:27 +0000</pubDate>
		<dc:creator>Curt Monash</dc:creator>
				<category><![CDATA[Analytic technologies]]></category>
		<category><![CDATA[Application areas]]></category>
		<category><![CDATA[Data mart outsourcing]]></category>
		<category><![CDATA[Data warehousing]]></category>
		<category><![CDATA[Microsoft and SQL*Server]]></category>
		<category><![CDATA[MySQL]]></category>
		<category><![CDATA[Oracle]]></category>
		<category><![CDATA[Teradata]]></category>
		<category><![CDATA[eBay]]></category>

		<guid isPermaLink="false">http://www.dbms2.com/?p=707</guid>
		<description><![CDATA[I&#8217;ve talked with a whole lot of vendors recently, some here at TDWI, as well as users, fellow analysts, and so on. Repeated themes include:

Large enterprise data warehouse 	projects are often being deferred or cut back.  (My sense is that a 	little of this happened in 2008, but more is happening with new 	budget [...]]]></description>
			<content:encoded><![CDATA[<p style="margin-bottom: 0in;">I&#8217;ve talked with a whole lot of vendors recently, some here at TDWI, as well as users, fellow analysts, and so on. Repeated themes include:<span id="more-707"></span></p>
<ul>
<li>Large enterprise data warehouse 	projects are often being deferred or cut back.  (My sense is that a 	little of this happened in 2008, but more is happening with new 	budget cycles in 2009.)</li>
<li>Smaller projects with credible, 	quick ROIs are doing fine. In some cases, these are the scaled-back 	or nose-under-the-tent parts of bigger enterprise-wide efforts.</li>
<li>Perhaps not coincidentally, the 	technical trend of having a variety of data marts inside an EDW – 	sometimes called “sandboxes” &#8212; is going strong.  I&#8217;ve commented 	on that before in connection with <a href="http://www.dbms2.com/2009/02/23/microsoft-sql-server-fast-track/" ><em>Microsoft</em></a>.  It&#8217;s also at the heart of eBay&#8217;s Teradata-based &#8220;<a href="http://www.intelligententerprise.com/showArticle.jhtml?articleID=211200065" onclick="javascript:pageTracker._trackPageview('/www.intelligententerprise.com');">analytics-as-a-service</a>&#8221; strategy.<em></em></li>
<li>Uses of data 	warehousing for security or compliance still seem strong.  I guess 	“Keep us safe (and out of jail)” is still a strong motivator for 	buying.</li>
<li>Oracle&#8217;s is 	the #1 installed base in which smaller vendors go fishing. 	Teradata&#8217;s is probably #2. Microsoft SQL Server and MySQL are also 	suffering data warehouse competitive replacements.</li>
<li>I haven&#8217;t 	heard many more examples of enterprises <a href="http://www.dbms2.com/2009/02/07/analytics-role-in-a-frightening-economy/" >doing more analysis</a> because the bad economy has invalidated their prior models and 	assumptions.  More&#8217;s the pity.</li>
<li>Projects to 	provide data to one&#8217;s customers are going gangbusters.  There are 	many flavors of this, from pure third-party analytics vendors to 	(for example) credit card companies who sell transactional data back 	to their merchants to governments that become more “transparent” 	by exposing information to their citizens.</li>
</ul>
<p style="margin-bottom: 0in; font-style: normal;">Obviously, if the economy is bad enough, everything will be hurt. E.g., a lot of those data sellers have something to do with advertising, and the underlying business sector is in the tank. Ditto consumer or mass B-to-B marketing as well.  But for now, the worst hits are being suffered by projects with large price tags, long lead times, and unclear benefits.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.dbms2.com/2009/02/26/data-warehousing-business-trends/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Database SaaS gains a little visibility</title>
		<link>http://www.dbms2.com/2009/01/12/database-saas-gains-a-little-visibility/</link>
		<comments>http://www.dbms2.com/2009/01/12/database-saas-gains-a-little-visibility/#comments</comments>
		<pubDate>Mon, 12 Jan 2009 15:47:32 +0000</pubDate>
		<dc:creator>Curt Monash</dc:creator>
				<category><![CDATA[1010data]]></category>
		<category><![CDATA[Analytic technologies]]></category>
		<category><![CDATA[Business intelligence]]></category>
		<category><![CDATA[Cloud computing]]></category>
		<category><![CDATA[Data mart outsourcing]]></category>
		<category><![CDATA[Data warehousing]]></category>
		<category><![CDATA[Information Builders]]></category>
		<category><![CDATA[Kognitio]]></category>
		<category><![CDATA[Software as a Service (SaaS)]]></category>
		<category><![CDATA[Vertica Systems]]></category>

		<guid isPermaLink="false">http://www.dbms2.com/?p=657</guid>
		<description><![CDATA[Way back in the 1970s, a huge fraction of analytic database management was done via timesharing, specifically in connection with the RAMIS and FOCUS business-intelligence-precursor fourth-generation languages.  (Both were written by Gerry Cohen, who built his company Information Builders around the latter one.)  The market for remoting-computing business intelligence has never wholly gone away since. [...]]]></description>
			<content:encoded><![CDATA[<p>Way back in the 1970s, a huge fraction of analytic database management was done via timesharing, specifically in connection with the RAMIS and FOCUS business-intelligence-precursor fourth-generation languages.  (Both were written by Gerry Cohen, who built his company Information Builders around the latter one.)  The market for remoting-computing business intelligence has never wholly gone away since. Indeed, it&#8217;s being revived now, via everything from the analytics part of Salesforce.com to the service category I call <a href="http://www.dbms2.com/2008/05/08/outsourced-data-marts/" >data mart outsourcing</a>.</p>
<p>Less successful to date are efforts in the area of pure database software-as-a-service.  It seems that if somebody is going for SaaS anyway, they usually want a more complete, integrated offering. The most noteworthy exceptions I can think of to this general rule are Kognitio and Vertica, and they only have a handful of database SaaS customers each. To wit:<span id="more-657"></span></p>
<p>1.  <strong>Kognitio</strong> has built a lot of its marketing around database SaaS, which it calls DaaS for data-as-a-service, and runs primarily from its own facility.  On a small sample size, it reports a very roughly 50-50 split in new business activity (that&#8217;s customers/prospects, not revenue) between DaaS and conventionally licensed software.</p>
<p>2.  <strong>Vertica</strong> has expressed <a href="http://www.dbms2.com/2008/07/01/jerry-held-cloud-data-warehousing-business-intelligence/" >high hopes</a> for its <a href="http://www.dbms2.com/2008/05/13/vertica-in-the-cloud/" >Amazon cloud offering</a>. Actual production usage has so far only matched part of that, but it isn&#8217;t exactly zero either. Specifically, marketing chief Dave Menninger writes by email:</p>
<blockquote><p>In addition to approximately a dozen POCs running on the cloud at any point in time we have five customers using the cloud on a regular  basis. Three of these customers do short lived projects so they start up instances, run them for the duration of a project, and shut them  down. They are three different types of orgs: govt agency, pharma  consulting org and SaaS provider.</p>
<p>Two financial services companies use the cloud as spare resource/capacity.  When they need additional computing resource or capacity they will temporarily move some projects onto the cloud with the anticipation of moving them back off once the capacity constraint  is relieved (new hardware arrives, other projects or systems come to an end, etc.</p></blockquote>
<p>3.  <strong><a href="http://www.dbms2.com/2008/05/08/outsourced-data-marts/" >1010data</a> </strong>offers its data warehousing product by remote service only.  However, <a href="http://www.dbms2.com/2009/01/12/gartners-2008-data-warehouse-database-management-system-magic-quadrant-is-out/" >unlike Gartner</a> I&#8217;m not totally convinced 1010data should be regarded as comparable to DBMS vendors; perhaps it&#8217;s more like a SaaS business intelligence provider.</p>
<p><em>Edits:</em></p>
<ul>
<li><em>A comment below says Gerry Cohen wrote Nomad too.<br />
</em></li>
<li><em>Kognitio commented on Twitter that they actually use DaaS to mean Data warehouse As A Service.</em></li>
</ul>
]]></content:encoded>
			<wfw:commentRss>http://www.dbms2.com/2009/01/12/database-saas-gains-a-little-visibility/feed/</wfw:commentRss>
		<slash:comments>7</slash:comments>
		</item>
	</channel>
</rss>
