July 1, 2008

The IRS data warehouse

According to a recent Eric Lai Computerworld story and a 2006 Sybase.com success story,

I can’t entirely reconcile those numbers, but in any case the database sounds plenty big.

Computerworld also said:

the research division also uses Microsoft Corp.’s SQL Server to store all of the metadata for the data warehouse and the rest of the agency. Managing and cleaning all of that metadata — 10,000 labels for 150 databases — is a huge task in itself,

Comments

2 Responses to “The IRS data warehouse”

  1. Neil Hepburn on July 3rd, 2008 2:08 pm

    Sybase IQ is a column-oriented database. This is why it can achieve such tremendous benefits in load times, query times, and compression ratios.

    SQL Server (and Oracle and DB2) are all row-oriented, as are most other mainstream RDBMSs.

    Since the focus of DBMSs is moving from transaction processing to analytics, we will likely see a shift towards column-oriented databases – I would argue that the row-oriented database is all but obsolete.

  2. Curt Monash on July 4th, 2008 2:49 am

    Neil,

    I see from your blog that you first learned about columnar database management systems in February. You’ve come to the right place to learn more about them!

    http://www.dbms2.com/category/database-theory-practice/columnar-database-management/
