April 4, 2012

IBM DB2 10

Shortly before Tuesday’s launch of DB2 10, IBM’s Conor O’Mahony checked in for a relatively non-technical briefing.* More precisely, this is about DB2 for “distributed” systems, aka LUW (Linux/Unix/Windows); some of the features have already been in the mainframe version of DB2 for a while. IBM is graciously permitting me to post the associated DB2 10 announcement slide deck.

*I hope any errors in interpretation are minor.

Major aspects of DB2 10 include new or improved capabilities in the areas of:

Compression.
Analytic query performance.
Data ingest.
Multi-temperature data management.
Workload management.
Graph management/relationship analytics.
Time-travel, bitemporal features, and bitemporal time-travel.

Of course, there are various other enhancements too, including to security (fine-grained access control), Oracle compatibility, and DB2 pureScale. Everything except the pureScale part is also reflected in IBM InfoSphere Warehouse, which is a near-superset of DB2.*

*Also, the data ingest part isn’t in base DB2.

The most remarkable claims Conor made were in the area of compression. Previously, IBM claimed 2.2-3X compression as typical, with 7X as best case. But as is (approximately) illustrated by Slide 12, IBM now says 7X is typical, with 4-10X being a realistic range and 45X having been the best case to date. Apparently, the DB2 compression strategy is now:

Keep the old DB2 compression scheme, which is dictionary compression across the top 4096 values in a table or range partition. Notably, that compression …
… extends to indexes, temp space, and so on, as well as the data itself.
Add a similar page-level compression scheme. Other than saying it too was dictionary-based, Conor didn’t give details.
Have some automation determining which values are compressed table-wide and which are compressed at the page level.

Those numbers are pretty bold claims for dictionary compression, especially in a row-based system.* The two special features I can think of in IBM’s compression that might allow it to outdo other dictionary schemes are:

You can compress multiple columns at once. (The canonical example is different fields in an address.)
(If I understood Conor correctly) You can also compress substrings within a column, or across columns.

*Row-based vs. columnar doesn’t matter for table-wide dictionary compression, but it does for page-level; the more comparable values you have per page, the better your chances to compress.

IBM claims consistent 3X query performance improvements on a variety of (non-published) benchmarks, with occasional examples of much higher figures. If the compression claims are really true, they could explain much of the query speed-up right there. Beyond that, the associated feature list is on Slides 7 and 8. The feature Conor called out was pre-fetching of indexes, which makes good index organization less important (Slide 9), which hence means DBAs have to worry less about index maintenance.

Prior to DB2 10, it appears that data ingest was through a single core, and it required the core to be dedicated. Now data ingest is just one more task that can be parallelized, workload-managed, and so on. It would seem that the biggest relevance of this feature is when data is being streamed from a transactional system — which is of course what you want to do whenever practical, versus the batch-load alternative.*

*My first clue for that was the feature name “real-time data warehousing.”

IBM DB2 10 introduces the beginnings of multi-temperature data management. That is, you can have different ranges in the same range-partitioned table be on different classes of storage — solid-state, faster disks, slower disks, whatever.

DB2’s workload management as described by Conor sounds more primitive than what Tim Vincent told me about a year and a half ago. Probably it’s just a difference in emphasis or something. Anyhow, DB2 workload management:

Newly sets limits on CPU consumed by certain workloads, rather than just divvying up CPU resources.
Doesn’t manage I/O or RAM.
Newly works on its own, rather than relying on underlying operating systems.
Takes the “temperature” of data (or type of storage it’s on?) into account as part of workload prioritization.

IBM is introducing both time-travel and bitemporal capabilities, but we didn’t spend much — um — time on them. “Time-travel” means you can do queries on the state of the database as of some previous date. “Bitemporal” means data can have an effective dates — i.e., dates on which the fact recorded (e.g. insurance coverage) begins or stops to be true.

IBM is also introducing some graph data features, and is showing the good taste to use my term relationship analytics.* Mainly, this is SPARQL 1.0 support, implemented via a variety of relational tables. We’re planning a follow-up briefing for me to learn more. An internal benchmark — 3.5X speed-up — is memorialized on Slide 17.

*Contrary to my complaints of several years ago, I think the term relationship analytics — which I coined in 2005 — is finally becoming mainstream.

Categories: Data warehousing, Database compression, IBM and DB2, RDF and graphs, Solid-state memory, Workload management

Subscribe to our complete feed!

Comments

6 Responses to “IBM DB2 10”

Serge Rielau on April 5th, 2012 7:42 am

One small correction:
Index compression is page-level compression to begin with.
But the algorithms employed employed are different from DB2 10 data-page level compression.

For further Oracle compatibility details in DB2 10 I shamelessly point to sqltips4db2.com 🙂
Cheers
Serge
Dan on April 5th, 2012 8:54 am

Another small correction:

Continuous Data Ingest is available in one edition of DB2 that being DB2 Advanced Enterprise Server Edition (“AESE”). DB2 AESE adds additional tooling as well in Version 10.

Cheers,
Dan
Curt Monash on April 5th, 2012 1:33 pm

Thanks, Serge and Dan!
online form builder on July 17th, 2013 3:39 pm

Those that cannot apply for this kind of card or that would prefer a different solution could consider a
prepaid card that comes with a credit building element as an alternative.

You might get one or more benefits of outline designer along with it
is the ideal means to unleash the capacities. You wont get a true imitation of
your signature with this Android app, unless you can
cleverly manipulate the mechanics behind its
operation, but that is highly unlikely.
online forms on July 18th, 2013 3:38 am

All they need to do is to enroll with their
name, email, contact number and country and vemmabuilder will cater
to the particular country of the person. It’s easy to use a fake IP address on i – Phone and i – Pad, but you need to know what you need the fake IP for. You can make corrections directly instead of having to search mistake through the code, if something does not seem OK for you.
online form builder on August 7th, 2013 5:24 pm

However, there are several online interfaces available where one
needs to click on different types of options to send HTML code
in email or to generate HTML code. It’s easy to use a fake IP address on i – Phone and i – Pad, but you need to know what you need the fake IP for. If a picture is worth a thousand words then you can just image how much you will absorb by browsing this site.

Leave a Reply

Search our blogs and white papers

Monash Research blogs

DBMS 2 covers database management, analytics, and related technologies.
Text Technologies covers text mining, search, and social software.
Strategic Messaging analyzes marketing and messaging strategy.
The Monash Report examines technology and public policy issues.
Software Memories recounts the history of the software industry.

User consulting

Building a short list? Refining your strategic plan? We can help.

Vendor advisory

We tell vendors what's happening -- and, more important, what they should do about it.

Monash Research highlights

Learn about white papers, webcasts, and blog highlights, by RSS or email.

Links
- Monash Research
- White Papers
Admin
- Log in

IBM DB2 10

Comments

Search our blogs and white papers

Monash Research blogs

User consulting

Vendor advisory

Monash Research highlights

Recent posts

Categories

Date archives

Admin