One size mostly fits all

Database pioneer Michael Stonebraker has published a series of papers arguing that the traditional relational database engine is obsolete. His claim is that, for every database workload important today, there is a specialized database architecture that can dramatically outperform (by an order of magnitude or more) established “one size fits all” databases.

The latest paper in this series is The End of an Architectural Era (It's Time for a Complete Rewrite) (Michael Stonebraker, Samuel Madden, Daniel J. Abadi, Stavros Harizopoulos, Nabil Hachem, Pat Helland). This paper concentrates on transaction processing workloads. It presents a research database called H-Store, which outperforms a traditional RDBMS by 82 times on the TPC-C benchmark. H-Store has some features that may be familiar to some of my readers: It's an in-memory database, with the application logic in the same process as the database. Because it is an in-memory database, and because it targets OLTP workloads, transactions are very short-lived, and so it confines transactions to a single thread, avoiding the overhead of locking or any other form of concurrency control.

(Some aspects of H-Store are less than elegant. The transaction classification scheme used to support clustered operation seems rather ad-hoc. And the idea of handling conflicting transactions by requiring that “each execution site must wait a small period of time (meant to account for network delays) for transactions arriving from other initiators” (section 4.4) is positively scary.)

Other workloads are discussed in an earlier paper: One Size Fits All? ­ Part 2: Benchmarking Results (Michael Stonebraker, Chuck Bear, Uğur Çetintemel, Mitch Cherniack, Tingjian Ge, Nabil Hachem, Stavros Harizopoulos, John Lifter, Jennie Rogers, and Stan Zdonik). One particular workload discussed in that paper is data warehousing (i.e. reporting). For these applications, they propose column stores (database engines storing tables as multiple sequences of column values, rather than as a single sequence of row tuples). They demonstrate a performance advantage for a research database called C-Store, and its commercial descendant Vertica, in a telco call analysis application and a simplified variant of the TPC-H benchmark.

Despite these results, I doubt that there is an imminent threat to established “one size fits all” databases. The market is much more likely to stick with traditional databases and SQL whenever they provide adequate performance, complementing them with specialized database architectures only when there is a compelling need for the performance advantages that they can provide.

1 comment

ExpressCard

While browsing the Inquirer today I discovered the reason why I can only fit PC Card and CardBus cards into one of the two slots on my year-old laptop: One of the slots is actually an ExpressCard slot. I'd heard of this format, but hadn't realized that ExpressCard is incompatible with the PC Card format, and some of the cards are a wierd L-shape. But externally, ExpressCard slots look identical to PC Card slots, and a PC Card can fit halfway into an ExpressCard slot before it gets stuck. What was that committee smoking?

The way to tell the slots apart is that the little eject buttons have a tiny “O” or an ”X“ symbol on them to indicate the slot type. I noticed these symbols when I first got the laptop, but I though they were just decorative. Now I know the secret.

Enabling MP3 support in XMMS on Fedora 7

Recently I received some MP3 files. Fedora doesn't include MP3 support, due the patents surrounding MP3. This wasn't a problem for me until now, since my music collection is in FLAC and Ogg Vorbis.

But livna makes it easy to fix this:

$ wget http://rpm.livna.org/fedora/7/`uname -i`/livna-release-7-2.noarch.rpm
$ rpm -i livna-release-7-2.noarch.rpm
$ yum install xmms-mp3

(That's for xmms, but livna also has packages to provide MP3 support for gstreamer etc.)

2 comments

Dear music industry

I don't want to discourage your progress with DRM-free online music stores. They are interesting to me in a way that DRM-encumbered services are not. But while you only offer MP3 files, and not files using an open lossless format, it just doesn't work for me.

(Radiohead's experiment excluded. I can't explain why I am willing to pay 40 pounds for their new album in a nice box two months in advance of receiving it. But being able to download it while I wait for it to arrive is a bonus.)

4 comments