Re: VLDB Features

From: Josh Berkus <josh(at)agliodbs(dot)com>
To: pgsql-hackers(at)postgresql(dot)org
Subject: Re: VLDB Features
Date: 2007-12-20 20:04:26
Message-ID: 200712201204.26940.josh@agliodbs.com
Views: Raw Message | Whole Thread | Download mbox | Resend email
Thread:
Lists: pgsql-hackers

Tom,

> Sure ... but you'll find that it's not large enough to be useful.
> Once you remove all the interesting consistency checks such as
> unique indexes and foreign keys, the COPY will tend to go through
> just fine, and then you're still stuck trying to weed out bad data
> without very good tools for it. The only errors we could really
> separate out without subtransaction fencing are extremely trivial
> ones like too many or too few fields on a line ... which can be
> caught with a sed script.

Speaking as someone who did a LOT of DW load design only a couple years ago,
I'll say that the "special case" of no triggers, no constraint checks except
length, and type-safety check actually constitutes about 50% of DW bulk
loading. The only exception to that is unique indexes, which would normally
be included and would be the difficult thing.

Also, "special case bulk loading" would in fact give users of other types of
applications a lot more flexibility -- they could always load into a holding
table just to clean up the type safety issues and then merge into the real
table.

So I don't agree that the "load into new partition without dependancies" is
too much of a special case to be worth pursuing. It might be a bad idea for
other reasons, but not because it's too obscure.

--Josh

In response to

Browse pgsql-hackers by date

  From Date Subject
Next Message Magnus Hagander 2007-12-20 20:29:34 Re: pgwin32_open returning EINVAL
Previous Message Pedro Belmino 2007-12-20 19:55:51 Problem